The $4B Bet on Smarter Code

Qualcomm paid $3.9 billion for a company most developers have never heard of.

The chipmaker agreed to acquire Modular, an AI software firm, in an all-stock deal announced Wednesday , according to MarketScreener. Modular builds software that allows AI models to run across different hardware architectures — including chips from multiple vendors — without requiring developers to rewrite code for each processor , the company said. The timing is not subtle. The acquisition moves Qualcomm into territory long dominated by Nvidia's CUDA platform, which has cemented developer loyalty to Nvidia's hardware , Reuters reported. For Qualcomm, buying Modular is a direct challenge to that lock-in. The acquisition gives Qualcomm a foothold in the AI software segment dominated by Nvidia's CUDA , according to NewKerala.

The deal matters because it signals where the AI infrastructure fight is heading: not just faster chips, but the software layer that determines which chips developers actually use. Qualcomm CEO Cristiano Amon called the industry's shift toward "disaggregated, multi-vendor architectures" a moment that demands "a more open and modern software foundation" , per Investing.com. Translation: Nvidia's moat is software, and Qualcomm just bought a shovel.

Can You Make AI Cheaper Without Making It Worse?

A different bet landed the same week. AI memory startup Engram raised $98 million from investors including General Catalyst, Kleiner Perkins, Sequoia, and OpenAI co-founder Andrej Karpathy , CNBC reported. The round valued the eight-month-old company at $600 million , according to Calcalist. Engram has 13 employees.

The pitch is straightforward. Engram claims its models can match or outperform frontier labs using up to 100 times fewer tokens , CNBC noted. Tokens are the billing unit for AI queries — the more you use, the more you pay. Engram counts Microsoft, Notion, and legal AI startup Harvey among its customers , the company said. For context, new and more sophisticated AI models are proving pricier than previous iterations, challenging the conventional view that greater scale would lead to lower costs , per CNBC.

Engram trains models to study an organization's world and anticipate its questions in advance, forming a compact, continuously improving memory unique to each customer, with results that match or outperform frontier models using up to 100x fewer tokens , the company said in a statement. The neuroscience metaphor is deliberate. CEO Dan Biderman said his fascination with memory started in childhood, when his grandmother began losing her memory and he would try to prompt her to remember small details — an experience that shaped his academic path , according to Tech Startups.

The cost problem is real. Kleiner partner Leigh Marie Braswell described the market as facing an "explosion of data, explosion of cost" , CNBC reported. Engram's answer is to stop treating every query like the first one. If the AI already knows your organization's workflows, it doesn't need to reread the same documents every time. That compression is where the 100x claim lives.

Why GitHub Rewired Copilot's Pricing

While startups chase efficiency, GitHub made a structural change that reshapes how millions of developers pay for AI. All GitHub Copilot plans transitioned to usage-based billing on June 1, 2026, replacing premium request counts with monthly allotments of GitHub AI Credits and the option for paid plans to purchase additional usage , the company announced. GitHub said the change reflects that Copilot now powers far more complex, agentic workflows that consume far more compute, and is designed to deliver a more sustainable and reliable product experience by aligning pricing to actual usage and costs , according to a GitHub community discussion.

The shift is more than accounting. GitHub Copilot billing moved to usage-based billing with GitHub AI Credits on June 1, 2026, with subscribers receiving a monthly included allocation and usage beyond that billed at the end of the month , Developers Digest reported. Copilot code review moved to an agentic architecture that runs on GitHub Actions, and starting June 1, reviewing a pull request with Copilot counts against included Actions minutes , GitHub noted.

For developers on individual plans, the math changed overnight. The billing change fundamentally alters the economics of using Copilot: developers who previously used it freely now need to monitor consumption, and teams need to forecast AI costs the same way they forecast cloud compute, with heavy users potentially seeing effective costs increase significantly , according to ClickUp's analysis. GitHub reopened sign-ups for Copilot Student, Pro, Pro+, and Max plans gradually starting June 16, 2026, after pausing new individual subscriptions , the company said.

The broader pattern is clear: AI coding tools are moving from flat subscriptions to metered usage. That aligns incentives — vendors get paid for what you actually use — but it also means developers need to think about token budgets the way they think about cloud spend.

The Open-Source Countermove

Not everyone is buying into proprietary stacks. OpenCode entered at #1 in LogRocket's June 2026 AI dev tool power rankings as the most significant shift in how developers work with AI coding agents, with 160K+ GitHub stars and 7.5M monthly active developers making it the most-adopted open-source coding agent ever built , LogRocket reported. OpenCode offers model-agnostic access to 75+ providers including Claude, GPT, Gemini, DeepSeek, and local models via Ollama, LSP integration that feeds compiler diagnostics back to the model, background subagents, a Scout agent for external research, and true air-gapped deployment for regulated industries , the analysis noted.

The trade-off is speed. OpenCode is 78% slower than Claude Code on the same model in Builder.io tests, but generates more thorough output with 21 extra tests in head-to-head comparisons , per LogRocket. The BYOK pricing model means your cost is your provider's cost, not OpenCode's , the report added. For teams managing AI budgets across multiple vendors or operating in regulated environments, that control matters more than latency.

The developer tools landscape is fragmenting along a familiar axis: proprietary polish versus open flexibility. Claude Code awareness among developers jumped from 31% in mid-2025 to 57% by January 2026, while workplace adoption grew roughly 6x in the same period , according to a Medium analysis citing The Pragmatic Engineer. But 46% of developers actively distrust the accuracy of AI output, while only 3% say they "highly trust" it, with the most common frustration — reported by 66% of respondents — being that AI produces solutions that are almost right , the same survey found.

That trust gap is shaping product strategy. GitHub improved Copilot efficiency with smarter prompt caching, deferred tool loading, and Auto model selection that routes tasks to the right model in VS Code and beyond, expanding Auto with task intent across more Copilot surfaces to help credits go further , GitHub's changelog noted. The message: we're making the same budget stretch further by being smarter about which model handles which task.

What Changed This Week

Qualcomm spent $3.9 billion to challenge Nvidia's software moat. An eight-month-old memory startup raised $98 million at a $600 million valuation by promising to cut token costs by 100x. GitHub moved every Copilot user to metered billing and reopened individual sign-ups after a two-week pause. The common thread: the AI developer tools market is consolidating around cost, control, and who owns the stack between the model and the code.

What to Watch

Qualcomm's Modular acquisition is expected to close in the second half of 2026 , MarketScreener reported — watch for integration details and whether Qualcomm can convert hardware customers into software users. Engram's Microsoft partnership will test whether enterprise memory layers can actually deliver the promised cost savings at scale. And GitHub's usage-based billing will produce its first full billing cycle in early July, giving the market real data on how developer behavior changes when every suggestion has a price.

The $4B Bet on Smarter Code

Can You Make AI Cheaper Without Making It Worse?

Why GitHub Rewired Copilot's Pricing

The Open-Source Countermove

What Changed This Week

What to Watch

More from Stake & Paper

Mining claims intelligence — from query to report, in minutes.

The $4B Bet on Smarter Code

Can You Make AI Cheaper Without Making It Worse?

Why GitHub Rewired Copilot's Pricing

The Open-Source Countermove

What Changed This Week

What to Watch

Keep Reading

Nuclear Startup X-energy Raises $1 Billion as Energy Markets Navigate Iran War Turbulence

When the Chips Are Down

The Coding Assistant Wars Heat Up

More from Stake & Paper

Mining claims intelligence — from query to report, in minutes.

One morning brief. The whole energy sector.