In November 2006, Jensen Huang made a bet most of Wall Street ignored. Nvidia launched CUDA — a programming toolkit that let developers use GPUs for general computation, not just graphics. For years it looked like a money pit. Nvidia poured over $10 billion into CUDA R&D before it turned profitable, subsidizing free toolkits, university programs, and developer conferences while rivals focused on cheaper chips. Then deep learning exploded. When Alex Krizhevsky used two Nvidia GTX 580s to win ImageNet in 2012, every AI researcher noticed. By 2015, both TensorFlow and Theano were CUDA-first. By 2018, PyTorch — which would become the dominant ML framework — was inseparable from CUDA. Each new framework attracted more developers, who wrote more libraries, which attracted more researchers, wh...
Popular framing: Nvidia got lucky that deep learning happened to need GPUs.
Structural analysis: A 15-year subsidy of free toolkits and university programs seeded a developer base, which attracted frameworks, which attracted researchers, which attracted more developers — network effects compounding on Metcalfe's-law math. The moat is not the chip; it is the switching cost embedded in 4 million people who already think in CUDA.
The visionary-leader framing obscures the structural dynamics that make the moat durable and nearly impossible to replicate intentionally. Understanding CUDA as a network-effect compounding engine rather than a product explains why AMD's technically competitive hardware consistently fails to capture market share — they are not losing a chip race, they are losing an ecosystem race governed by different rules. This distinction matters for any investor, regulator, or competitor trying to understand whether the moat can be challenged.