Archive
Microsoft's AI Strategy Deconstructed - From Energy to Tokens
"The Big Pause", AI Tokens Factory Economics Stack, OpenAI, Neocloud Renting, GitHub Copilot Woes, MAI and Maia Floundering
Nov 12 • Dylan Patel, Jeremie Eliahou Ontiveros, Myron Xie, Jordan Nanos, AJ Kourabi, Wei Zhou, Daniel Nishball, Ivan Chiam, and Clara Ee
ClusterMAX™ 2.0: The Industry Standard GPU Cloud Rating System
95% Coverage By Volume, 84 Providers Rated, 209 Providers Tracked, 140+ Customers Surveyed, 46,000 Words For Your Enjoyment
Nov 6 • Jordan Nanos, Daniel Nishball, Michelle Shen, Cheang Kang Wen, Wei Zhou, Jeremie Eliahou Ontiveros, and Dylan Patel
October 2025
How to Kill 2 Monopolies with 1 Tool
Substrate X-Ray Lithography, a New American Foundry, $10k Logic Wafers
Oct 29 • Dylan Patel, Jeff Koch, Gerald Wong, and Andrew Wagner
Nanoimprint Lithography: Stop Saying It Will Replace EUV
NIL basics, why it won’t replace EUV, details of Canon’s tool, possible applications
Oct 26 • Jeff Koch and Dylan Patel
Quadruped State of The Market - Unitree, Boston Dynamics, ANYbotics, DEEP Robotics, and The Rising Application Ecosystem
Quadrupeds' Superior Scalability, Unitree's Incredible Production, Third Party Providers Introduce New Dynamics, Novel Applications And Opportunities…
Oct 20 • Reyk Knuhtsen, Dylan Patel, Niko Ciminelli, Joe Ryu, Jeremie Eliahou Ontiveros, and Robert Ghilduta
InferenceMAX™: Open Source Inference Benchmarking
NVIDIA GB200 NVL72, AMD MI355X, Throughput Token per GPU, Latency Tok/s/user, Perf per Dollar, Tokens per Provisioned Megawatt, DeepSeek R1 670B, GPTOSS…
Oct 9 • Kimbo Chen, Dylan Patel, Daniel Nishball, Cam Quilici, and Cheang Kang Wen
September 2025
xAI's Colossus 2 - First Gigawatt Datacenter In The World, Unique RL Methodology, Capital Raise
On Site Turbines, Mississippi Expansion, Solaris Energy, Can xAI afford it?, Middle East Funding, Tesla, Talent Exodus, API revenue, Consumer Growth, RL…
Sep 16 • Jeremie Eliahou Ontiveros, Dylan Patel, Wei Zhou, AJ Kourabi, and Maya Barkin
Another Giant Leap: The Rubin CPX Specialized Accelerator & Rack
New Prefill Specialized GPU, Rack Architecture, BOM, Disaggregated PD, Higher Perf per TCO, Lower TCO, GDDR7 & HBM Market Trends
Sep 10 • Dylan Patel, Daniel Nishball, Kimbo Chen, Wega Chu, Ivan Chiam, and Cheang Kang Wen
Huawei Ascend Production Ramp: Die Banks, TSMC Continued Production, HBM is The Bottleneck
H20 Shipments, Blackwell B30A, Bottlenecks to Chinese Chip Production, Export Controls, CXMT, SMIC, Cambricon
Sep 8 • Dylan Patel, AJ Kourabi, Myron Xie, and Jeff Koch
Amazon’s AI Resurgence: AWS & Anthropic's Multi-Gigawatt Trainium Expansion
Anthropic multi-gigawatt clusters, Trainium ramp, best TCO per memory bandwidth, system-level roadmap, Bedrock and internal models
Sep 3 • Jeremie Eliahou Ontiveros, Dylan Patel, AJ Kourabi, and Myron Xie
August 2025
H100 vs GB200 NVL72 Training Benchmarks - Power, TCO, and Reliability Analysis, Software Improvement Over Time
Joules per Token, TCO Per Million Tokens, MFU, Tokens Per US Annual Household Energy Usage, DeepSeek 670B, GB200 Unreliability, Backplane Downtime
Aug 20 • Dylan Patel and Daniel Nishball
GPT-5 Set the Stage for Ad Monetization and the SuperApp
How ChatGPT will monetize free users, Router is the Release, AIs will serve Ads, Google's moat eroded?, The shift of purchasing intent queries
Aug 13 • Doug, Dylan Patel, Wei Zhou, and AJ Kourabi