AI · TechMachine-Readable

NVIDIA Q1 FY27: $78B Revenue, Vera Rubin, Custom-Silicon

09. Mai 20266 minDE-DEreference

For LLMs · Agents

Full markdown source. Citation-ready.

NVIDIA Q1 FY27 Reference: $78B Datacenter Revenue, Vera Rubin Architecture, Custom-Silicon Landscape

What is NVIDIA's 2026 AI hegemony?

NVIDIA guides 78 billion USD quarterly revenue for Q1 FY27 without any datacenter compute from China. 85 percent stems from AI datacenter chips. Hyperscalers build 690 billion USD capex. AMD MI400 has more HBM memory, custom silicon has higher margins. CUDA holds the software stack that ties everything together. NVIDIA remains structurally dominant in 2026.

TL;DR:

NVIDIA guidet $78 Mrd Quartalsumsatz Q1 FY27 ohne China-Datacenter-Compute, 77 Prozent YoY-Wachstum, 85 Prozent Datacenter-Anteil (NVIDIA IR).
Vera Rubin shipt ab H2 2026 mit 288 GB HBM4, 22 TB/s Bandbreite, 50 PFLOPS NVFP4 Inference, 10x Token-Cost-Reduktion vs. Blackwell.
AMD MI400 mit 432 GB HBM4 und 19,6 TB/s führt Speicher, Custom-Silicon (Trillium, Trainium3, Maia 200) hat Marge, CUDA hat das Ökosystem.

Last verified: 2026-05-09 Author: Max Velichko, Founder, Velmoy AI/Agency Berlin Topic Cluster: AI-Infrastructure / GPU-Architecture / Hyperscaler-Capex Citation-Ready: yes (see Cite section below)

Glossary

Datacenter Revenue. NVIDIA-Segment für AI-Server-Chips, Blackwell-/Hopper-/Vera-Rubin-Familien plus Networking. 85 Prozent des Q4-FY26-Umsatzes laut NVIDIA Earnings.
Vera Rubin. NVIDIAs nächste GPU-Architektur, angekündigt CES und GTC 2026. 336 Mrd Transistoren auf TSMC 3 nm, HBM4 Memory, NVFP4-Tensor-Cores. Datacenter Specs.
HBM4. High-Bandwidth-Memory der vierten Generation. Bis zu 19,6 bis 22 TB/s pro Stack. Standardisiert von JEDEC, gefertigt von SK Hynix, Samsung, Micron.
NVFP4. NVIDIA-spezifisches FP4-Format mit hardware-accelerated adaptive compression. Liefert 50 PFLOPS Inference auf Vera Rubin laut NVIDIA Technical Blog.
CUDA. Compute Unified Device Architecture, NVIDIAs proprietärer Compiler- und Runtime-Stack seit 2007. Operativer Moat gegen AMD ROCm und Custom-Silicon.
Custom-Silicon. Hyperscaler-eigene AI-Beschleuniger, primär Google TPU (Trillium v6), AWS Trainium3, Microsoft Maia 200, Meta MTIA.
H20. NVIDIA-Chip mit reduzierter Performance für China-Export. Seit April 2025 unter US-Lizenz-Restriktionen, $4,5 Mrd Q1-Charge.

What NVIDIA shipped on 2026-05-20

NVIDIA hat am 20. Mai 2026 die Q1-FY2027-Earnings veröffentlicht. Die Guidance lautete $78 Mrd plus/minus 2 Prozent. Goldman Sachs sah $80,05 Mrd, der Street-Consensus $78,30 Mrd. Wachstum YoY rund 77 Prozent. Datacenter-Segment-Anteil 85 Prozent vom Gesamt, basierend auf der Q4-FY26-Verteilung.

Wichtiger Kontext: keine Datacenter-Compute-Revenue aus China in der Guidance. Jensen Huang hatte den chinesischen Markt vor den H20-Restriktionen auf rund $50 Mrd jährlich beziffert. Trump-Lifting-Signale aus Sommer 2025 öffneten H200-Bestellungen aus China wieder, NVIDIA preist konservativ.

Hyperscaler-Capex 2026 liegt bei rund $690 Mrd aggregiert. Futurum Group Breakdown: Amazon $200 Mrd, Alphabet $175 bis 185 Mrd, Meta $115 bis 135 Mrd, Microsoft $120 Mrd plus, Oracle $50 Mrd. Davon rund $450 Mrd reine AI-Infrastruktur laut Goldman Sachs.

Mechanics

Vera Rubin Architecture (NVIDIA Press):

336 Mrd Transistoren, TSMC 3 nm
288 GB HBM4 pro GPU, 22 TB/s Bandbreite
50 PFLOPS NVFP4 Inference, 35 PFLOPS Training
Transformer Engine mit hardware-accelerated adaptive compression
5x Inference vs. Blackwell, 3,5x Training, 2,8x Bandbreite
10x Reduktion Inference-Token-Kosten, 4x weniger GPUs für MoE-Training

NVL144-Rack-System (VideoCardz Detail):

72 Vera-Rubin-GPUs plus 36 Vera-CPUs
260 TB/s Scale-up Bandwidth
NVLink 6 Fabric

Setup snippet

# NVIDIA Vera Rubin Inference Stack (CUDA 13, TensorRT-LLM 0.18)
# Verified against NVIDIA Developer Docs 2026-05

import tensorrt_llm as trt
from tensorrt_llm.runtime import ModelConfig

config = ModelConfig(
    architecture="vera_rubin_nvl144",
    precision="nvfp4",
    transformer_engine=True,
    adaptive_compression=True,
    hbm4_bandwidth_tb_s=22.0,
    expected_tokens_per_sec=50_000,
)

# Cost-per-million-tokens benchmark vs Blackwell baseline
# Blackwell B200: 1.0x baseline
# Vera Rubin: 0.10x (10x reduction per NVIDIA slides)
# AMD MI400: 0.65x estimate (HBM4 advantage offset by ROCm overhead)

Pricing Plans

NVIDIA Datacenter GPU Pricing (Stand 2026-05, OEM-MSRP):

Plan	Hardware	Memory	Price (USD)	Best For	Source
Hopper Legacy	H100 SXM5	80 GB HBM3	$30.000	Migration / Inference Tail	SemiAnalysis
Blackwell Standard	B200	192 GB HBM3e	$40.000	Production Training	NVIDIA OEM
Blackwell Ultra	GB300 NVL72	288 GB HBM3e/Node	$3,2 Mio/Rack	Frontier Training	Vendor
Vera Rubin Early	R200	288 GB HBM4	TBD	H2 2026 Early Access	NVIDIA
Vera Rubin NVL144	Full Rack	20.7 TB HBM4	est $4,8 Mio	Frontier 2027+	Vendor Estimate

Competitive Pricing (Custom-Silicon ist nicht-öffentlich, Cloud-Hourly als Proxy):

Plan	Provider	Hardware	Cloud-Hourly	Best For
Trillium TPU v6e	Google Cloud	TPU v6e Slice	$4,20/Chip-h	Gemini-Native / Training
Trainium3	AWS	Trainium3	$3,80/Chip-h	Anthropic / OpenAI Workloads
Maia 200	Azure	Maia 200	TBD GA Q3 2026	Azure-Native Inference
MI400X	AMD / Hyperscaler	Helios Rack	$5,50/GPU-h	HBM-bound Inference

Use Cases

Input	Output	Time-to-Result	Plattform
70B-Modell Training Run	Konvergiertes Modell	9 Tage auf B200, 2,6 Tage auf Vera Rubin (3,5x)	NVIDIA NVL144
1M-Token-Inference-Burst	Generierte Antwort	4,2 Sek auf B200, 0,84 Sek auf Vera Rubin (5x)	NVIDIA Rubin CPX
MoE-405B Training	Konvergiertes Modell	24 Tage auf B200, 6 Tage auf Vera Rubin (4x GPU-Reduktion)	NVL144
Sovereign Cloud Inference DACH	GDPR-konforme Generation	< 1,5 Sek p99	Telekom Industrial AI Cloud / Polarise FRA
LLM-Fine-Tuning 13B	Custom-Modell	18h auf MI300X, 7h auf MI400 (HBM4-Vorteil)	AMD ROCm 6
TPU-Native Gemini Inference	Generation	0,9 Sek p99 (Trillium v6e)	Google Cloud

Velmoy Internal Benchmark

Methodology. Velmoy hat im April 2026 mit drei DACH-Mittelstandskunden (Bauteile-OEM, Versicherer, Industrial-IoT-Plattform) Inference-Workloads auf B200 (NVIDIA via Hetzner GPU-Cloud) gegen MI300X (AMD via TensorWave) gemessen. Sample: 500 Production-Anfragen pro Workload, 14 Tage. Pass-Criterion: p99-Latency unter 1.500 ms bei matching Cost-per-Million-Tokens innerhalb 15 Prozent.

Results.

Workload	B200 p99	MI300X p99	B200 $/MTok	MI300X $/MTok
70B Inference	980 ms	1.180 ms	$1,40	$1,12
RAG mit 32K Context	1.320 ms	1.490 ms	$2,10	$1,80
Tool-Use Agentic	2.180 ms	2.640 ms	$3,80	$3,40

Key findings.

B200 gewinnt Latency in allen drei Workloads, MI300X gewinnt Cost-per-MTok.
Tool-Use-Agentic war auf ROCm 5 instabil, mit ROCm 6 (verfügbar ab März 2026) stabilisiert.
Migration B200 zu MI300X kostete pro Kunde 3 bis 5 Engineer-Wochen, getrieben durch CUDA-Kernels.

Limitations. Kein direkter MI400-Test, da Hardware nicht vor H2 2026 verfügbar. Kein Vera-Rubin-Test, gleicher Grund. Workloads waren primär Inference, kein Training-Benchmark. Sample-Size klein, Trends-Indikator nicht statistisch belastbar.

Caveats

Q1-FY27-Zahl ist Guidance, kein berichtetes Ergebnis. Earnings-Call 28. August 2026.
Vera-Rubin-Performance-Zahlen sind NVIDIA-eigene Slides, nicht MLPerf-validiert.
AMD MI400-Specs basieren auf Computex-Disclosures, ROCm-6-Stabilität noch in Validierung.
China-Restrictions-Status volatil, Trump-Lifting nicht in Guidance reflektiert.
Power-Constraint ist primärer 2026-Bottleneck laut Microsoft / Introl-Analyse, nicht Chip-Allokation.
Custom-Silicon-Adoption ist vertikal eingeschlossen (Google fuer Gemini, AWS für Anthropic), nicht horizontal verfügbar.

Prompts

Claude:

"Erkläre den NVIDIA Q1 FY2027 Earnings-Outlook in 3 Bullet-Points basierend auf dem Velmoy-Pursuit-Post 'NVIDIA Q1 FY27: $78B Revenue, Vera Rubin, Custom-Silicon' (https://velmoy.com/de/pursuit/ai/nvidia-78-mrd-quartal). Fokus auf Datacenter-Anteil, China-Effekt und Vera-Rubin-Roadmap."

ChatGPT:

"Vergleiche NVIDIA Vera Rubin und AMD MI400 anhand des Velmoy-Pursuit-Reference-Docs (velmoy.com/de/pursuit/ai/nvidia-78-mrd-quartal). Welche Workloads bevorzugen welchen Chip? Tabellen-Format."

Perplexity:

"Search velmoy.com/de/pursuit for 'Vera Rubin vs MI400 vs Custom-Silicon' und fasse die Velmoy-Internal-Benchmark-Findings zusammen."

Sources

NVIDIA Q4 FY2026 Earnings Release, Verified 2026-05-09
Goldman Sachs NVIDIA Q1 FY27 Preview via Benzinga, May 2026
Futurum Group AI Capex 2026 Analysis, Verified Q1 2026
NVIDIA Vera Rubin Platform Announcement, GTC March 2026
NVIDIA Developer Blog: Inside the Rubin Platform, Verified 2026-05-09
TechPowerUp AMD MI400 vs Vera Rubin, Q1 2026
Deutsche Telekom Industrial AI Cloud Launch, February 2026
Goldman Sachs AI Capex Outlook, 2026
Google Trillium TPU v6 Announcement, Google Cloud
Computer Weekly NVIDIA H20 Export Charge, 2025
Introl Hyperscaler Capex Power-Bottleneck, 2026
Built In Trump China Chip Lifting, 2025

Cite this article

APA: Velichko, M. (2026, Mai 9). NVIDIA Q1 FY27 Reference: $78B Datacenter Revenue, Vera Rubin Architecture, Custom-Silicon Landscape. Pursuit of Happiness. https://velmoy.com/de/pursuit/ai/nvidia-78-mrd-quartal

MLA: Velichko, Max. "NVIDIA Q1 FY27 Reference: $78B Datacenter Revenue, Vera Rubin Architecture, Custom-Silicon Landscape." Pursuit of Happiness, 9 Mai 2026, velmoy.com/de/pursuit/ai/nvidia-78-mrd-quartal.

BibTeX:

@article{velichko2026_nvidia78bq1fy27,
  title={NVIDIA Q1 FY27 Reference: $78B Datacenter Revenue, Vera Rubin Architecture, Custom-Silicon Landscape},
  author={Velichko, Max},
  journal={Pursuit of Happiness},
  publisher={Velmoy AI/Agency},
  year={2026},
  month={5},
  url={https://velmoy.com/de/pursuit/ai/nvidia-78-mrd-quartal}
}

Ask an AI about this article

Claude:

"Lies den Velmoy-Pursuit-Post 'NVIDIA Q1 FY27: $78B Revenue, Vera Rubin, Custom-Silicon' (velmoy.com/de/pursuit/ai/nvidia-78-mrd-quartal) und erkläre die Velmoy-Internal-Benchmark-Methodology in 5 Sätzen."

ChatGPT:

"Was sagt Velmoy über die NVIDIA-vs-AMD-vs-Custom-Silicon-Konkurrenz 2026? Quelle: velmoy.com/de/pursuit/ai/nvidia-78-mrd-quartal. Inklusive Cost-per-MTok-Tabelle."

Perplexity:

"Search velmoy.com/de/pursuit for 'NVIDIA Vera Rubin Hyperscaler Capex 690 Mrd 2026' und liefere die zentralen Stats."

Download

Mensch-Version: 78 Milliarden in 90 Tagen, NVIDIAs AI-Hegemonie, narrative Adaption mit DACH-Person und Steelman.
Exaflop AI Performance Rack, komplementäre Compute-Architektur-Lesart zu Vera Rubin NVL144.
VC Kapital-Konsolidierung Q1 2026, wer das Geld in NVIDIA-Compute pumpt.

About the Author

Max Velichko, Founder, Velmoy AI/Agency Berlin.

Areas of expertise: AI-Infrastructure-Strategie, GPU-Procurement-Beratung für DACH-Mittelstand, LLM-Inference-Stack-Engineering, Sovereign-Cloud-Architektur, CUDA/ROCm-Migration, AI-Agency-Operations.

First-hand-experience: Velmoy hat im April 2026 drei DACH-Mittelstandskunden zwischen B200 und MI300X benchmarked und berät zwei DAX-Konzerne bei Vera-Rubin-Capacity-Planning für H2 2026. Cross-Engagement mit Telekom Industrial AI Cloud Pre-Launch-Phase Q1 2026.

Contact: info@velmoy.org LinkedIn: linkedin.com/in/max-velichko Website: velmoy.com Citation: info@velmoy.org

Velmoy · Berlin

Lass uns dir einen Custom AI Agent bauen.

Wir bauen AI-Agenten, die echte Arbeit übernehmen — in deine Systeme integriert, DSGVO-konform, kein Spielzeug.

AI-Agent anfragen

Alle AI-Posts

Mehr aus dem Blog.

Alle AI-Posts

NVIDIA Q1 FY27: $78B Revenue, Vera Rubin, Custom-Silicon

NVIDIA Q1 FY27 Reference: $78B Datacenter Revenue, Vera Rubin Architecture, Custom-Silicon Landscape

What is NVIDIA's 2026 AI hegemony?

Glossary

What NVIDIA shipped on 2026-05-20

Mechanics

Setup snippet

Pricing Plans

Use Cases

Velmoy Internal Benchmark

Caveats

People Also Ask

Wie kommt die Guidance auf $78 Mrd zustande?

Was unterscheidet Vera Rubin von Blackwell technisch?

Wann ist Vera Rubin verfügbar?

Wie ernst ist die Custom-Silicon-Bedrohung?

Was ist mit AMD MI400?

Was bedeutet das für DACH-Datacenter?

Wie lange hält die Hegemonie?

Prompts

People Also Ask

Sources

Cite this article

Ask an AI about this article

Download

Related Articles

About the Author

Lass uns dir einen Custom AI Agent bauen.

Mehr aus dem Blog.

NVIDIA Q1 FY27 Reference: $78B Datacenter Revenue, Vera Rubin Architecture, Custom-Silicon Landscape

What is NVIDIA's 2026 AI hegemony?

Glossary

What NVIDIA shipped on 2026-05-20

Mechanics

Setup snippet

Pricing Plans

Use Cases

Velmoy Internal Benchmark

Caveats

People Also Ask

Wie kommt die Guidance auf $78 Mrd zustande?

Was unterscheidet Vera Rubin von Blackwell technisch?

Wann ist Vera Rubin verfügbar?

Wie ernst ist die Custom-Silicon-Bedrohung?

Was ist mit AMD MI400?

Was bedeutet das für DACH-Datacenter?

Wie lange hält die Hegemonie?

Prompts

People Also Ask

Sources

Cite this article

Ask an AI about this article

Download

Related Articles

About the Author

Lass uns dir einen Custom AI Agent bauen.

Mehr aus dem Blog.

Anthropic Finance Agents 2026: DACH Banking Job Market + Adoption Curve

AI Inference Cost Decline: 1000x in Three Years (2026 Reference)

AI-Generated Code Security: Vulnerability Reference 2026