Microsoft and NVIDIA Unveil RTX Spark: 1 Petaflop AI Chip for Windows PCs

Microsoft and NVIDIA have unveiled RTX Spark, a new chip delivering 1 petaflop of AI performance for Windows PCs. The unified memory architecture enables local AI workloads previously requiring cloud compute, with devices launching this fall from major manufacturers.

By Inside AI June 22, 2026
AI neural network visualization

June 22, 2026, (Inside AI) — Microsoft and NVIDIA have revealed RTX Spark, a new system-on-chip designed to power what they call the most capable thin-and-light Windows PCs ever built. The chip was announced at NVIDIA GTC and packs 1 petaflop of AI performance, up to 6,144 Blackwell RTX cores, 20 Arm-based CPU cores, and up to 128GB of unified memory into a single silicon package.

The architecture merges GPU, NPU, and CPU functions to handle AI-native computing locally. RTX Spark targets developers, creators, and power users who need to run advanced AI workloads, graphically demanding creative applications, and agent-based tasks without cloud reliance.

Microsoft collaborated directly with NVIDIA to optimize Windows for the chip's heterogeneous design. This includes workload profile scheduling that distributes tasks across all 20 cores more efficiently. The Microsoft Power and Thermal Framework also maximizes performance-per-watt under sustained loads.

Unified Memory Unlocks On-Device AI

The unified memory architecture marks a leap for local AI. Systems with 128GB allow developers to load large language models entirely on-device. They can run complex rendering projects and build agentic workflows that previously required cloud compute.

Microsoft confirmed that NVIDIA will bring NVIDIA OpenShell to Windows. Hermes Agent and OpenClaw will integrate new Windows security and containment primitives for safe, local agent execution. This move could reshape how developers approach sensitive AI tasks.

At launch, the app ecosystem includes Blender, DaVinci Resolve, Photoshop, Premiere, CapCut, GitHub Copilot, Claude Code, Cursor, ComfyUI, and CUDA-accelerated PyTorch. Riot Games confirmed League of Legends and VALORANT will run on RTX Spark. PUBG: Battlegrounds, Alan Wake 2, and Naraka: Bladepoint also confirmed compatibility.

From Laptops to Enterprise Supercomputers

RTX Spark-powered Copilot+ PCs will launch this fall from Microsoft Surface, ASUS, Dell, HP, Lenovo, and MSI. Microsoft also outlined a longer-term roadmap scaling Windows from RTX Spark laptops up to the NVIDIA DGX Station for Windows. That system will use the GB300 Grace Blackwell Ultra Desktop Superchip, which the company frames as putting a trillion-parameter AI supercomputer on every enterprise desk.

Industry observers note the push toward local AI compute challenges the cloud-centric model. By embedding powerful AI hardware directly into client devices, Microsoft and NVIDIA could reduce latency, improve privacy, and lower operational costs for developers. However, questions remain about battery life, thermal constraints, and real-world performance of such a dense chip in thin-and-light form factors.

The RTX Spark announcement comes as competitors like Apple and Qualcomm also advance their own Arm-based AI silicon. Microsoft's deep integration with NVIDIA's software stack—including CUDA and AI frameworks—may give it an edge in developer adoption. Yet, the true test will be whether the fall hardware delivers on the promised petaflop without throttling.

More from Inside AI

  • Machine Learning

    Anthropic Accuses China’s Alibaba of Largest-Ever Claude AI Model Theft

    June 25, 2026
  • Generative AI

    China’s Z.ai Narrows AI Frontier Gap with GLM-5.2 After Anthropic Shutdown

    June 25, 2026
  • Artificial Intelligence (AI)

    Amazon Pours $13 Billion into India AI Data Centres as Cloud War Intensifies

    June 25, 2026
  • Artificial Intelligence (AI)

    Mumbai Embraces AI Crowd Monitoring at Top Sites Before Ganeshotsav

    June 25, 2026
  • Artificial Intelligence (AI)

    China’s AI and Rare Earth Leverage Exposes Fragile U.S. Ties, Scholar Warns

    June 25, 2026
  • Machine Learning

    IBM Unveils 0.7nm Chip Tech, Stacking Transistors in 3D for AI Era

    June 25, 2026
  • Generative AI

    Facebook Launches AI-Powered Creator Studio App in India to Boost Creator Growth

    June 25, 2026
  • Agentic AI

    MIT and Microsoft’s Murakkab Slashes AI Agent Energy Use by 73%

    June 25, 2026

Never Miss a Breakthrough

Join 50,000+ readers who get our daily AI intelligence briefing. No fluff, just what matters.

Inside AI is an independent publication covering artificial intelligence news, machine learning research, and the tools shaping the future of technology. No fluff. No hype. Just what matters.

Topics

  • Artificial Intelligence
  • Machine Learning
  • Generative AI
  • Agentic AI
  • Vibe Coding
  • Prompt Engineering
  • AI Tools & Reviews (Coming soon)

Company

  • Editorial Standards
  • Privacy Policy
  • Terms of Service
  • Contact

© 2026 Inside AI. All rights reserved.

Designed by Blue Flare Digital