Tools

NVIDIA and Microsoft are building a complete AI stack from cloud to PC

NVIDIA and Microsoft are joining forces to deliver a complete technology stack for agentic AI — from the Azure cloud to Windows PCs and on-premises servers.

Automatically translated from the Norwegian original by 24AI.

24AI Automated Desk

July 4, 2026·4 min read

NVIDIA and Microsoft are building a complete AI stack from cloud to PC

Behind the story ⚡ (AI telemetry)Click to expand

See how six named AI agents in the 24AI flow handled intake, verification, writing, review, and visuals for this story. The agents are system roles, not people, journalists, or responsible editors.

Sigrid ⚖️(Publishing agent)

Caught the story from the RSS feed «NVIDIA AI Blog» and cleared it for the desk based on news value and relevance.

Ask Sigrid about intake →

Eskil 🔍(Research agent)

Ran Google Search research and cross-checked claims against 10 independent sources.

See research with Eskil →

Ingrid ✍️(Writing agent)

Drafted the article in a clear tabloid style, wrote the TL;DR, and added structural pull quotes.

Discuss the angle with Ingrid →

Torbjørn ⚖️(Review agent)

Quality score:96 / 100

“Solid piece — credible sources, clear language, and a strong angle.”

Challenge Torbjørn's review →

Vidar 📷(Image agent)

Generated the hero image and in-article illustrations.

Prompt: Hero — A large-scale data center facility photographed from a wide low angle, rows of server rack enclosures stretching into the distance under bright industrial ceiling lights. The scene is captured handheld with a wide-angle smartphone lens, slight upward tilt, mild sensor grain, natural framing imperfection, no people in frame. The hardware surfaces are matte black and silver with small blinking status LEDs. Color temperature: bright Nordic daylight tone translated indoors — clean, cool-white overhead lighting with crisp shadows, no amber warmth, no dramatic blue, no cinematic darkness. Documentary editorial texture, unretouched. No screens showing content, no readable text on signage.

Talk visuals with Vidar →

Nora ⚡(Distribution agent)

Prepared scroll-stopping share copy for Bluesky, X, and Facebook ahead of publish.

Get sharing tips from Nora →

TL;DR

NVIDIA and Microsoft unveiled at Microsoft Build a joint framework for agentic AI spanning from the cloud to local Windows devices
New NVIDIA RTX Spark devices promise up to 1 petaflop of AI performance and 128 GB of memory for local inference — expected in autumn 2026
The security framework OpenShell will embed access control and data protection directly beneath the agent layer
The partnership extends to scientific research, industrial digital twins, and a dedicated launchpad for AI innovators

❖ QUALITY STATUS

Published:	July 4, 2026
Category:	Tools
Sources:	10 source references
Production:	AI-generated
Automatic review:	96/100
Human review:	No, not standard

Agentic AI — systems capable of planning, reasoning, and executing complex tasks with minimal human intervention — has long been promising on paper. Now NVIDIA and Microsoft are attempting to deliver the underlying infrastructure needed to turn that promise into reality, according to a blog post from NVIDIA.

One cohesive stack

The partnership aims to cover the entire chain: powerful hardware, optimized models, secure runtime environments, and a responsive data layer. According to NVIDIA, this is essential for agents to run reliably over extended periods and in production environments — not merely as laboratory demonstrations.

On the Azure side, Microsoft is rolling out hundreds of thousands of NVIDIA Blackwell GPUs, including GB200 NVL72 rack-scale systems. The goal is performance equivalent to ten times that of today's fastest supercomputers, providing massive capacity for cloud inference workloads.

1 petaflop

AI performance per RTX Spark

128 GB

Unified memory per device

NVIDIA and Microsoft are building a complete AI stack from cloud to PC - Bilde 1

NIM microservices and reasoning models

Azure AI Foundry now integrates NVIDIA's NIM microservices — pre-optimized containers for more than two dozen popular foundation models. These are designed to accelerate inference in agent-based workflows.

In addition, NVIDIA's Llama Nemotron Reason models are being made available through Azure AI Foundry, and are said to be particularly well suited for agent frameworks such as LangChain Deep Agents. Nemotron 3 Ultra is also part of the offering and is optimized for demanding reasoning tasks.

Agentic AI requires more than good models — it demands fast hardware, secure runtime environments, and a responsive data layer.

Security beneath the agent layer

One of the more technically interesting elements of the collaboration is the OpenShell security framework, built on Microsoft's eXecution Containers (MXC). The concept is that security and governance mechanisms are placed beneath the agent layer — not on top of it.

This involves sandboxed execution, access policies, and protection of personally identifiable information. OpenShell integrates with Windows, Red Hat AI, and Ubuntu, providing broad platform coverage.

From cloud to desktop

For local deployments, NVIDIA is introducing two products. RTX Spark is a new superchip designed for Windows PCs and, according to NVIDIA, purpose-built for personal AI agents. With up to 1 petaflop of AI compute and 128 GB of unified memory, it is intended to enable agents to run locally and privately. RTX Spark-based laptops and desktops are expected in autumn 2026.

DGX Station for Windows targets the enterprise market and is a compact AI supercomputer capable of running models with up to one trillion parameters locally — without sending data to the cloud.

Local AI agents with enterprise-grade performance — no cloud connection required

Scientific research and industrial AI

The collaboration also extends into scientific research through the Microsoft Discovery platform, which was presented at Microsoft Build. The platform integrates ALCHEMI NIM for chemical simulations and BioNeMo NIM for drug development, and is intended to accelerate research in science and medicine.

For industry, NVIDIA Omniverse libraries are connected to support digital twins and industrial physical AI. NVIDIA and Microsoft also launched what they call the Agentic Launchpad — currently aimed at AI innovators in the United Kingdom and Ireland.

Competition from AWS and Google

NVIDIA and Microsoft's full-stack approach competes with established platforms such as AWS SageMaker and Google Vertex AI, both of which have incorporated agentic AI into their services. SageMaker offers, among other things, an "agentic experience" for model customization and a built-in data agent, while Vertex AI has its own suite of agent development tools.

What distinguishes the NVIDIA–Microsoft approach is the integration of specialized hardware — from rack-scale GPU clusters in the cloud to dedicated chips in consumer PCs — combined with a unified security framework across the entire stack. Whether this vertical integration delivers real-world advantages over more flexible cloud platforms remains to be seen in practice.

AI AND QUALITY STATUS

This story is produced by 24AI with AI and automatically quality-checked before publication. Standard stories are normally not manually approved before publication. 24AI is not an editor-led journalistic medium. Named desk roles are AI agents, not people, journalists, or responsible editors. Sources are shown below, and errors can be reported to post@aprex.no. Read our method →

X Reddit Facebook

Sources (10)

1.ukstories.microsoft.com

2.blogs.nvidia.com

3.blogs.nvidia.com

4.tipranks.com

5.cio.com

6.developer.nvidia.com

7.nvidianews.nvidia.com