NVIDIA Technical Blog
-
Data Center / CloudIntegrating Semi-Custom Compute into Rack-Scale Architecture with NVIDIA NVLink Fusion
-
RoboticsAdvanced Sensor Physics, Customization, and Model Benchmarking Coming to NVIDIA Isaac Sim and NVIDIA Isaac Lab
-
RoboticsCurating Synthetic Datasets to Train Physical AI Models with NVIDIA Cosmos Reason
-
AI Platforms / DeploymentNVIDIA ConnectX-8 SuperNICs Advance AI Platform Architecture with PCIe Gen6 Connectivity
-
Edge ComputingDeploy AI-RAN at Cell Sites with NVIDIA ARC-Compact
Recent

May 19, 2025
Spotlight: Atgenomix SeqsLab Scales Health Omics Analysis for Precision Medicine
In traditional clinical medical practice, treatment decisions are often based on general guidelines, past experiences, and trial-and-error approaches. Today,...
9 MIN READ

May 19, 2025
NVIDIA TensorRT for RTX Introduces an Optimized Inference AI Library on Windows 11
AI experiences are rapidly expanding on Windows in creativity, gaming, and productivity apps. There are various frameworks available to accelerate AI inference...
9 MIN READ

May 18, 2025
Advance Video Analytics AI Agents Using the NVIDIA AI Blueprint for Video Search and Summarization
Vision language models (VLMs) have transformed video analytics by enabling broader perception and richer contextual understanding compared to traditional...
15 MIN READ

May 18, 2025
Announcing NVIDIA Exemplar Clouds for Benchmarking AI Cloud Infrastructure
Developers and enterprises training large language models (LLMs) and deploying AI workloads in the cloud have long faced a fundamental challenge: it’s nearly...
4 MIN READ

May 18, 2025
Spotlight: Perfect Corp. Delivers Personalized Digital Beauty Experiences Using NVIDIA TensorRT and NVENC
Augmented reality (AR) and AI are revolutionizing the beauty and fashion industry by offering hyperpersonalized experiences, from virtual try-ons to AI-driven...
4 MIN READ

May 18, 2025
Designing AI Factories Using OpenUSD and SimReady Assets
Announced at COMPUTEX 2025, the NVIDIA Omniverse Blueprint for AI factory digital twins has expanded to support OpenUSD schemas. The blueprint features new...
4 MIN READ

May 18, 2025
Integrating Semi-Custom Compute into Rack-Scale Architecture with NVIDIA NVLink Fusion
Data centers are being re-architected for efficient delivery of AI workloads. This is a hugely complicated endeavor, and NVIDIA is now delivering AI factories...
7 MIN READ

May 18, 2025
Advanced Sensor Physics, Customization, and Model Benchmarking Coming to NVIDIA Isaac Sim and NVIDIA Isaac Lab
At COMPUTEX 2025, NVIDIA announced new updates to its robotics simulation reference application NVIDIA Isaac Sim, and robot learning framework, NVIDIA Isaac...
10 MIN READ
Inference Performance

May 06, 2025
LLM Inference Benchmarking Guide: NVIDIA GenAI-Perf and NIM
This is the second post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...
11 MIN READ

Apr 21, 2025
Optimizing Transformer-Based Diffusion Models for Video Generation with NVIDIA TensorRT
State-of-the-art image diffusion models take tens of seconds to process a single image. This makes video diffusion even more challenging, requiring significant...
8 MIN READ

Apr 02, 2025
NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0
The compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency...
10 MIN READ

Apr 02, 2025
LLM Inference Benchmarking: Fundamental Concepts
This is the first post in the large language model latency-throughput benchmarking series, which aims to instruct developers on common metrics used for LLM...
15 MIN READ

Mar 20, 2025
Boost Llama Model Performance on Microsoft Azure AI Foundry with NVIDIA TensorRT-LLM
Microsoft, in collaboration with NVIDIA, announced transformative performance improvements for the Meta Llama family of models on its Azure AI Foundry platform....
4 MIN READ

Mar 18, 2025
Introducing NVIDIA Dynamo, A Low-Latency Distributed Inference Framework for Scaling Reasoning AI Models
NVIDIA announced the release of NVIDIA Dynamo today at GTC 2025. NVIDIA Dynamo is a high-throughput, low-latency open-source inference serving framework for...
14 MIN READ

Mar 18, 2025
NVIDIA Blackwell Delivers World-Record DeepSeek-R1 Inference Performance
NVIDIA announced world-record DeepSeek-R1 inference performance at NVIDIA GTC 2025. A single NVIDIA DGX system with eight NVIDIA Blackwell GPUs can achieve over...
14 MIN READ

Feb 14, 2025
Optimizing Qwen2.5-Coder Throughput with NVIDIA TensorRT-LLM Lookahead Decoding
Large language models (LLMs) that specialize in coding have been steadily adopted into developer workflows. From pair programming to self-improving AI agents,...
7 MIN READ
Generative AI

May 16, 2025
Build Agents and Understand Long Docs with Mistral Medium 3 and NVIDIA NIM
Developers building powerful multimodal applications now have a new state-of-the-art model designed for enterprise-scale performance with Mistral Medium 3....
2 MIN READ

May 15, 2025
AI Helps Uncover Potential Alzheimer’s Cause and Treatment
A gene that can be an early indicator for Alzheimer’s disease actually is a cause of the degenerative-brain disorder, said researchers at the University of...
3 MIN READ

May 14, 2025
Build Custom Reasoning Models with Advanced, Open Post-Training Datasets
Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from...
5 MIN READ

May 14, 2025
Get Trained and Certified at GTC Paris at VivaTech 2025
Join us at GTC Paris on June 10th and choose from six full-day, instructor-led workshops.
1 MIN READ

May 12, 2025
Accelerated AI Inference with NVIDIA NIM on Azure AI Foundry
The integration of NVIDIA NIM microservices into Azure AI Foundry marks a major leap forward in enterprise AI development. By combining NIM microservices with...
8 MIN READ

May 12, 2025
Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework
As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By...
6 MIN READ

May 09, 2025
Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research
Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates...
11 MIN READ

May 08, 2025
Extending the NVIDIA Agent Intelligence Toolkit to Support New Agentic Frameworks
NVIDIA Agent Intelligence toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. It focuses on enabling developers to...
12 MIN READ
Data Science

May 15, 2025
Simplify Setup and Boost Data Science in the Cloud using NVIDIA CUDA-X and Coiled
Imagine analyzing millions of NYC ride-share journeys—tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup...
10 MIN READ

May 15, 2025
Predicting Performance on Apache Spark with GPUs
The world of big data analytics is constantly seeking ways to accelerate processing and reduce infrastructure costs. Apache Spark has become a leading platform...
9 MIN READ

May 15, 2025
Accelerating Embedding Lookups with cuEmbed
NVIDIA recently released cuEmbed, a high-performance, header-only CUDA library that accelerates embedding lookups on NVIDIA GPUs. If you're building...
8 MIN READ

May 08, 2025
Accelerate Deep Learning and LLM Inference with Apache Spark in the Cloud
Apache Spark is an industry-leading platform for big data processing and analytics. With the increasing prevalence of unstructured data—documents, emails,...
10 MIN READ

May 08, 2025
Spotlight: Accelerating the Discovery of New Battery Materials with SES AI's Molecular Universe
From the Stone Age to the digital era, materials have been the foundation of our civilization across all epochs. Today, finding new materials leads to progress...
7 MIN READ

May 07, 2025
Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator
Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...
7 MIN READ

May 07, 2025
Using Python to Automate 3D Workflows with OpenUSD
Universal Scene Description (OpenUSD) offers a powerful, open, and extensible ecosystem for describing, composing, simulating, and collaborating within complex...
7 MIN READ

May 02, 2025
An Even Easier Introduction to CUDA (Updated)
Note: This blog post was originally published on Jan 25, 2017, but has been edited to reflect new updates. This post is a super simple introduction to CUDA, the...
16 MIN READ
Robotics

May 18, 2025
Curating Synthetic Datasets to Train Physical AI Models with NVIDIA Cosmos Reason
How can an AI system understand the difference between a plausible accident and a physically impossible event? Or plan a multi-step interaction across humans,...
5 MIN READ

May 16, 2025
R²D²: Unlocking Robotic Assembly and Contact Rich Manipulation with NVIDIA Research
This edition of NVIDIA Robotics Research and Development Digest (R2D2) explores several contact-rich manipulation workflows for robotic assembly tasks from...
9 MIN READ

May 12, 2025
Just Released: NVIDIA Warp is Now Open-Source Under Apache 2.0
NVIDIA Warp, a simulation computing framework, is now accessible to all developers.
1 MIN READ

Apr 25, 2025
R²D²: Adapting Dexterous Robots with NVIDIA Research Workflows and Models
Robotic arms are used today for assembly, packaging, inspection, and many more applications. However, they are still preprogrammed to perform specific and often...
8 MIN READ

Mar 31, 2025
Simulating Robots in Industrial Facility Digital Twins
Industrial enterprises are embracing physical AI and autonomous systems to transform their operations. This involves deploying heterogeneous robot fleets that...
6 MIN READ

Mar 27, 2025
R²D²: Advancing Robot Mobility and Whole-Body Control with Novel Workflows and AI Foundation Models from NVIDIA Research
Welcome to the first edition of the NVIDIA Robotics Research and Development Digest (R2D2). This technical blog series will give developers and researchers...
10 MIN READ

Mar 26, 2025
Just Released: Omniverse Kit 107.0
Kit SDK 107.0 is a major update release with primary updates for robotics development.
1 MIN READ

Mar 20, 2025
Easily Build Edge AI Apps with Dynamic Flow Control in NVIDIA Holoscan 3.0
NVIDIA announced at GTC 2025 the release of NVIDIA Holoscan 3.0, the real-time AI sensor processing platform. This latest version provides dynamic flow control,...
4 MIN READ
Simulation / Modeling / Design

May 09, 2025
CUDA C++ Compiler Updates Impacting ELF Visibility and Linkage
In the next CUDA major release, CUDA 13.0, NVIDIA is introducing two significant changes to the NVIDIA CUDA Compiler Driver (NVCC) that will impact ELF...
11 MIN READ

May 08, 2025
Revolutionizing Neural Reconstruction and Rendering in gsplat with 3DGUT
Realistic 3D simulation is becoming a cornerstone of modern AI and graphics, from training autonomous vehicles (AV) to powering robotics and digital twins....
5 MIN READ

May 06, 2025
Powering Next-Gen XR Design at Rivian with NVIDIA RTX PRO Blackwell Desktop GPUs
For professionals pushing the boundaries of XR, creating the most immersive and highest fidelity experiences is always challenging. Demanding XR workflows push...
6 MIN READ

May 01, 2025
NVIDIA Blackwell and NVIDIA CUDA 12.9 Introduce Family-Specific Architecture Features
One of the earliest architectural design decisions that went into the CUDA platform for NVIDIA GPUs was support for backward compatibility of GPU code. This...
14 MIN READ

Apr 28, 2025
How SETI Uses AI to Search for Intelligent Alien Life
A researcher from the SETI Institute described to a packed audience at GTC 2025 how SETI had successfully trialed a novel method to identify interstellar radio...
3 MIN READ

Apr 24, 2025
Fast Ray Tracing of Dynamic Scenes Using NVIDIA OptiX 9 and NVIDIA RTX Mega Geometry
Real-time ray tracing is a powerful rendering technique that can create incredibly realistic images. NVIDIA OptiX and RTX technology make this possible, even...
9 MIN READ

Apr 23, 2025
NVIDIA cuPyNumeric 25.03 Now Fully Open Source with PIP and HDF5 Support
NVIDIA cuPyNumeric is a library that aims to provide a distributed and accelerated drop-in replacement for NumPy built on top of the Legate framework. It brings...
4 MIN READ

Apr 21, 2025
AI Inspires Artists and Industrialists to Reimagine their Crafts
AI has become nearly synonymous with innovation. As it rushes onto the world stage, AI is seeding inspiration in creators and problem-solvers of all...
4 MIN READ
Computer Vision / Video Analytics

Apr 24, 2025
Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM
This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...
7 MIN READ

Apr 16, 2025
AI-Generated Heat Maps Keep Seniors and their Privacy Safe
By 2030, more than one in five Americans will be 65 or older, becoming the United States’ largest group of seniors ever. Silicon Valley-based startup Butlr...
4 MIN READ

Apr 11, 2025
AI Advances Parkinson’s Detection Using Standard MRI Scans
A simple brain scan may soon be all that's needed to accurately diagnose Parkinson’s disease, thanks to a new AI-powered tool. The advancement could help...
3 MIN READ

Apr 05, 2025
NVIDIA Accelerates Inference on Meta Llama 4 Scout and Maverick
The newest generation of the popular Llama AI models is here with Llama 4 Scout and Llama 4 Maverick. Accelerated by NVIDIA open-source software, they can...
4 MIN READ

Mar 11, 2025
Build Real-Time Multimodal XR Apps with NVIDIA AI Blueprint for Video Search and Summarization
With the recent advancements in generative AI and vision foundational models, VLMs present a new wave of visual computing wherein the models are capable of...
9 MIN READ

Mar 03, 2025
AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale
In an effort to rein in illicit fishing, researchers have unveiled a new open-source AI model that can accurately identify what virtually all of the world’s...
5 MIN READ

Feb 26, 2025
Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs
Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...
4 MIN READ

Feb 26, 2025
Vision Language Model Prompt Engineering Guide for Image and Video Understanding
Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...
12 MIN READ
Content Creation / Rendering

May 15, 2025
Path Tracing Optimization in Indiana Jones™: Shader Execution Reordering and Live State Reductions
This post is part of the Path Tracing Optimizations in Indiana Jones™ series. While adding a path-tracing mode to Indiana Jones and the Great Circle™...
13 MIN READ

May 15, 2025
Path Tracing Optimizations in Indiana Jones™: Opacity MicroMaps and Compaction of Dynamic BLASs
The first post in this series, Path Tracing Optimization in Indiana Jones™: Shader Execution Reordering and Live State Reductions, covered ray-gen shader...
13 MIN READ

May 14, 2025
NVIDIA TensorRT Unlocks FP4 Image Generation for NVIDIA Blackwell GeForce RTX 50 Series GPUs
The launch of the NVIDIA Blackwell platform ushered in a new era of improvements in generative AI technology. At its forefront is the newly launched GeForce RTX...
11 MIN READ

Apr 23, 2025
Real-Time GPU-Accelerated Gaussian Splatting with NVIDIA DesignWorks Sample vk_gaussian_splatting
Gaussian splatting is a novel approach to rendering complex 3D scenes by representing them as a collection of anisotropic Gaussians in 3D space. This technique...
3 MIN READ

Apr 17, 2025
Neural Rendering in NVIDIA OptiX Using Cooperative Vectors
The release of NVIDIA OptiX 9.0 introduces a new feature called cooperative vectors that enables AI workflows as part of ray tracing kernels. The feature...
13 MIN READ

Apr 09, 2025
Just Released: NVIDIA AI Workbench 2025.03.10
NVIDIA AI Workbench 2025.03.10 features streamlined onboarding and enhanced UX for multicontainer projects.
1 MIN READ

Mar 25, 2025
Kickstart Your AI Journey on RTX AI PCs and Workstations with NVIDIA NIM Microservices
With emerging use cases such as digital humans, agents, podcasts, images, and video generation, generative AI is changing the way we interact with PCs. This...
7 MIN READ

Mar 18, 2025
Scale Synthetic Data and Physical AI Reasoning with NVIDIA Cosmos World Foundation Models
The next generation of AI-driven robots like humanoids and autonomous vehicles depends on high-fidelity, physics-aware training data. Without diverse and...
7 MIN READ
Conversational AI

May 07, 2025
Concept‑Driven AI Teaching Assistant Guides Students to Deeper Insights
In today's educational landscape, generative AI tools have become both a blessing and a challenge. While these tools offer unprecedented access to information,...
8 MIN READ

Apr 29, 2025
Spotlight: Personal AI Brings AI Receptionists to Small Business Owners with NVIDIA Riva
It's 10 p.m. on a Tuesday when the phone rings at the Sapochnick Law Firm, a specialized law practice in San Diego, California. The caller, a client of the...
6 MIN READ

Apr 22, 2025
NVIDIA GTC Training Labs Now Available On Demand
Missed GTC? This year’s training labs are now available on demand to watch anywhere, anytime.
1 MIN READ

Apr 18, 2025
Upcoming Event: NVIDIA Agent Toolkit Hackathon
Build a high-performance agentic AI system using the open-source NVIDIA Agent Intelligence toolkit — contest runs May 12 to May 23.
1 MIN READ

Apr 10, 2025
Curating Biological Findings from Scientific Literature with NVIDIA NIM
Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...
7 MIN READ

Apr 09, 2025
Prevent LLM Hallucinations with the Cleanlab Trustworthy Language Model in NVIDIA NeMo Guardrails
As more enterprises integrate LLMs into their applications, they face a critical challenge: LLMs can generate plausible but incorrect responses, known as...
9 MIN READ

Mar 04, 2025
Top Conversational AI Sessions at NVIDIA GTC 2025
Learn how to accelerate the full pipeline, from multilingual speech recognition and translation to generative AI and speech synthesis.
1 MIN READ

Mar 03, 2025
Measuring the Effectiveness and Performance of AI Guardrails in Generative AI Applications
Safeguarding AI agents and other conversational AI applications to ensure safe, on-brand and reliable behavior is essential for enterprises. NVIDIA NeMo...
12 MIN READ
Edge Computing

May 18, 2025
Deploy AI-RAN at Cell Sites with NVIDIA ARC-Compact
Wireless networks are the backbone of modern connectivity, serving billions of 5G users through millions of cell sites globally. The opportunities and benefits...
11 MIN READ

Apr 16, 2025
Efficient Federated Learning in the Era of LLMs with Message Quantization and Streaming
Federated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy....
8 MIN READ

Apr 15, 2025
Event: Data Filtering Challenge for Training Edge Language Models
You’re invited to join the challenge. Develop and apply innovative data filtering techniques to curate datasets that enhance edge LM performance.
1 MIN READ

Apr 11, 2025
Effortless Federated Learning on Mobile with NVIDIA FLARE and Meta ExecuTorch
NVIDIA and the PyTorch team at Meta announced a groundbreaking collaboration that brings federated learning (FL) capabilities to mobile devices through the...
12 MIN READ

Apr 08, 2025
Using AI to Better Understand the Ocean
Humans know more about deep space than we know about Earth’s deepest oceans. But scientists have plans to change that—with the help of AI. “We have...
3 MIN READ

Mar 12, 2025
Lightweight, Multimodal, Multilingual Gemma 3 Models Are Streamlined for Performance
Building AI systems with foundation models requires a delicate balancing of resources such as memory, latency, storage, compute, and more. One size does not fit...
3 MIN READ

Mar 10, 2025
Streamline LLM Deployment for Autonomous Vehicle Applications with NVIDIA DriveOS LLM SDK
Large language models (LLMs) have shown remarkable generalization capabilities in natural language processing (NLP). They are used in a wide range of...
7 MIN READ

Feb 20, 2025
Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025
Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.
1 MIN READ
Data Center / Cloud

May 18, 2025
NVIDIA ConnectX-8 SuperNICs Advance AI Platform Architecture with PCIe Gen6 Connectivity
As AI workloads grow in complexity and scale—from large language models (LLMs) to agentic AI reasoning and physical AI—the demand for faster, more scalable...
5 MIN READ

May 16, 2025
Building the Modular Foundation for AI Factories with NVIDIA MGX
The exponential growth of generative AI, large language models (LLMs), and high-performance computing has created unprecedented demands on data center...
6 MIN READ

May 14, 2025
AI Fabric Resiliency and Why Network Convergence Matters
High-performance computing and deep learning workloads are extremely sensitive to latency. Packet loss forces retransmission or stalls in the communication...
7 MIN READ

May 13, 2025
Connect Simulations with the Real World Using NVIDIA Air Services
NVIDIA Air enables cloud-scale efficiency by creating identical replicas of real-world data center infrastructure deployments. With NVIDIA Air, you can spin up...
6 MIN READ

May 08, 2025
Turbocharge LLM Training Across Long-Haul Data Center Networks with NVIDIA Nemo Framework
Multi-data center training is becoming essential for AI factories as pretraining scaling fuels the creation of even larger models, leading the demand for...
6 MIN READ

May 06, 2025
New NVIDIA NV-Tesseract Time Series Models Advance Dataset Processing and Anomaly Detection
Time-series data has evolved from a simple historical record into a real-time engine for critical decisions across industries. Whether it’s streamlining...
5 MIN READ

Apr 29, 2025
NVIDIA NIM Operator 2.0 Boosts AI Deployment with NVIDIA NeMo Microservices Support
The first release of NVIDIA NIM Operator simplified the deployment and lifecycle management of inference pipelines for NVIDIA NIM microservices, reducing the...
5 MIN READ

Apr 23, 2025
Announcing NVIDIA Secure AI General Availability
As many enterprises move to running AI training or inference on their data, the data and the code need to be protected, especially for large language models...
3 MIN READ