
Nvidia releases free update for RTX AI with up to 40% faster LLMs and optimized NVFP4

By Redação Portal

Published on January 6, 2026

Nvidia - Photo: Hepha1st0s / Shutterstock.com


Nvidia announced a free update that increases artificial intelligence performance on computers equipped with RTX cards. The improvement directly benefits users who run large language models (LLMs) and generative content creation tasks. The optimizations arrive in January 2026 and include native support for new precision formats that reduce VRAM consumption. The company remains focused on making RTX GPUs the leading platform for local AI workloads. These changes consolidate gains accumulated over the years in accelerating AI for consumers. The package combines improvements in processing speed and in the efficient use of graphics resources.

The update breaks down into core components that affect different aspects of generative AI. Users of RTX PCs gain immediate access to these tools at no additional cost.

This initiative reinforces the position of RTX cards as a premium option for local execution of advanced models.

Improvements in the performance of LLMs

The first part of the update focuses on increasing speed for large language models. Nvidia's internal tests indicate gains of up to 40% on popular LLMs such as Nemotron Nano V2 and open source GPT variants.

These optimizations apply directly to Windows environments with TensorRT-LLM. Users report shorter response times on local chatbots and text assistants.

The improvement especially benefits creators who integrate LLMs into daily workflows. Nvidia has accumulated similar advances since 2023, when it introduced initial accelerations for the RTX 30 and 40 series.
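To put the "up to 40%" figure in terms of response time, here is a minimal sketch. It assumes the gain refers to generation throughput (tokens per second), which the article does not specify, and the baseline rate and response length are hypothetical values chosen only for illustration.

```python
def response_time_s(num_tokens: int, tokens_per_second: float) -> float:
    """Time to generate num_tokens at a steady decoding rate."""
    return num_tokens / tokens_per_second


# Hypothetical baseline; not a figure from Nvidia's tests.
baseline_tps = 50.0
# Assumption: "up to 40% faster" is read as 1.4x throughput.
speedup = 1.40
updated_tps = baseline_tps * speedup

response_tokens = 500  # hypothetical response length
before = response_time_s(response_tokens, baseline_tps)
after = response_time_s(response_tokens, updated_tps)
time_saved = 1 - after / before

print(f"before: {before:.2f} s, after: {after:.2f} s, time saved: {time_saved:.1%}")
```

Under this reading, a 40% throughput gain shortens the wait by roughly 29%, not 40%, because time scales with the inverse of throughput.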

Native NVFP4 support and VRAM reduction

Native support for NVFP4 is one of the most technical new features of the update. This precision format compresses models by up to 60% compared to traditional BF16 versions.

Alongside compression, part of the workload is offloaded to system memory, freeing VRAM for other tasks. In tools like ComfyUI, NVFP4 enables gains of up to 4.6x in image generation with Flux.1 and Flux.2.
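The size reduction can be estimated with simple arithmetic. The sketch below assumes NVFP4 stores 4-bit values with one 8-bit scale per block of 16 values (about 4.5 bits per weight, ignoring the small per-tensor scale) and compares that with 16-bit BF16. The 12-billion-parameter model and the share of weights that get quantized are hypothetical; the article does not say which layers are converted.

```python
def weights_gib(num_params: float, bits_per_weight: float) -> float:
    """Storage needed for num_params weights, in GiB."""
    return num_params * bits_per_weight / 8 / 2**30


BF16_BITS = 16.0
# Assumption: 4-bit values plus one 8-bit scale shared by 16 values.
NVFP4_BITS = 4.0 + 8.0 / 16.0


def mixed_footprint_gib(num_params: float, quantized_fraction: float) -> float:
    """Footprint when only part of the weights is stored in NVFP4."""
    quantized = num_params * quantized_fraction
    kept = num_params - quantized
    return weights_gib(quantized, NVFP4_BITS) + weights_gib(kept, BF16_BITS)


params = 12e9  # hypothetical model size
full = weights_gib(params, BF16_BITS)
for fraction in (1.0, 0.85):
    small = mixed_footprint_gib(params, fraction)
    reduction = 1 - small / full
    print(f"{fraction:.0%} quantized: {full:.1f} GiB -> {small:.1f} GiB "
          f"({reduction:.0%} smaller)")
```

With every weight quantized the estimate is about 72% smaller. A figure near 60% is consistent with some layers staying in higher precision, though that is only one possible reading of the number.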

Users with previous generation cards maintain broad compatibility. The reduction in graphics memory usage makes it feasible to run larger models at modest settings.

Video generation with LTX-2 model

Nvidia collaborates with Lightricks to optimize the LTX-2 model, a leader in open source audio-video generation. The model produces synchronized clips in native 4K resolution at 50 frames per second.

With NVFP8 support, LTX-2 runs up to twice as fast on modern RTX cards. A high-quality video can be generated in about 20 seconds on compatible hardware.

The model stands out for its ability to create long-form content with integrated audio. Creators of short videos gain a powerful tool for quick local production.

Super resolution for generative videos

The RTX Video Super Resolution functionality now extends to AI-created videos. The tool upscales content from 720p to 4K with significant gains in detail and sharpness.

The integration comes to ComfyUI in February 2026. The entire process of generating and upscaling a 10-second 4K clip drops from 15 minutes to just 3 minutes.
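The scale of these numbers is easy to check. The sketch below assumes "720p" means 1280x720 and "4K" means 3840x2160 (UHD), which the article does not state explicitly, and uses the 15-minute and 3-minute figures quoted above.

```python
# Assumed pixel dimensions for the two resolutions named in the article.
RESOLUTIONS = {"720p": (1280, 720), "4K": (3840, 2160)}


def pixel_ratio(src: str, dst: str) -> float:
    """How many output pixels are produced per input pixel."""
    src_w, src_h = RESOLUTIONS[src]
    dst_w, dst_h = RESOLUTIONS[dst]
    return (dst_w * dst_h) / (src_w * src_h)


ratio = pixel_ratio("720p", "4K")
pipeline_speedup = 15 / 3  # minutes before / minutes after

print(f"pixels per frame grow {ratio:.0f}x; pipeline runs {pipeline_speedup:.0f}x faster")
```

Under these assumptions, each upscaled frame has nine times as many pixels as the 720p source, and the end-to-end time falls by a factor of five.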

This optimization benefits producers who need high-resolution output quickly. The technology takes advantage of specific accelerations from RTX GPUs to maintain high quality.

General benefits for RTX users

Up to 40% faster LLM execution on local tasks.
Up to 60% smaller generative models with NVFP4 and NVFP8.
Gains of up to 4.6x on image pipelines in ComfyUI.
Generation of 4K video with synchronized audio in less time.
Automatic upscaling of GenAI videos to higher resolution.

Integration with ComfyUI ecosystem

ComfyUI receives Nvidia-specific optimizations for generative workflows. The platform directly benefits from NVFP4 support on Flux and similar models.

Users configure complex pipelines with less demand on graphics resources. Continuous collaboration between developers ensures regular updates.

These changes make it easier to experiment with large models on regular desktops.

Accumulated advances on RTX platforms

Nvidia builds on foundations established since the release of TensorRT-LLM speedups. Previous updates brought performance multipliers to the RTX 30 and 40 series.

The company positions RTX GPUs as a complete solution for local AI, in contrast to integrated NPUs, which are limited to basic tasks.

Owners of newer cards get access to the broadest suite of tools.

Practical applications in content creation

Image and video creators gain speed in daily iterations. LTX-2 lets them produce professional clips without relying on cloud services.

The reduction in VRAM usage makes it possible to run several models at the same time. Design and editing professionals can incorporate generative AI with greater fluidity.

These tools maintain complete privacy when running locally.

Technical perspective on NVFP formats

The NVFP4 and NVFP8 formats balance accuracy and efficiency on Blackwell and earlier architectures. Quantization maintains quality close to higher precision versions.
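To show how a 4-bit format can stay close to the original values, here is a simplified sketch of block-scaled quantization. It assumes the 4-bit E2M1 value set and blocks of 16 values sharing one scale. Real NVFP4 stores that scale in 8-bit floating point and adds a per-tensor scale, both omitted here, so this illustrates the idea and is not Nvidia's implementation. All function names are hypothetical.

```python
import random

# Magnitudes representable by a 4-bit E2M1 float (sign is stored separately).
E2M1_VALUES = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)
BLOCK_SIZE = 16


def quantize_block(block):
    """Return (scale, codes); each code is a signed E2M1 value."""
    max_abs = max(abs(x) for x in block)
    # Map the largest magnitude in the block onto the largest E2M1 value.
    scale = max_abs / E2M1_VALUES[-1] if max_abs > 0 else 1.0
    codes = []
    for x in block:
        target = abs(x) / scale
        nearest = min(E2M1_VALUES, key=lambda v: abs(v - target))
        codes.append(nearest if x >= 0 else -nearest)
    return scale, codes


def dequantize_block(scale, codes):
    """Recover approximate weights from the shared scale and 4-bit codes."""
    return [c * scale for c in codes]


random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(BLOCK_SIZE)]  # toy data
scale, codes = quantize_block(weights)
restored = dequantize_block(scale, codes)
max_err = max(abs(a - b) for a, b in zip(weights, restored))

print(f"scale={scale:.5f}, max abs error={max_err:.5f}")
```

Because each block carries its own scale, the rounding error is bounded relative to the largest value in that block rather than the whole tensor, which is one reason small-block formats can keep quality close to higher-precision versions.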

Developers quickly adapt existing models to these formats. Load transfer to RAM expands capabilities on GPUs with limited VRAM.

This approach extends the lifespan of past generation hardware.

The update reinforces Nvidia’s commitment to local AI performance. RTX users receive advanced tools at no extra cost. The improvements range from text to high-resolution video. The platform continues to evolve with a focus on efficiency and speed.
