NVIDIA Launches NIM Microservices to Power AI on RTX PCs | Gpu vs cpu architecture | List of hardware components with examples | List of hardware components and their functions | Turtles AI
NVIDIA unveiled its AI Foundation models for RTX PCs at CES, offering powerful NIM microservices and low-code tools for developers and enthusiasts. The new GeForce RTX 50 Series GPUs, with FP4 support, accelerate the execution of local AI models, enabling the creation of intelligent agents and generative content. With integration with development frameworks and AI Blueprints, access to the technology is easier than ever.
Key Points:
- NVIDIA Launches NIM Microservices to Run Generative AI Models on PCs with GeForce RTX 50 Series GPUs.
- Developers and enthusiasts can build intelligent agents and creative workflows using simple interfaces.
- The new RTX 50 Series GPUs deliver improved AI performance, supporting FP4 compute for faster inference.
- AI Blueprints and NIM models can be used for advanced content creation projects, such as image generation and podcasting.
At CES 2025, NVIDIA unveiled a series of innovations that are set to revolutionize the use of generative AI on PCs. Among the highlights are AI Foundation models, which include NVIDIA Inference Models (NIM) microservices to harness the power of GeForce RTX™ 50 Series GPUs. Featuring the Blackwell architecture, these GPUs deliver up to 3.352 trillion operations per second of AI performance, a significant improvement over previous models. The highlight of the new GPUs is the introduction of FP4 compute, which allows for double the AI inference performance while reducing memory consumption. Using RTX 50 Series GPUs to run generative AI models locally opens up new possibilities for developers and content creators, who can operate without relying on large data centers.
At the heart of NVIDIA’s offering are NIM microservices, which enable advanced AI models to be run directly on PCs with the power of RTX GPUs. These microservices are designed to simplify the integration of AI models into everyday applications, from productivity tools to intelligent agents. For example, users can use models to generate images, process natural language, and even handle computer vision tasks. Additionally, with the introduction of new low-code and no-code graphical interfaces, such as AnythingLLM, ComfyUI, and Langflow, even those with no programming experience can integrate AI models into complex workflows.
The development ecosystem is further enriched with AI Blueprints, which offer pre-configured workflows for common tasks. Developers will be able to draw from a large library of templates, optimized for automatic content generation, such as creating podcasts from PDF documents. These blueprints greatly simplify the creative process, allowing you to create, edit, and even generate audio recordings from a script. The possibilities are also extended to image generation, where the use of 3D scenes guides the creation of visual content through a simplified interface.
Integration with platforms such as Windows Subsystem for Linux (WSL) allows the use of these models on Windows 11 PCs, making the environment ideal for developers who want to fully exploit the potential of NVIDIA GPUs. NIM microservices are compatible with development frameworks such as VSCode AI Toolkit, LangChain, and CrewAI, allowing applications and workflows to be connected to AI models, both locally and in the cloud. Developers can therefore implement advanced AI technologies with ease, optimizing workflows and improving execution efficiency.
To complete the proposal, NVIDIA presented Project R2X, an AI-powered digital avatar that can interact in real time with users, improving productivity and user experience in remote work or automated assistance contexts. The avatar leverages the power of NVIDIA RTX Neural Faces and Audio2Face™-3D for lifelike visuals and audio, taking interactions with digital assistants to a new level.
Finally, NIM microservices, coming in February 2025, will support multiple NVIDIA GPUs, including the new GeForce RTX 50 Series and the professional RTX 6000 and 5000. Major PC manufacturers such as Acer, ASUS, HP, and Lenovo, as well as custom builders, will offer systems compatible with these technologies, making the power of generative AI on desktop PCs more accessible than ever.
With these developments, NVIDIA is giving developers and enthusiasts the tools they need to experiment and create AI-based content directly on their PCs, creating an ecosystem that puts the power of AI in everyone’s hands.