Build a Custom LLM with ChatRTX – NVIDIA

Welcome to the forefront of conversational AI as we explore the fascinating world of AI chatbots in our dedicated blog series. Discover the latest advancements, applications, and strategies that propel the evolution of chatbot technology. From enhancing customer interactions to streamlining business processes, these articles delve into the innovative ways artificial intelligence is shaping the landscape of automated conversational agents. Whether you’re a business owner, developer, or simply intrigued by the future of interactive technology, join us on this journey to unravel the transformative power and endless possibilities of AI chatbots.
Visit your regional NVIDIA website for local content, pricing, and where to buy partners specific to your country.
AI-driven platform for life sciences research and discovery
Fully managed end-to-end AI platform on leading clouds
Build, customize, and deploy multimodal generative AI
Integrate advanced simulation and AI into complex 3D workflows
Guide for using NVIDIA NGC private registry with GPU cloud
Accelerated, containerized AI models and SDKs
Modernizing data centers with AI and accelerated computing
Enterprise AI factory for model development and deployment
Architecture for data centers that transform data into intelligence
A supercomputer purpose-built for AI and HPC
Advanced functional safety and security for edge AI
Accelerated computing with modular servers
Scalable data center infrastructure for high-performance AI
Leading platform for autonomous machines and embedded applications
Powerful in-vehicle computing for AI-driven autonomous vehicle systems
AI-powered computing for innovative medical devices and imaging
Explore graphics cards, gaming solutions, AI technology, and more
RTX graphics cards bring game-changing AI capabilities
Thinnest and longest lasting RTX laptops, optimized by Max-Q
Smooth, tear-free gaming with NVIDIA G-SYNC monitors
Neural rendering tech boosts FPS and enhances image quality
Ultimate responsiveness for faster reactions and better aim
AI PCs for gaming, creating, productivity and development
High performance laptops and desktops, purpose-built for creators
RTX-powered cloud gaming. Choose from 3 memberships
Optimize gaming, streaming, and AI-powered creativity
AI-enhanced voice and video for next-level streams, videos, and calls
World-class streaming media performance
The engine of the new industrial revolution
High performance, scalability, and security for every data center
Performance and energy efficiency for endless possibilities
RTX graphics cards bring game-changing AI capabilities
Accelerating professional AI, graphics, rendering and compute workloads
Virtual solutions for scalable, high-performance computing
GPU-powered laptops for gamers and creators
High performance laptops purpose-built for creators
Accelerate professional AI and visual computing from anywhere
Accelerated networks for modern workloads
Software-defined hardware accelerators for networking, storage, and security
Ethernet performance, availability, and ease of use across a wide range of applications
High-performance networking for super computers, AI, and cloud data centers
Networking software for optimized performance and scalability
IO subsystem for modern, GPU-accelerated data centers
Accelerating professional AI, graphics, rendering, and compute workloads
A Grace Blackwell AI Supercomputer on your desk
The ultimate desktop AI supercomputer powered by NVIDIA Grace Blackwell
Accelerate innovation and productivity in AI workflows
Powerful AI, graphics, rendering, and compute workloads
Accelerate professional AI and visual computing from anywhere
Simplify AI development with NVIDIA AI Workbench on GPUs
Explore NVIDIA’s AI models, blueprints, and tools for developers
AI and HPC software solutions for data center acceleration
Monitor and manage GPU performance in cluster environments
Explore NVIDIA developer tools for AI, graphics, and HPC
Discover GPU-optimized AI, HPC, and data science software
Optimize enterprise GPU management
Accelerate AI and HPC workloads with NVIDIA GPU Cloud solutions
Enhance multi-display productivity with NVIDIA RTX Desktop Manager
Creative tools and AI-powered apps for artists and designers
AI-powered audio and video enhancement
Add intelligence and efficiency to your business with AI and machine learning
Build AI agents designed to reason, plan, and act
Powering a new class of enterprise infrastructure for AI
Enables natural, personalized interactions with real-time speech AI
AI-driven solutions to strengthen cybersecurity and AI infrastructure
Iterate on large datasets, deploy models more frequently, and lower total cost
Instantly run and deploy Generative AI
Drive breakthrough performance with AI-enabled applications and services
Powering AI, HPC, and modern workloads with NVIDIA
Bringing enterprise storage into the era of agentic AI
Accelerated computing uses specialized hardware to boost IT performance
On-demand IT resources and services, enabling scalability and intelligent insights
Accelerate the scaling of AI across your organization
Accelerate AI with MLOps
High speed ethernet interconnect solutions and services
Save energy and lower cost with AI and accelerated computing
NVIDIA virtual GPU software delivers powerful GPU performance
Streamline building, operating, and connecting metaverse apps
Develop real-time interactive design using AI-accelerated real-time digital twins
Harness the power of large-scale, physically-based OpenUSD simulation
Bring state-of-the-art rendering to professional workflows
Innovative solutions to take on your robotics, edge, and vision AI challenges
Enablies researchers to visualize their large datasets at interactive speeds
AI-defined vehicles are transforming the future of mobility
Transform workflows with immersive, scalable interactions in virtual environments
Discover NVIDIA’s HPC solutions for AI, simulation, and accelerated computing
Boost accuracy with GPU-accelerating HPC and AI
Enables researchers to visualize large datasets at interactive speeds
Accelerate simulation workloads
Fast-tracking the advancement of scientific innovations with QPUs
Innovative solutions to take on robotics, edge, and vision AI challenges
GPU-accelerated advances in AI perception, simulation, and software
Bring the power of NVIDIA AI to the edge for real-time decision-making solutions
Transform data into valuable insights using vision AI
AI-enhanced vehicles are transforming the future of mobility
Essential data center tools for safe autonomous vehicle development
Explore high-fidelity sensor simulation for safe autonomous vehicle development
Develop automated driving functions and immersive in-cabin experiences
State-of-the-art system for AV safety, from the cloud to the car
AI-driven platform for life sciences research and discovery
Fully managed end-to-end AI platform on leading clouds
Build, customize, and deploy multimodal generative AI
Integrate advanced simulation and AI into complex 3D workflows
Guide for using NVIDIA NGC private registry with GPU cloud
Accelerated, containerized AI models and SDKs
Modernizing data centers with AI and accelerated computing
Enterprise AI factory for model development and deployment
Architecture for data centers that transform data into intelligence
A supercomputer purpose-built for AI and HPC
Advanced functional safety and security for edge AI
Accelerated computing with modular servers
Scalable data center infrastructure for high-performance AI
Leading platform for autonomous machines and embedded applications
Powerful in-vehicle computing for AI-driven autonomous vehicle systems
AI-powered computing for innovative medical devices and imaging
Explore graphics cards, gaming solutions, AI technology, and more
RTX graphics cards bring game-changing AI capabilities
Thinnest and longest lasting RTX laptops, optimized by Max-Q
Smooth, tear-free gaming with NVIDIA G-SYNC monitors
Neural rendering tech boosts FPS and enhances image quality
Ultimate responsiveness for faster reactions and better aim
AI PCs for gaming, creating, productivity and development
High performance laptops and desktops, purpose-built for creators
RTX-powered cloud gaming. Choose from 3 memberships
Optimize gaming, streaming, and AI-powered creativity
AI-enhanced voice and video for next-level streams, videos, and calls
World-class streaming media performance
The engine of the new industrial revolution
High performance, scalability, and security for every data center
Performance and energy efficiency for endless possibilities
RTX graphics cards bring game-changing AI capabilities
Accelerating professional AI, graphics, rendering and compute workloads
Virtual solutions for scalable, high-performance computing
GPU-powered laptops for gamers and creators
High performance laptops purpose-built for creators
Accelerate professional AI and visual computing from anywhere
Accelerated networks for modern workloads
Software-defined hardware accelerators for networking, storage, and security
Ethernet performance, availability, and ease of use across a wide range of applications
High-performance networking for super computers, AI, and cloud data centers
Networking software for optimized performance and scalability
IO subsystem for modern, GPU-accelerated data centers
Accelerating professional AI, graphics, rendering, and compute workloads
A Grace Blackwell AI Supercomputer on your desk
The ultimate desktop AI supercomputer powered by NVIDIA Grace Blackwell
Accelerate innovation and productivity in AI workflows
Powerful AI, graphics, rendering, and compute workloads
Accelerate professional AI and visual computing from anywhere
Simplify AI development with NVIDIA AI Workbench on GPUs
Explore NVIDIA’s AI models, blueprints, and tools for developers
AI and HPC software solutions for data center acceleration
Monitor and manage GPU performance in cluster environments
Explore NVIDIA developer tools for AI, graphics, and HPC
Discover GPU-optimized AI, HPC, and data science software
Optimize enterprise GPU management
Accelerate AI and HPC workloads with NVIDIA GPU Cloud solutions
Enhance multi-display productivity with NVIDIA RTX Desktop Manager
Creative tools and AI-powered apps for artists and designers
AI-powered audio and video enhancement
Add intelligence and efficiency to your business with AI and machine learning
Build AI agents designed to reason, plan, and act
Powering a new class of enterprise infrastructure for AI
Enables natural, personalized interactions with real-time speech AI
AI-driven solutions to strengthen cybersecurity and AI infrastructure
Iterate on large datasets, deploy models more frequently, and lower total cost
Instantly run and deploy Generative AI
Drive breakthrough performance with AI-enabled applications and services
Powering AI, HPC, and modern workloads with NVIDIA
Bringing enterprise storage into the era of agentic AI
Accelerated computing uses specialized hardware to boost IT performance
On-demand IT resources and services, enabling scalability and intelligent insights
Accelerate the scaling of AI across your organization
Accelerate AI with MLOps
High speed ethernet interconnect solutions and services
Save energy and lower cost with AI and accelerated computing
NVIDIA virtual GPU software delivers powerful GPU performance
Streamline building, operating, and connecting metaverse apps
Develop real-time interactive design using AI-accelerated real-time digital twins
Harness the power of large-scale, physically-based OpenUSD simulation
Bring state-of-the-art rendering to professional workflows
Innovative solutions to take on your robotics, edge, and vision AI challenges
Enablies researchers to visualize their large datasets at interactive speeds
AI-defined vehicles are transforming the future of mobility
Transform workflows with immersive, scalable interactions in virtual environments
Discover NVIDIA’s HPC solutions for AI, simulation, and accelerated computing
Boost accuracy with GPU-accelerating HPC and AI
Enables researchers to visualize large datasets at interactive speeds
Accelerate simulation workloads
Fast-tracking the advancement of scientific innovations with QPUs
Innovative solutions to take on robotics, edge, and vision AI challenges
GPU-accelerated advances in AI perception, simulation, and software
Bring the power of NVIDIA AI to the edge for real-time decision-making solutions
Transform data into valuable insights using vision AI
AI-enhanced vehicles are transforming the future of mobility
Essential data center tools for safe autonomous vehicle development
Explore high-fidelity sensor simulation for safe autonomous vehicle development
Develop automated driving functions and immersive in-cabin experiences
State-of-the-art system for AV safety, from the cloud to the car
Demo
Your Personalized AI Chatbot
Version 0.5
ChatRTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content—docs, notes, images, or other data. Leveraging retrieval-augmented generation (RAG), TensorRT™-LLM, NVIDIA NIM™ microservices, and RTX™ acceleration, you can query a custom chatbot to quickly get contextually relevant answers. And because it all runs locally on your Windows RTX PC or workstation, you’ll get fast and private results.
ChatRTX supports various file formats, including TXT, PDF, DOC/DOCX, JPG, PNG, GIF, and XML. Simply point the application at the folder containing your files, and it’ll load them into the library in a matter of seconds.
ChatRTX features an automatic speech recognition system that uses AI to process spoken language and provide text responses with support for multiple languages. Simply click the microphone icon and talk to ChatRTX to get started.
Let ChatRTX do the work—sort through your photo albums with a simple text or voice search, keeping everything private and hassle-free.
ChatRTX provides access to NVIDIA NIM microservices, featuring the latest AI models optimized for RTX. With NVIDIA NIM, you can easily download, set up, and build AI-powered applications to accelerate workflows, boost productivity, and unlock the full potential of AI models.
Simply download, install, and start chatting right away.
The ChatRTX tech demo is built from the TensorRT-LLM RAG developer reference project available from GitHub. Developers can use that reference to develop and deploy their own RAG-based applications for RTX, accelerated by TensorRT-LLM.
NVIDIA NIM Microservices and Blueprints
Harness the latest generative AI models locally on your RTX AI PC with NVIDIA NIM microservices and AI Blueprints. Optimized for your RTX GPU, NIM microservices unlock access to cutting-edge AI from your favorite apps—like ChatRTX, AnythingLLM, ComfyUI, and LM Studio. Combine them with Blueprints to quickly set up, customize, and deploy AI-driven workflows that automate tasks and unlock new capabilities. Shape and experience the AI of tomorrow, today. Powered by NVIDIA RTX™
Upgrade to advanced AI with NVIDIA GeForce RTX™ GPUs and accelerate gaming, creating, productivity, and development. Thanks to specialized built-in AI processors, you get world-leading AI technology and performance powering everything you do—plus, your data always stays local on your Windows PC.
Explore NVIDIA’s generative AI developer tools and enterprise solutions.
NVIDIA Privacy Policy