Enhanced AI Efficiency: NVIDIA NIM and AgentIQ Integrated into Azure AI Foundry
Microsoft, in partnership with NVIDIA, has integrated NVIDIA NIM microservices and the NVIDIA AgentIQ toolkit into Azure AI Foundry. This integration promises significant advancements in how AI projects are developed and deployed, leading to improved efficiency, performance, and cost optimization.
With enterprise AI initiatives often requiring 9 to 12 months for deployment, any advancements that accelerate the process are valuable. This integration is designed to simplify the AI development lifecycle, reducing the time to market and improving overall efficiency.
NVIDIA NIM on Azure AI Foundry
NVIDIA NIM, part of the NVIDIA AI Enterprise software suite, presents a collection of easy-to-use microservices optimized for secure, reliable, and high-performance AI inferencing. These services leverage technologies such as NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM, and PyTorch, and are built to scale seamlessly on Azure’s managed compute infrastructure. The key benefits include:
- Zero-configuration deployment: Quick setup with out-of-the-box optimization.
- Seamless Azure integration: Effortless compatibility with Azure AI Agent Service and Semantic Kernel.
- Enterprise-grade reliability: Benefit from NVIDIA AI Enterprise support, ensuring consistent performance and security.
- Scalable inference: Leverage Azure’s NVIDIA-accelerated infrastructure for demanding workloads.
- Optimized workflows: Accelerate applications ranging from large language models to advanced analytics.
Deploying these services is simplified, requiring only a few clicks to integrate models—like Llama-3.3-70B-NIM—from the Azure AI Foundry model catalog into existing AI workflows. This enables building generative AI applications that perform flawlessly within the Azure ecosystem.
Optimizing Performance with NVIDIA AgentIQ
Once the NVIDIA NIM microservices are deployed, NVIDIA AgentIQ enhances performance. This open-source toolkit is designed to connect, profile, and optimize AI agents, enabling optimal system performance. AgentIQ offers:
- Profiling and optimization: Real-time telemetry helps fine-tune AI agent placement, reducing latency and compute requirements.
- Dynamic inference enhancements: Continuous collection and analysis of metadata (e.g., output tokens per call, estimated time to next inference) dynamically improves agent performance.
- Integration with Semantic Kernel: Direct integration with Azure AI Foundry Agent Service enhances AI agents with semantic reasoning and task execution.
This intelligent profiling reduces compute costs while improving accuracy and responsiveness throughout the agentic AI workflow.

NVIDIA intends to integrate its Llama Nemotron Reason open reasoning model, which, according to NVIDIA, excels in coding, complex math, and scientific reasoning while understanding user intent.
Real-world Impact
Industry leaders are witnessing the practical benefits of these innovations. Drew McCombs, VP of Cloud and Analytics at Epic, mentioned, “The launch of NVIDIA NIM microservices in Azure AI Foundry offers a secure and efficient way for Epic to deploy open-source generative AI models that improve patient care, boost clinician and operational efficiency, and uncover new insights to drive medical innovation.”
In collaboration with UW Health and UC San Diego Health, research is underway to evaluate clinical summaries with these advanced models. Together, they’re implementing cutting-edge AI to benefit clinicians and patients in meaningful ways.
Jon Sigler, EVP, Platform and AI at ServiceNow, emphasized how “This combination of ServiceNow’s AI platform with NVIDIA NIM and Microsoft Azure AI Foundry and Azure AI Agent Service helps us bring to market industry-specific, out-of-the-box AI agents, delivering full-stack agentic AI solutions to help resolve problems faster, deliver great customer experiences, and accelerate improvements in organizations’ productivity and efficiency.”
Unlock AI-powered Innovation
By combining NVIDIA NIM’s deployment capabilities with AgentIQ’s dynamic optimization, Azure AI Foundry provides a comprehensive solution for building, deploying, and scaling enterprise-grade agentic applications. This integration accelerates AI deployments, enhances agentic workflows, and lowers infrastructure costs, freeing resources to drive innovation.
Are you ready to accelerate your AI journey? Deploy NVIDIA NIM microservices and optimize your AI agents with the NVIDIA AgentIQ toolkit on Azure AI Foundry.