NVIDIA Spotlights Hermes Agent and Qwen 3.6 Models for RTX and DGX Spark

Details

NVIDIA highlights Hermes Agent, a reliable, self-improving open-source AI framework from Nous Research, achieving 140,000 GitHub stars in under three months and topping OpenRouter usage.
Hermes pairs with Alibaba's Qwen 3.6 LLMs (27B and 35B parameters), optimized for NVIDIA RTX PCs, RTX PRO workstations, and DGX Spark for continuous local agentic AI.
Key features include self-evolving skills via task feedback, contained sub-agents for tidy task handling, curated reliability, and superior orchestration yielding better results than other frameworks on identical models.
Qwen 3.6 35B outperforms prior 120B models on 20GB memory; 27B matches 400B accuracy at one-sixteenth size, succeeding Qwen 3.5 series for efficient local inference.
Follows OpenClaw success; NVIDIA Tensor Cores enable fast inference, with DGX Spark offering 128GB memory and 1 petaFLOP for all-day 120B-scale MoE models.

Impact

Hermes and Qwen 3.6 advance on-device agentic AI, enabling persistent, self-improving agents on consumer RTX hardware and boosting local inference adoption amid rising open model demand. This accelerates developer ecosystems around frameworks like NemoClaw, potentially shifting R&D from cloud to edge over 12-24 months. NVIDIA's hardware optimization strengthens its lead in personal AI workflows, influencing funding toward efficient local LLMs.