AI

NVIDIA Spotlights Hermes Agent and Qwen 3.6 Models for RTX and DGX Spark

Wednesday, May 13, 2026Read Original

Details

  • NVIDIA highlights Hermes Agent, a reliable, self-improving open-source AI framework from Nous Research, achieving 140,000 GitHub stars in under three months and topping OpenRouter usage.
  • Hermes pairs with Alibaba's Qwen 3.6 LLMs (27B and 35B parameters), optimized for NVIDIA RTX PCs, RTX PRO workstations, and DGX Spark for continuous local agentic AI.
  • Key features include self-evolving skills via task feedback, contained sub-agents for tidy task handling, curated reliability, and superior orchestration yielding better results than other frameworks on identical models.
  • Qwen 3.6 35B outperforms prior 120B models on 20GB memory; 27B matches 400B accuracy at one-sixteenth size, succeeding Qwen 3.5 series for efficient local inference.
  • Follows OpenClaw success; NVIDIA Tensor Cores enable fast inference, with DGX Spark offering 128GB memory and 1 petaFLOP for all-day 120B-scale MoE models.

Impact

Hermes and Qwen 3.6 advance on-device agentic AI, enabling persistent, self-improving agents on consumer RTX hardware and boosting local inference adoption amid rising open model demand. This accelerates developer ecosystems around frameworks like NemoClaw, potentially shifting R&D from cloud to edge over 12-24 months. NVIDIA's hardware optimization strengthens its lead in personal AI workflows, influencing funding toward efficient local LLMs.

Rift Dispatchpractical systems & stories, weekly
NVIDIA Spotlights Hermes Agent and Qwen 3.6 Models for RTX and DGX Spark | riftlab.ai