Agentic AI
195 articles in this category (Page 6 of 9)
LangWatch Open Sources Evaluation Layer for AI Agents to Solve Non-Determinism
LangWatch launches an open-source platform for AI agent evaluation and tracing, enabling developers to move beyond anecdotal testing with end-to-end simulations and OTel-native monitoring.
Nous Research Unveils Hermes Agent: Solving LLM Forgetfulness with Multi-Level Memory and Persistent Terminal Access
Nous Research releases Hermes Agent, an open-source system featuring a multi-level memory hierarchy and persistent machine access to eliminate AI state decay.
Beyond Simple API Requests: How OpenAI’s WebSocket Mode Changes the Game for Low Latency Voice Powered AI Experiences
OpenAI's Realtime API collapses the speech-to-text, LLM, and text-to-speech (STT-LLM-TTS) stack into a single WebSocket connection, enabling full-duplex, multimodal GPT-4o interactions with reported sub-millisecond latency improvements.
VectifyAI Launches Mafin 2.5 and PageIndex: Achieving 98.7% Financial RAG Accuracy
VectifyAI has launched Mafin 2.5 and the open-source PageIndex framework, achieving a record-breaking 98.7% accuracy on FinanceBench. By replacing traditional vector similarity with hierarchical tree indexing, PageIndex solves the problem of structural context loss in complex financial documents like SEC filings and balance sheets.
Designing a Multi-Tool Research Agent: Integrating Web Search, PDF Vision, and Automated Reporting
Build a Swiss Army Knife research agent that automates multi-step problems using tool-calling AI, vision-based chart analysis, and PDF ingestion to generate professional Markdown and DOCX reports.
Gemini 3.1 Pro: 1M Token Context and 77.1% ARC-AGI-2 Reasoning for AI Agents
Google releases Gemini 3.1 Pro with a 1M-token context window and a 77.1% ARC-AGI-2 reasoning score, targeting developers of high-performance autonomous AI agents. The release focuses on reasoning stability, software engineering, and tool-use reliability for complex technical workflows.
Building Glass-Box AI Agents: A Guide to Auditable Decision Loops and Human Gates
Learn to build transparent AI agents using LangGraph and hash-chained audit trails to ensure every decision is traceable and governed by human approval gates for high-risk operations.
Building a Production-Grade Agentic AI System with Hybrid Retrieval and Episodic Memory
A tutorial on building an advanced agentic AI workflow with hybrid retrieval, provenance-first citations, repair loops, and episodic memory, with a reported 99% success rate in retrieving relevant information.