AI Agents
112 articles in this category (Page 2 of 5)
Model Context Protocol (MCP) vs. AI Agent Skills: A Deep Dive into Structured Tools and Behavioral Guidance for LLMs
A technical comparison of MCP's standardized tool interfaces and Skills' natural-language behavioral guidance for scaling AI agent capabilities and external system integration.
Building Risk-Aware AI Agents with Internal Critics and Uncertainty Estimation
Develop reliable AI agents using internal critics and uncertainty estimation to quantify risk through entropy and consistency scores, ensuring robust decision-making in production environments.
Designing a Multi-Tool Research Agent: Integrating Web Search, PDF Vision, and Automated Reporting
Build a Swiss Army Knife research agent that automates multi-step problems using tool-calling AI, vision-based chart analysis, and PDF ingestion to generate professional Markdown and DOCX reports.
DSGym Offers a Reusable Container Based Substrate for Building and Benchmarking Data Science Agents
DSGym introduces a framework for evaluating data science agents across 1,000+ challenges, revealing significant performance gaps in complex data analysis tasks.
Moonshot AI Releases Kimi K2.5: An Open Source Visual Agentic Intelligence Model with Native Swarm Execution
Moonshot AI launched Kimi K2.5, an open-source visual agentic intelligence model boasting a 1T parameter scale and achieving state-of-the-art results in agentic benchmarks.
How an AI Agent Chooses What to Do Under Tokens, Latency, and Tool-Call Budget Constraints?
This article details a cost-aware AI planning agent that balances output quality against real-world constraints, achieving up to a 20% improvement in resource efficiency.