Salesforce's eVerse Simulates Realistic Customer Service Interactions
These articles are AI-generated summaries. Please check the original sources for full details.
Simulating Lousy Conversations: Q&A with Silvio Savarese, Chief Scientist & Head of AI Research at Salesforce
Salesforce is developing eVerse, a simulation environment designed to “battle-test” AI agents against the complexities of real-world customer service calls. Chief Scientist Silvio Savarese explains the need to move beyond scripted interactions, noting that current phone menus struggle with nuanced, multi-step problems, and that eVerse allows for the creation of synthetic data to address these edge cases.
Why This Matters
Current LLM training often relies on clean, curated datasets, a stark contrast to the messy reality of human communication. This disparity leads to AI agents failing in unpredictable environments, resulting in poor customer experiences and increased costs for human intervention—estimated at 60-70% of inbound healthcare contact center calls requiring human assistance.
Key Insights
- Synthetic Data Generation: Salesforce leverages small amounts of real data to create a wide range of synthetic scenarios, ensuring agents aren’t simply memorizing responses.
- Agentforce Determinism: Salesforce’s Agentforce platform allows for control over agent behavior, enabling developers to dial in levels of determinism or creativity.
- Human-in-the-Loop Learning: UCSF Health pilot shows agents can learn from human experts, improving coverage from 60-70% to 84-88% of routine inquiries.
Practical Applications
- Healthcare Billing: UCSF Health uses eVerse to automate routine billing inquiries, freeing up human agents for complex cases.
- Pitfall: Over-reliance on pre-training data can lead to agents failing to generalize to real-world scenarios and exhibiting inappropriate responses.
References:
Continue reading
Next article
The Impact of Robotic Process Automation (RPA) on Identity and Access Management
Related Content
Unrolling the Codex agent loop
A technical deep dive into the Codex agent loop, explaining how Codex CLI orchestrates models, tools, prompts, and performance, achieving efficient agent behavior.
Privacy in Action: Realistic mitigation and evaluation for agentic LLMs
New research from Microsoft demonstrates two approaches to reducing privacy leaks in AI agents, achieving up to a 25% reduction in information leakage while preserving task completion.
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
LinkedIn successfully enabled agentic reinforcement learning training for the GPT-OSS-20B model, achieving comparable performance to OpenAI’s o3-mini and o4-mini.