VeriTrail: Detect hallucination and trace provenance in AI workflows

2025年8月5日
Dasha Metropolitansky, Microsoft

Dasha Metropolitansky, Research Data Scientist, Microsoft Research Special Projects, introduces VeriTrail, a new method for closed-domain hallucination detection in multi-step AI workflows. Unlike prior methods, VeriTrail provides traceability: it identifies where hallucinated content was likely introduced, and it establishes the provenance of faithful content by tracing a path to the source text. VeriTrail also outperforms baseline methods in hallucination detection. The combination of traceability and effective hallucination detection makes VeriTrail a powerful tool for auditing the integrity of content generated by language models.

- Dasha Metropolitansky
  
  Research Data Scientist
研究领域
- Artificial intelligence
- Human language technologies
研究院
- Microsoft Research Lab - New England
- Microsoft Research Lab - Redmond
组
- Microsoft Research Special Projects
论文与出版物
- VeriTrail: Closed-Domain Hallucination Detection with Traceability

接下来观看

AI for Africa’s Future: Innovation, Equity, and Impact
April 23, 2025
Millicent Ochieng,

John Wamburu,

Jacqueline Wang'ombe

等。
Magma: A foundation model for multimodal AI Agents
February 25, 2025
Jianwei Yang
Keynote: Multimodal Generative AI for Precision Health
February 25, 2025
Hoifung Poon
Claimify: Extracting high-quality claims from language model outputs
November 19, 2024
Dasha Metropolitansky
Fostering appropriate reliance on AI
September 3, 2024
Mihaela Vorvoreanu
Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
June 4, 2024
Daniela Massiceti
Panel: Generative AI for Global Impact: Challenges and Opportunities
June 4, 2024
Jacki O'Neill,

Tanuja Ganu,

Sunayana Sitaram

等。
Keynote: Building Globally Equitable AI
June 4, 2024
Jacki O'Neill
Making Sentence Embeddings Robust to User-Generated Content
May 29, 2024
Lydia Nishimwe
Panel: AI Frontiers
January 30, 2024
Ashley Llorens,

Sébastien Bubeck,

Ahmed Awadallah

等。