VeriTrail: Detect hallucination and trace provenance in AI workflows
- Dasha Metropolitansky, Microsoft
Dasha Metropolitansky, Research Data Scientist, Microsoft Research Special Projects, introduces VeriTrail, a new method for closed-domain hallucination detection in multi-step AI workflows. Unlike prior methods, VeriTrail provides traceability: it identifies where hallucinated content was likely introduced, and it establishes the provenance of faithful content by tracing a path to the source text. VeriTrail also outperforms baseline methods in hallucination detection. The combination of traceability and effective hallucination detection makes VeriTrail a powerful tool for auditing the integrity of content generated by language models.
-
-
Dasha Metropolitansky
Research Data Scientist
-
-
接下来观看
-
-
Magma: A foundation model for multimodal AI Agents
- Jianwei Yang
-
-
-
-
-
-
-
Making Sentence Embeddings Robust to User-Generated Content
- Lydia Nishimwe
-
Panel: AI Frontiers
- Ashley Llorens,
- Sébastien Bubeck,
- Ahmed Awadallah