Lies, Damned Lies and Large Language Models
Jodie Burchell - yesterday
Would you like to use large language models (LLMs) in your own project, but are troubled by their tendency to frequently “hallucinate”, or produce incorrect information? Have you ever wondered if there was a way to easily measure an LLM’s hallucination rate, and compare this against other models? And would you like to learn how to help LLMs produce more accurate information? In this talk, we’ll have a look at some of the main reasons that hallucinations occur in LLMs, and then focus on how we can measure one specific type of hallucination: the tendency of models to regurgitate misinformation that they have learned from their training data. We’ll explore how we can easily measure this type of hallucination in LLMs using a dataset called TruthfulQA in conjunction with Python tooling including Hugging Face’s datasets, and LangChain. We’ll end by looking at initiatives to reduce hallucinations in LLMs, and how complex this can be.
Jobs with related skills
AI Architect & Consultant (m/f/d)
Riverty Group GmbH
·
9 days ago
Berlin, Germany
+4
Hybrid
(Senior) Cloud Data Engineer Supply Solutions (m/w/d)
msg
·
1 month ago
Frankfurt am Main, Germany
+8
Hybrid
Python Developer (x|f|m) - Hybrid
Sartorius
·
yesterday
Municipality of Madrid, Spain
Hybrid
Software Engineer (f/m/x)
Raiffeisen Bank International AG
·
8 days ago
Vienna, Austria
Hybrid
Related Videos