Computerphile

Sleeper Agents in Large Language Models

2025 • E35    Sep 12, 2025    14m
It's an older paper, but it checks out. Rob Miles discusses the problem of 'Sleeper Agents' - where LLMs could have hidden traits we don't know about until it's too late.
