Computerphile

Ai Will Try to Cheat & Escape (aka Rob Miles was Right!)

2025 • E12    Apr 2, 2025    20m
As Large Language Models improve, the tokens they predict form ever more complicated and nuanced outcomes. Rob Miles and Ryan Greenblatt discuss "Alignment Faking" a paper Ryan's team created - ideas about which Rob made a series of videos on Computerphile in 2017.

Where to Watch Computerphile - 2025 • E12

 

  •   
  •   
  •   
  •   
  •   
  •   
  •   

Take Plex everywhere

Watch free anytime, anywhere, on almost any device.
See the full list of supported devices