Find Movies & TV

Computerphile

Ai Will Try to Cheat & Escape (aka Rob Miles was Right!)

2025 • E12 Apr 2, 2025 20m

As Large Language Models improve, the tokens they predict form ever more complicated and nuanced outcomes. Rob Miles and Ryan Greenblatt discuss "Alignment Faking" a paper Ryan's team created - ideas about which Rob made a series of videos on Computerphile in 2017.

Where to Watch Computerphile - 2025 • E12

Take Plex everywhere

Watch free anytime, anywhere, on almost any device.

See the full list of supported devices

Explore

Categories

Explore

Featured Channels

Categories

Computerphile

Ai Will Try to Cheat & Escape (aka Rob Miles was Right!)

Where to Watch Computerphile - 2025 • E12

Take Plex everywhere

Company

Go Premium

Downloads

Support

Watch Free