Home / Series / Computerphile / Aired Order / Season 2025 / Episode 49

Constraining AI Agents

As AI systems become more capable, rule-based safeguards, hard-coded restrictions, and simple alignment strategies start to break down. Buck Shlegeris talks about some tactics we might use as detailed in a recent paper.

English
  • Originally Aired December 4, 2025
  • Runtime 21 minutes
  • Production Code JAcwtV_bFp4
  • Network YouTube
  • On Other Sites Official Website
  • Created December 5, 2025 by
    shriek
  • Modified December 5, 2025 by
    shriek