54 Episodes

  1. Owain Evans - AI Situational Awareness, Out-of-Context Reasoning

    Published: 23/08/2024
  2. [Crosspost] Adam Gleave on Vulnerabilities in GPT-4 APIs (+ extra Nathan Labenz interview)

    Published: 17/05/2024
  3. Ethan Perez on Selecting Alignment Research Projects (ft. Mikita Balesni & Henry Sleight)

    Published: 9/04/2024
  4. Emil Wallner on Sora, Generative AI Startups and AI optimism

    Published: 20/02/2024
  5. Evan Hubinger on Sleeper Agents, Deception and Responsible Scaling Policies

    Published: 12/02/2024
  6. [Jan 2023] Jeffrey Ladish on AI Augmented Cyberwarfare and compute monitoring

    Published: 27/01/2024
  7. Holly Elmore on pausing AI

    Published: 22/01/2024
  8. Podcast Retrospective and Next Steps

    Published: 9/01/2024
  9. Kellin Pelrine on beating the strongest go AI

    Published: 4/10/2023
  10. Paul Christiano's views on "doom" (ft. Robert Miles)

    Published: 29/09/2023
  11. Neel Nanda on mechanistic interpretability, superposition and grokking

    Published: 21/09/2023
  12. Joscha Bach on how to stop worrying and love AI

    Published: 8/09/2023
  13. Erik Jones on Automatically Auditing Large Language Models

    Published: 11/08/2023
  14. Dylan Patel on the GPU Shortage, Nvidia and the Deep Learning Supply Chain

    Published: 9/08/2023
  15. Tony Wang on Beating Superhuman Go AIs with Advesarial Policies

    Published: 4/08/2023
  16. David Bau on Editing Facts in GPT, AI Safety and Interpretability

    Published: 1/08/2023
  17. Alexander Pan on the MACHIAVELLI benchmark

    Published: 26/07/2023
  18. Vincent Weisser on Funding AI Alignment Research

    Published: 24/07/2023
  19. [JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment

    Published: 19/07/2023
  20. Nina Rimsky on AI Deception and Mesa-optimisation

    Published: 18/07/2023

1 / 3

The goal of this podcast is to create a place where people discuss their inside views about existential risk from AI.

Visit the podcast's native language site