Best AI papers explained
A podcast by Enoch H. Kang
518 Episodes
-  Learning to summarize user information for personalized reinforcement learning from human feedbackPublished: 4/10/2025
-  Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHFPublished: 3/10/2025
-  LIMI: Less is More for AgencyPublished: 1/10/2025
-  LoRA Without RegretPublished: 1/10/2025
-  Actor-Critic without Actor: Critic-Guided Denoising for RLPublished: 29/09/2025
-  DELTA-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs?Published: 29/09/2025
-  Linear Transformers Implicitly Discover Unified Numerical AlgorithmsPublished: 29/09/2025
-  Regularizing Extrapolation in Causal InferencePublished: 27/09/2025
-  DoubleGen - Debiased Generative Modeling of CounterfactualsPublished: 27/09/2025
-  What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoTPublished: 27/09/2025
-  Compute as Teacher: Turning Inference Compute Into Reference-Free SupervisionPublished: 27/09/2025
-  Learning without training: The implicit dynamics of in-context learningPublished: 24/09/2025
-  Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base ModelPublished: 24/09/2025
-  Open Problems in Mechanistic InterpretabilityPublished: 21/09/2025
-  Maestro: Joint Graph & Config Optimization for Reliable AI AgentsPublished: 21/09/2025
-  Thought Anchors: Which LLM Reasoning Steps Matter?Published: 21/09/2025
-  Sample Complexity and Representation Ability of Test-time Scaling ParadigmsPublished: 9/09/2025
-  RL's Razor: Why Online RL Forgets LessPublished: 7/09/2025
-  Why Language Models HallucinatePublished: 6/09/2025
-  ALFA: Aligning LLMs to Ask Good Questions A Case Study in Clinical ReasoningPublished: 6/09/2025
Cut through the noise. We curate and break down the most important AI papers so you don’t have to.
