Best AI papers explained

A podcast by Enoch H. Kang

550 Episodes

Score Matching Enables Causal Discovery of Nonlinear Additive Noise Models
Published: 27/05/2025
Improved Techniques for Training Score-Based Generative Models
Published: 27/05/2025
Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator
Published: 27/05/2025
AlphaEvolve: A coding agent for scientific and algorithmic discovery
Published: 27/05/2025
Harnessing the Universal Geometry of Embeddings
Published: 27/05/2025
Goal Inference using Reward-Producing Programs in a Novel Physics Environment
Published: 27/05/2025
Trial-Error-Explain In-Context Learning for Personalized Text Generation
Published: 27/05/2025
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Published: 27/05/2025
Test-Time Reinforcement Learning (TTRL)
Published: 27/05/2025
Interpreting Emergent Planning in Model-Free Reinforcement Learning
Published: 26/05/2025
Agentic Reward Modeling_Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
Published: 26/05/2025
Beyond Reward Hacking: Causal Rewards for Large LanguageModel Alignment
Published: 26/05/2025
Learning How Hard to Think: Input-Adaptive Allocation of LM Computation
Published: 26/05/2025
Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval
Published: 26/05/2025
UFT: Unifying Supervised and Reinforcement Fine-Tuning
Published: 26/05/2025
Understanding High-Dimensional Bayesian Optimization
Published: 26/05/2025
Inference time alignment in continuous space
Published: 25/05/2025
Efficient Test-Time Scaling via Self-Calibration
Published: 25/05/2025
Conformal Prediction via Bayesian Quadrature
Published: 25/05/2025
Predicting from Strings: Language Model Embeddings for Bayesian Optimization
Published: 25/05/2025

15 / 28

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site

550 Episodes

Score Matching Enables Causal Discovery of Nonlinear Additive Noise Models

Improved Techniques for Training Score-Based Generative Models

Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator

AlphaEvolve: A coding agent for scientific and algorithmic discovery

Harnessing the Universal Geometry of Embeddings

Goal Inference using Reward-Producing Programs in a Novel Physics Environment

Trial-Error-Explain In-Context Learning for Personalized Text Generation

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Test-Time Reinforcement Learning (TTRL)

Interpreting Emergent Planning in Model-Free Reinforcement Learning

Agentic Reward Modeling_Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Beyond Reward Hacking: Causal Rewards for Large LanguageModel Alignment

Learning How Hard to Think: Input-Adaptive Allocation of LM Computation

Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval

UFT: Unifying Supervised and Reinforcement Fine-Tuning

Understanding High-Dimensional Bayesian Optimization

Inference time alignment in continuous space

Efficient Test-Time Scaling via Self-Calibration

Conformal Prediction via Bayesian Quadrature

Predicting from Strings: Language Model Embeddings for Bayesian Optimization