Arxiv Papers
Podcast image
[QA] Llama-Nemotron: Efficient Reasoning Models
7 mins; May 04, 2025
Llama-Nemotron: Efficient Reasoning Models
26 mins; May 04, 2025
[QA] Evaluating Frontier Models for Stealth and Situational Awareness
7 mins; May 04, 2025
Evaluating Frontier Models for Stealth and Situational Awareness
37 mins; May 04, 2025
[QA] Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report
7 mins; May 03, 2025
Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report
17 mins; May 03, 2025
[QA] COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning
8 mins; May 03, 2025
COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning
16 mins; May 03, 2025
[QA] DeepCritic: Deliberate Critique with Large Language Models
7 mins; May 03, 2025
DeepCritic: Deliberate Critique with Large Language Models
17 mins; May 03, 2025
[QA] Direct Motion Models for Assessing Generated Videos
7 mins; May 03, 2025
Direct Motion Models for Assessing Generated Videos
17 mins; May 03, 2025
[QA] MINERVA: Evaluating Complex Video Reasoning
7 mins; May 01, 2025
MINERVA: Evaluating Complex Video Reasoning
20 mins; May 01, 2025
[QA] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
7 mins; May 01, 2025
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
22 mins; May 01, 2025
[QA] The Leaderboard Illusion
7 mins; May 01, 2025
The Leaderboard Illusion
27 mins; May 01, 2025
[QA] Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
7 mins; May 01, 2025
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
17 mins; May 01, 2025
[QA] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
9 mins; April 30, 2025
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
29 mins; April 30, 2025
[QA] ReasonIR: Training Retrievers for Reasoning Tasks
8 mins; April 30, 2025
ReasonIR: Training Retrievers for Reasoning Tasks
24 mins; April 30, 2025
[QA] Scaling Laws For Scalable Oversight
8 mins; April 27, 2025
Scaling Laws For Scalable Oversight
27 mins; April 27, 2025
[QA] Think, Prune, Train, Improve: Scaling Reasoning Without Scaling Models
7 mins; April 27, 2025
Think, Prune, Train, Improve: Scaling Reasoning Without Scaling Models
16 mins; April 27, 2025
[QA] Reasoning LLMs Are Just Efficient Samplers: RL Training Elicits No Transcending Capacity
8 mins; April 26, 2025
Reasoning LLMs Are Just Efficient Samplers: RL Training Elicits No Transcending Capacity
23 mins; April 26, 2025
[QA] Learning Adaptive Parallel Reasoning with Language Models
7 mins; April 26, 2025
Learning Adaptive Parallel Reasoning with Language Models
21 mins; April 26, 2025
[QA] Boosting Generative Image Modeling via Joint Image-Feature Synthesis
7 mins; April 26, 2025
Boosting Generative Image Modeling via Joint Image-Feature Synthesis
18 mins; April 26, 2025
[QA] Step1X-Edit: A Practical Framework for General Image Editing
8 mins; April 25, 2025
Step1X-Edit: A Practical Framework for General Image Editing
15 mins; April 25, 2025
[QA] Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models
7 mins; April 25, 2025
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models
24 mins; April 25, 2025
[QA] Exploring How LLMs Capture and Represent Domain-Specific Knowledge
7 mins; April 23, 2025
Exploring How LLMs Capture and Represent Domain-Specific Knowledge
19 mins; April 23, 2025
[QA] I-Con: A Unifying Framework for Representation Learning
7 mins; April 23, 2025
I-Con: A Unifying Framework for Representation Learning
16 mins; April 23, 2025
[QA] Tina: Tiny Reasoning Models via LoRA
7 mins; April 22, 2025
Tina: Tiny Reasoning Models via LoRA
17 mins; April 22, 2025
[QA] LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
8 mins; April 22, 2025
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
15 mins; April 22, 2025
[QA] UFO2: The Desktop AgentOS
8 mins; April 22, 2025
UFO2: The Desktop AgentOS
57 mins; April 22, 2025
[QA] NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning
8 mins; April 21, 2025
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning
31 mins; April 21, 2025
[QA] Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning
7 mins; April 20, 2025
Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning
7 mins; April 20, 2025
[QA] Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model
7 mins; April 20, 2025
Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model
16 mins; April 20, 2025
[QA] Reasoning Models Can Be Effective Without Thinking
7 mins; April 19, 2025
Reasoning Models Can Be Effective Without Thinking
20 mins; April 19, 2025
[QA] A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
8 mins; April 19, 2025
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
14 mins; April 19, 2025
[QA] CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
7 mins; April 18, 2025
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
20 mins; April 18, 2025
[QA] Antidistillation Sampling
7 mins; April 18, 2025
Antidistillation Sampling
10 mins; April 18, 2025
[QA] Position: The Most Expensive Part of an LLM should be its Training Data
7 mins; April 17, 2025
Position: The Most Expensive Part of an LLM should be its Training Data
20 mins; April 17, 2025
[QA] Activated LoRA: Fine-tuned LLMs for Intrinsics
8 mins; April 17, 2025
Activated LoRA: Fine-tuned LLMs for Intrinsics
18 mins; April 17, 2025
[QA] COLORBENCH: Can VLMs See and Understand the Colorful World?
7 mins; April 17, 2025
COLORBENCH: Can VLMs See and Understand the Colorful World?
20 mins; April 17, 2025
[QA] ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
8 mins; April 17, 2025
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
14 mins; April 17, 2025
[QA] Looking beyond the next token
7 mins; April 16, 2025
Looking beyond the next token
16 mins; April 16, 2025
[QA] How to Predict Best Pretraining Data with Small Experiments
8 mins; April 16, 2025
How to Predict Best Pretraining Data with Small Experiments
20 mins; April 16, 2025
[QA] Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability
7 mins; April 14, 2025
Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability
7 mins; April 14, 2025
[QA] DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training
7 mins; April 14, 2025
DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training
10 mins; April 14, 2025
[QA] Steering CLIP's vision transformer with sparse autoencoders
8 mins; April 14, 2025
Steering CLIP's vision transformer with sparse autoencoders
17 mins; April 14, 2025
[QA] Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
7 mins; April 14, 2025
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
18 mins; April 14, 2025
[QA] Rethinking Reflection in Pre-Training
8 mins; April 12, 2025
Rethinking Reflection in Pre-Training
17 mins; April 12, 2025
[QA] Self-Steering Language Models
7 mins; April 12, 2025
Self-Steering Language Models
8 mins; April 12, 2025
[QA] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
7 mins; April 11, 2025
Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
16 mins; April 11, 2025
DDT: Decoupled Diffusion Transformer
8 mins; April 11, 2025
DDT: Decoupled Diffusion Transformer
19 mins; April 11, 2025
[QA] Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory
7 mins; April 11, 2025
Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory
15 mins; April 11, 2025
[QA] Scaling Laws for Native Multimodal Models
7 mins; April 11, 2025
Scaling Laws for Native Multimodal Models
18 mins; April 11, 2025
[QA] OLMOTRACE: Tracing Language Model Outputs Back to Trillions of Training Tokens
7 mins; April 09, 2025
OLMOTRACE: Tracing Language Model Outputs Back to Trillions of Training Tokens
18 mins; April 09, 2025
[QA] Wanting to be Understood
7 mins; April 09, 2025
Wanting to be Understood
16 mins; April 09, 2025
[QA] A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
7 mins; April 09, 2025
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
19 mins; April 09, 2025