Want to create an interactive transcript for this episode?
Podcast: Arxiv Papers
Episode: [QA] LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities