LanguaTalk

Podcast: Data Skeptic

Episode: [MINI] Multi-armed Bandit Problems

Description: The multi-armed bandit problem is named with reference to slot machines (one-armed bandits). Given the chance to play from a pool of slot machines, all with unknown payout frequencies, how can you maximize your reward? If you knew in advance which machine was best, you would play that machine exclusively. Any strategy that falls short of this will, on average, earn a smaller payout, and the difference is called the "regret". You can try each slot machine to learn about it, which we refer to as exploration. When you've spent enough time to be convinced you've identified the...
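
As a rough illustration of the ideas in the description, here is a minimal Python sketch of one simple strategy, often called epsilon-greedy. The strategy choice, the function name `run_epsilon_greedy`, the payout probabilities, and the `epsilon` value are all assumptions made for this example and are not taken from the episode. The sketch plays a pool of simulated slot machines, explores a random machine a small fraction of the time, otherwise plays the machine with the best observed win rate, and reports the regret relative to always playing the best machine.

```python
import random


def run_epsilon_greedy(payout_probs, n_pulls=10_000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy player on machines with hidden win rates.

    payout_probs: true win probability of each machine (unknown to the player).
    Returns (total_reward, regret, pulls_per_machine).
    """
    rng = random.Random(seed)
    n_arms = len(payout_probs)
    pulls = [0] * n_arms  # how often each machine has been played
    wins = [0] * n_arms   # how often each machine has paid out
    total_reward = 0

    for _ in range(n_pulls):
        # Exploration: pick a random machine with probability epsilon,
        # or whenever some machine has never been tried.
        if rng.random() < epsilon or 0 in pulls:
            arm = rng.randrange(n_arms)
        else:
            # Otherwise play the machine with the best observed win rate.
            arm = max(range(n_arms), key=lambda a: wins[a] / pulls[a])

        reward = 1 if rng.random() < payout_probs[arm] else 0
        pulls[arm] += 1
        wins[arm] += reward
        total_reward += reward

    # Regret: expected payout of always playing the best machine,
    # minus what this run actually earned.
    regret = max(payout_probs) * n_pulls - total_reward
    return total_reward, regret, pulls


if __name__ == "__main__":
    reward, regret, pulls = run_epsilon_greedy([0.2, 0.5, 0.7])
    print("reward:", reward, "regret:", regret, "pulls:", pulls)
```

Because the regret here compares one simulated run against an expected value, it varies with the seed and can occasionally be negative on a lucky run. Raising `epsilon` spends more plays on exploration, while lowering it commits sooner to whichever machine currently looks best.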
