Podcast: Machine Learning Street Talk (MLST)
Episode: OpenAI GPT-3: Language Models are Few-Shot Learners
Description: In this episode of Machine Learning Street Talk, Tim Scarfe, Yannic Kilcher and Connor Shorten discuss their takeaways from OpenAI’s GPT-3 language model. With the help of Microsoft’s ZeRO-2 / DeepSpeed optimiser, OpenAI trained a 175-billion-parameter autoregressive language model. The paper demonstrates how self-supervised language modelling at this scale can perform many downstream tasks without fine-tuning.
00:00:00 Intro
00:00:54 ZeRO 1+2 (model + data parallelism) (Connor)
00:03:17 Recent history of NLP (Tim)
00:06:04 Yannic "Light-speed" Kilcher's brief overview of GPT-3
00:14:25 Reviewing Yannic's YT comments on his GPT-3 video (Tim)
00...