Podcast: DataFramed
Episode: #234 High Performance Generative AI Applications with Ram Sriharsha, CTO at Pinecone
Description: Perhaps the biggest complaint about generative AI is hallucination. If the text you want to generate involves facts, as with a chatbot that answers questions, then hallucination is a problem. One solution is a technique called retrieval augmented generation, where you store facts in a vector database and retrieve the most relevant ones to send to the large language model to help it give accurate responses. So, what goes into building vector databases, and how do they improve LLM performance so much? Ram Sriharsha is currently the CTO at Pinecone. Before...