Large language models Content on InfoQ
-
Meryem Arik on LLM Deployment, State-of-the-Art RAG Apps, and Inference Architecture Stack
In this podcast, Meryem Arik, co-founder/CEO at TitanML, discusses innovations in Generative AI and Large Language Model (LLM) technologies, including the current state of large language models, LLM deployment, state-of-the-art Retrieval Augmented Generation (RAG) apps, and the inference architecture stack for LLM applications.
-
Edo Liberty on Vector Databases for Successful Adoption of Generative AI and LLM based Applications
In this podcast, Edo Liberty, founder and CEO at Pinecone, discusses the importance of vector databases in the successful adoption of Generative AI and LLM-based applications, and how vector databases differ from traditional data stores.
-
If LLMs Do the Easy Programming Tasks - How are Junior Developers Trained? What Have We Done?
In this podcast, Michael Stiefel spoke to Anthony Alford and Roland Meertens about the future of software development and the training of new developers in a world where Large Language Models contribute heavily to software development.
-
InfoQ Architecture and Design Trends in 2024
The panel discussion in this episode covers the annual InfoQ Architecture and Design Trends Report. InfoQ trends reports give readers a high-level overview of the topics to pay attention to this year, and also help the editorial team focus on innovative technologies across all the content on InfoQ.
-
Sam Partee on Retrieval Augmented Generation (RAG)
In this podcast, Sam Partee shares his insights on Redis' vector database offering, different approaches to embeddings, how to enhance large language models by adding a search component for retrieval augmented generation, and the use of hybrid search in Redis.