Hugging Face Content on InfoQ
Podcasts
Apoorva Joshi on LLM Application Evaluation and Performance Improvements
In this podcast, Apoorva Joshi, senior AI developer advocate at MongoDB, discusses how to evaluate software applications that use large language models (LLMs) and how to improve the performance of LLM-based applications.
Meryem Arik on LLM Deployment, State-of-the-Art RAG Apps, and Inference Architecture Stack
In this podcast, Meryem Arik, co-founder and CEO at TitanML, discusses innovations in generative AI and large language model (LLM) technologies, including the current state of large language models, LLM deployment, state-of-the-art retrieval-augmented generation (RAG) apps, and the inference architecture stack for LLM applications.