Large language models Content on InfoQ

Presentations

AI, ML & Data Engineering

Navigating LLM Deployment: Tips, Tricks, and Techniques

Meryem Arik discusses best practices in model optimization, serving, and monitoring, with practical tips and real case studies.

Meryem Arik
on Nov 19, 2024

44:21
AI, ML & Data Engineering

Manipulating the Machine: Prompt Injections and Countermeasures

Georg Dresler discusses various methods to perform prompt injection to extract system prompts and documents used by GPTs, and ways to integrate countermeasures to protect against stealing information.

Georg Dresler
on Nov 01, 2024

36:42
AI, ML & Data Engineering

Poetry4Shellz – Avoiding Limerick Based Exploitation and Safely Using AI in Your Apps

Rich Smith provides a case study of a real-world LLM-based app that is vulnerable to a variety of attack vectors, illustrating the challenges to account for when integrating today's LLM technologies.

Rich Smith
on Oct 31, 2024

50:23
AI, ML & Data Engineering

Mind Your Language Models: an Approach to Architecting Intelligent Systems

Nischal HP discusses the intricacies of designing and implementing intelligent systems powered by LLMs, drawing upon practical insights gained from real-world deployments.

Nischal HP
on Oct 15, 2024

50:13
AI, ML & Data Engineering

Generative Search: Practical Advice for Retrieval Augmented Generation (RAG)

Sam Partee discusses vector embeddings in LLMs, a tool capable of capturing the essence of unstructured data, used by LLMs to gain access to a wealth of contextually relevant knowledge.

Sam Partee
on Jul 05, 2024

49:38
AI, ML & Data Engineering

Defensible Moats: Unlocking Enterprise Value with Large Language Models

Nischal HP discusses risk mitigation, implementation of environmental, social, and governance (ESG) frameworks to achieve sustainability goals, strategic procurement, spend analytics, and data compliance.

Nischal HP
on Jun 28, 2024

53:55
AI, ML & Data Engineering

When AIOps Meets MLOps: What it Takes to Deploy ML Models at Scale

Ghida Ibrahim introduces the concept of AIOps, which refers to using AI and data-driven tooling to provision, manage, and scale distributed IT infrastructure.

Ghida Ibrahim
on Jun 27, 2024

39:42
AI, ML & Data Engineering

Reach Next-Level Autonomy with LLM-Based AI Agents

Tingyi Li discusses the AI Agent, exploring how it extends the frontiers of Generative AI applications and leads to next-level autonomy in combination with enterprise data.

Tingyi Li
on Jun 20, 2024

46:47
AI, ML & Data Engineering

Retrieval-Augmented Generation (RAG) Patterns and Best Practices

Jay Alammar discusses the common schematics of RAG systems and tips on how to improve them.

Jay Alammar
on May 30, 2024

45:15
AI, ML & Data Engineering

Large Language Models for Code: Exploring the Landscape, Opportunities, and Challenges

Loubna Ben Allal discusses Large Language Models (LLMs), exploring the current developments of these models, how they are trained, and how they can be leveraged with custom codebases.

Loubna Ben Allal
on May 23, 2024

49:56
AI, ML & Data Engineering

A Bicycle for the (AI) Mind: GPT-4 + Tools

Sherwin Wu and Atty Eleti discuss how to use the OpenAI API to integrate large language models into your application, and extend GPT’s capabilities by connecting it to the external world via APIs.

Sherwin Wu, Atty Eleti
on Aug 08, 2023

46:52
