InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
LLM and Generative AI for Sensitive Data - Navigating Security, Responsibility, and Pitfalls in Highly Regulated Industries
Stefania Chaplin and Azhir Mahmood discuss responsible, secure, and explainable AI in regulated industries. Learn MLOps, legislation, and future trends.
-
Responsible AI for FinTech
Lexy Kassan discusses responsible AI: regulation (EU AI Act, FinTech), ethical principles, governance, and FinTech's disruptive response.
-
Unleashing Llama's Potential: CPU-Based Fine-Tuning
Anil Rajput and Rema Hariharan detail CPU-based LLM (Llama) optimization strategies for performance and TCO reduction.
-
Rockset - Building a Modern Analytics Database on Top of RocksDB
Igor Canadi discusses building a real-time search analytics database on RocksDB, covering cloud-native design, replication, shared storage, and analytics.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik shares best practices for self-hosting LLMs in corporate environments, highlighting the importance of cost efficiency and performance optimization.
-
AI in the Age of Climate Change
Nischal HP shares insights on building a data-driven economy to incentivize sustainable farming and reduce carbon emissions.
-
How GitHub Copilot Serves 400 Million Completion Requests a Day
David Cheney explains the architecture powering GitHub Copilot, detailing how they achieve sub-200ms response times for millions of daily requests.
-
The Harsh Reality of Building a Real-Time ML Feature Platform
Ivan Burmistrov shares how ShareChat built their own Real-Time Feature Platform serving more than 1 billion features per second, and how they managed to make it cost efficient.
-
Recommender and Search Ranking Systems in Large Scale Real World Applications
Moumita Bhattacharya overviews the industry search and recommendations systems, goes into modeling choices, data requirements and infrastructural requirements, while highlighting challenges.
-
Powering User Experiences with Streaming Dataflow
Alana Marzoev discusses the fundamentals of streaming dataflow and the architecture of ReadySet, a streaming dataflow system designed specifically for operational workloads.
-
Pioneering the Future: Advancing Infrastructure for AI Agents
AI agents, powered by RAG and vector databases, will anticipate needs, automate workflows, and supervise agents. This talk explores infrastructure, security, and impact to help enterprises harness AI.
-
Elevate Developer Experience with Generative AI Capabilities on AWS
Olalekan Elesin discusses how generative AI tools can improve productivity, streamline workflows, and foster a more efficient and effective development environment.