InfoQ Homepage Machine Learning Content on InfoQ
-
Beyond Notebook: Building Observable Machine Learning Systems
In this article, the author discusses a machine learning pipeline with observability built-in for credit card fraud detection use case, with tools like MLflow, FastAPI, Streamlit, Apache Kafka, Prometheus, Grafana, and Evidently AI.
-
Secure AI-Powered Early Detection System for Medical Data Analysis & Diagnosis
In this article, the author discusses the techniques for securing AI applications in healthcare with an use case of early detection system for medical data analysis & diagnosis. The proposed layered architecture includes application components to support secure computation, ai modeling, governance and compliance, and monitoring and auditing.
-
Building Trust in AI: Security and Risks in Highly Regulated Industries
Explore the transformative power of responsible AI across industries, emphasizing security, MLOps, and compliance. As AI drives innovation—from predicting hurricanes to enhancing legal workflows—organizations must prioritize ethical practices, transparency, and robust governance to safeguard sensitive data while navigating an evolving regulatory landscape.
-
A Framework for Building Micro Metrics for LLM System Evaluation
LLM accuracy is a challenging topic to address and is much more multi-dimensional than a simple accuracy score. Denys Linkov introduces a framework for creating micro metrics to evaluate LLM systems, focusing on goal-aligned metrics that improve performance and reliability. By adopting an iterative "crawl, walk, run" methodology, teams can incrementally develop observability.
-
Reaching Your Automatic Testing Goals by Enhancing Your Test Architecture
If you have automatic end-to-end tests, you have test architecture, even if you’ve never given it a thought. Test architecture encompasses everything from code to more theoretical concerns like enterprise architecture, but with concrete, immediate consequences. Let's explore how you can achieve the goals you have for your automatic testing effort.
-
Efficient Resource Management with Small Language Models (SLMs) in Edge Computing
Small Language Models (SLMs) bring AI inference to the edge without overwhelming the resource-constrained devices. In this article, author Suruchi Shah dives into how SLMs can be used in edge computing applications for learning and adapting to patterns in real-time, reducing the computational burden and making edge devices smarter.
-
Article Series: Practical Applications of Generative AI
Generative AI (GenAI) has become a major component of the artificial intelligence (AI) and machine learning (ML) industry. However, using GenAI comes with challenges and risks. In the InfoQ "Practical Applications of Generative AI" article series, we present real-world solutions and hands-on practices from leading GenAI practitioners.
-
Llama 3 in Action: Deployment Strategies and Advanced Functionality for Real-World Applications
This article details the enhanced capabilities of the open-source Llama 3 LLM, and how businesses can adopt the model in their applications. The author gives step-by-step instructions for deploying Llama 3 in the cloud or on-premise, and how to leverage fine-tuned versions for specific tasks.
-
InfoQ AI, ML and Data Engineering Trends Report - September 2024
InfoQ editorial staff and friends of InfoQ are discussing the current trends in the domain of AI, ML and Data Engineering as part of the process of creating our annual trends report.
-
Adding a Natural Language Interface to Your Application
In this article, author Ashley Davis discusses how to add a natural language interface to a chatbot application using OpenAI REST API. He also shows how to extend the chatbot by adding voice commands using MediaRecorder API and OpenAI's speech transcription API.
-
Unpacking How Ad Ranking Works at Pinterest
Aayush Mudgal describes how Pinterest serves advertisements. He discussed in detail how Machine Learning is used to serve ads at large scale. He went over ads marketplaces and the ad delivery funnel, the ad serving architecture, and two of the main problems: ad retrieval and ranking. Finally, he discussed some of the challenges and solutions for training and serving large models.
-
Testing Machine Learning: Insight and Experience from Using Simulators to Test Trained Functionality
When testing machine learning systems, we must apply existing test processes and methods differently. Machine Learning applications consist of a few lines of code, with complex networks of weighted data points that form the implementation. The data used in training is where the functionality is ultimately defined, and that is where you will find your issues and bugs.