InfoQ Homepage Machine Learning Content on InfoQ
-
Microsoft Launches Azure AI Inference SDK for .NET
Microsoft launched Azure AI Inference SDK for .NET, streamlining access to generative AI models in the Azure AI Studio model catalog. This catalog includes models from providers like Azure OpenAI Service, Mistral, Meta, Cohere, NVIDIA, and Hugging Face, organized into three collections: Curated by Azure AI, Azure OpenAI Models, and Open Models from Hugging Face Hub.
-
AWS Announces General Availability of EC2 P5e Instances, Powered by NVIDIA H100 Tensor Core GPUs
Amazon Web Services (AWS) has launched EC2 P5e instances featuring NVIDIA H100 Tensor Core GPUs, substantially boosting AI and HPC performance. With enhanced memory bandwidth, these instances reduce latency for real-time applications. Ideal for tasks like LLM training and simulations, they offer improved scalability and cost-efficiency, making them pivotal for modern cloud computing.
-
Leveraging the Transformer Architecture for Music Recommendation on YouTube
Google has described an approach to use transformer models, which ignited the current generative AI boom, for music recommendation. This approach, which is currently being applied experimentally on YouTube, aims to build a recommender that can understand sequences of user actions when listening to music to better predict user preferences based on their context.
-
Pinterest Modernises Machine Learning Infrastructure with Ray
Pinterest, the visual discovery platform, has revealed details about its journey to modernise its machine learning infrastructure using Ray, an open-source distributed computing framework. In a recent blog post, the company shared insights into the challenges faced and solutions implemented as they integrated Ray into their large-scale production environment.
-
Meta Releases Llama 3.1 405B, Largest Open-Source Model to Date
Meta recently unveiled its latest language model, Llama 3.1 405B. This AI model is the largest of the new Llama models, which also include 8B and 70B versions. With 405 billion parameters, 15 trillion tokens, and 16,000 GPUs, Llama 3.1 405B offers a range of impressive features.
-
AWS Introduces Amazon Q Developer in SageMaker Studio to Streamline ML Workflows
AWS announced that Amazon SageMaker Studio now includes Amazon Q Developer as a new capability. This generative AI-powered assistant is built natively into SageMaker’s JupyterLab experience and provides recommendations for the best tools for each task, step-by-step guidance, code generation, and troubleshooting assistance.
-
Amazon SageMaker Now Offers Managed MLflow Capability for Enhanced Experiment Tracking
AWS has announced the general availability of MLflow capability in Amazon SageMaker. MLflow is an open-source tool commonly used for managing ML experiments. Users can now compare model performance, parameters, and metrics across experiments in the MLflow UI, keep track of their best models in the MLflow Model Registry, and automatically register them as a SageMaker model.
-
Apple WWDC: iOS18 and Apple Intelligence Announcements
At WWDC 2024 Apple unveiled "Apple Intelligence," a suite of AI features coming to iOS 18, iPadOS 18, and macOS Sequoia. Apple’s aim with Apple Intelligence is to seamlessly integrate AI into the core of the iPhone, iPad, and Mac experience.
-
AI and Software Development: Preview of Sessions at InfoQ Events
Explore the transformative impact of AI on software development at InfoQ's upcoming events. Senior software developers will share practical applications and ethical considerations of AI technology through technical talks.
-
AWS Introduces Amazon Bedrock Studio for Building Generative AI Applications
AWS has recently announced Amazon Bedrock Studio, a web interface for developers to collaborate and build generative AI applications. Currently in public preview, the rapid prototyping environment provides access to multiple foundation models, knowledge bases, agents, and guardrails.
-
Enhanced Security for Enterprises: Google Launches Google Threat Intelligence
At the recent RSA Conference in San Francisco, Google Cloud introduced Google Threat Intelligence, a new security offering for large organizations. The new solution provides users with actionable insights, external threat monitoring, attack surface management, digital risk protection, and in-depth analysis of Indicators of Compromise (IOC).
-
Hugging Face Unveils LeRobot, an Open-Source Machine Learning Model for Robotics
Hugging Face has unveiled LeRobot, a new machine learning model trained for real-world robotics applications. LeRobot functions as a platform, offering a versatile library for data sharing, visualization, and training of advanced models.
-
Amazon Q Business and Amazon Q Developer Now Generally Available
AWS has recently announced the general availability of Amazon Q a generative AI-powered assistant tailored for businesses and developers. Amazon Q Developer provides code suggestions and recommendations in real time, while Amazon Q Business enables companies to get insights from structured and unstructured data.
-
Modern Data Architecture, ML, and Resilience Topics Announced for QCon San Francisco 2024
QCon San Francisco returns November 18-22, focusing on innovations and emerging trends you should pay attention to in 2024. With technical talks from international software practitioners, QCon will provide actionable insights and skills you can take back to your teams.
-
People, Planet, Cloud and AI: Key Takeaways from QCon London
This year’s QCon London brought a wealth of talks directly or indirectly related to software architecture, ranging from the rise of AI to more established areas like anything cloud-related to the usual classics like architecture quality traits . The conference also featured many talks about sociotechnical aspects of software architecture and engineering and broadly considered sustainability.