InfoQ Software Architects' Newsletter

A monthly overview of things you need to know as an architect or aspiring architect.

Enter your e-mail address

Select your country

We protect your privacy.

InfoQ Homepage Machine Learning Content on InfoQ

News

RSS Feed

Newer Older

Cloud

Microsoft Launches Azure AI Inference SDK for .NET

Microsoft launched Azure AI Inference SDK for .NET, streamlining access to generative AI models in the Azure AI Studio model catalog. This catalog includes models from providers like Azure OpenAI Service, Mistral, Meta, Cohere, NVIDIA, and Hugging Face, organized into three collections: Curated by Azure AI, Azure OpenAI Models, and Open Models from Hugging Face Hub.

Robert Krzaczyński
on Sep 24, 2024
Cloud

AWS Announces General Availability of EC2 P5e Instances, Powered by NVIDIA H100 Tensor Core GPUs

Amazon Web Services (AWS) has launched EC2 P5e instances featuring NVIDIA H100 Tensor Core GPUs, substantially boosting AI and HPC performance. With enhanced memory bandwidth, these instances reduce latency for real-time applications. Ideal for tasks like LLM training and simulations, they offer improved scalability and cost-efficiency, making them pivotal for modern cloud computing.

Steef-Jan Wiggers
on Sep 18, 2024
AI, ML & Data Engineering

Leveraging the Transformer Architecture for Music Recommendation on YouTube

Google has described an approach to use transformer models, which ignited the current generative AI boom, for music recommendation. This approach, which is currently being applied experimentally on YouTube, aims to build a recommender that can understand sequences of user actions when listening to music to better predict user preferences based on their context.

Sergio De Simone
on Sep 06, 2024
DevOps

Pinterest Modernises Machine Learning Infrastructure with Ray

Pinterest, the visual discovery platform, has revealed details about its journey to modernise its machine learning infrastructure using Ray, an open-source distributed computing framework. In a recent blog post, the company shared insights into the challenges faced and solutions implemented as they integrated Ray into their large-scale production environment.

Matt Saunders
on Aug 19, 2024
AI, ML & Data Engineering

Meta Releases Llama 3.1 405B, Largest Open-Source Model to Date

Meta recently unveiled its latest language model, Llama 3.1 405B. This AI model is the largest of the new Llama models, which also include 8B and 70B versions. With 405 billion parameters, 15 trillion tokens, and 16,000 GPUs, Llama 3.1 405B offers a range of impressive features.

Andrew Hoblitzell
on Jul 31, 2024
AI, ML & Data Engineering

AWS Introduces Amazon Q Developer in SageMaker Studio to Streamline ML Workflows

AWS announced that Amazon SageMaker Studio now includes Amazon Q Developer as a new capability. This generative AI-powered assistant is built natively into SageMaker’s JupyterLab experience and provides recommendations for the best tools for each task, step-by-step guidance, code generation, and troubleshooting assistance.

Daniel Dominguez
on Jul 24, 2024
AI, ML & Data Engineering

Amazon SageMaker Now Offers Managed MLflow Capability for Enhanced Experiment Tracking

AWS has announced the general availability of MLflow capability in Amazon SageMaker. MLflow is an open-source tool commonly used for managing ML experiments. Users can now compare model performance, parameters, and metrics across experiments in the MLflow UI, keep track of their best models in the MLflow Model Registry, and automatically register them as a SageMaker model.

Daniel Dominguez
on Jul 12, 2024
AI, ML & Data Engineering

Apple WWDC: iOS18 and Apple Intelligence Announcements

At WWDC 2024 Apple unveiled "Apple Intelligence," a suite of AI features coming to iOS 18, iPadOS 18, and macOS Sequoia. Apple’s aim with Apple Intelligence is to seamlessly integrate AI into the core of the iPhone, iPad, and Mac experience.

Andrew Hoblitzell
on Jun 16, 2024
AI, ML & Data Engineering

AI and Software Development: Preview of Sessions at InfoQ Events

Explore the transformative impact of AI on software development at InfoQ's upcoming events. Senior software developers will share practical applications and ethical considerations of AI technology through technical talks.

Ian Robins
on Jun 07, 2024
AI, ML & Data Engineering

AWS Introduces Amazon Bedrock Studio for Building Generative AI Applications

AWS has recently announced Amazon Bedrock Studio, a web interface for developers to collaborate and build generative AI applications. Currently in public preview, the rapid prototyping environment provides access to multiple foundation models, knowledge bases, agents, and guardrails.

Renato Losio
on May 24, 2024
Cloud

Enhanced Security for Enterprises: Google Launches Google Threat Intelligence

At the recent RSA Conference in San Francisco, Google Cloud introduced Google Threat Intelligence, a new security offering for large organizations. The new solution provides users with actionable insights, external threat monitoring, attack surface management, digital risk protection, and in-depth analysis of Indicators of Compromise (IOC).

Renato Losio
on May 19, 2024
AI, ML & Data Engineering

Hugging Face Unveils LeRobot, an Open-Source Machine Learning Model for Robotics

Hugging Face has unveiled LeRobot, a new machine learning model trained for real-world robotics applications. LeRobot functions as a platform, offering a versatile library for data sharing, visualization, and training of advanced models.

Daniel Dominguez
on May 16, 2024
Cloud

Amazon Q Business and Amazon Q Developer Now Generally Available

AWS has recently announced the general availability of Amazon Q a generative AI-powered assistant tailored for businesses and developers. Amazon Q Developer provides code suggestions and recommendations in real time, while Amazon Q Business enables companies to get insights from structured and unstructured data.

Renato Losio
on May 11, 2024
AI, ML & Data Engineering

Modern Data Architecture, ML, and Resilience Topics Announced for QCon San Francisco 2024

QCon San Francisco returns November 18-22, focusing on innovations and emerging trends you should pay attention to in 2024. With technical talks from international software practitioners, QCon will provide actionable insights and skills you can take back to your teams.

Artenisa Chatziou
on May 10, 2024
Architecture & Design

People, Planet, Cloud and AI: Key Takeaways from QCon London

This year’s QCon London brought a wealth of talks directly or indirectly related to software architecture, ranging from the rise of AI to more established areas like anything cloud-related to the usual classics like architecture quality traits . The conference also featured many talks about sociotechnical aspects of software architecture and engineering and broadly considered sustainability.

Rafal Gancarz
on May 10, 2024

Newer News

Older News

InfoQ Software Architects' Newsletter

Login with:

Don't have an InfoQ account?

News