Benchmark Content on InfoQ

News

AI, ML & Data Engineering

Google Introduces Gemini 2.5 Pro with Improved Reasoning and Coding Capabilities

Google has released Gemini 2.5 Pro, an updated AI model focused on enhanced reasoning, code generation, and multimodal processing. The model is ranked first on LMArena, a benchmark for human preference in AI responses, and achieves strong results in math, science, and logic-based tasks. It also features a 1 million token context window, with plans to expand to 2 million.

Robert Krzaczyński
on Mar 28, 2025
AI, ML & Data Engineering

Google DeepMind Enhances AMIE for Long-Term Disease Management

Google DeepMind has extended the capabilities of its Articulate Medical Intelligence Explorer (AMIE) beyond diagnosis to support longitudinal disease management. The system is now designed to assist clinicians in monitoring disease progression, adjusting treatments, and adhering to clinical guidelines across multiple patient visits.

Robert Krzaczyński
on Mar 07, 2025
AI, ML & Data Engineering

Mistral AI Introduces Saba: Regional Language Model for Arabic and South Indian Languages

Mistral AI has introduced Mistral Saba, a 24-billion-parameter language model designed to improve AI performance in Arabic and several Indian-origin languages, particularly South Indian languages like Tamil.

Robert Krzaczyński
on Mar 06, 2025
AI, ML & Data Engineering

Perplexity Unveils Deep Research: AI-Powered Tool for Advanced Analysis

Perplexity has introduced Deep Research, an AI-powered tool designed for conducting in-depth analysis across various fields, including finance, marketing, and technology. The system automates the research process by performing multiple searches, analyzing extensive sources, and synthesizing findings into structured reports within minutes.

Robert Krzaczyński
on Feb 24, 2025
AI, ML & Data Engineering

OmniHuman-1: Advancing AI-Generated Human Animation

OmniHuman-1, an advanced AI-driven human video generation model, has been introduced, marking a significant leap in multimodal animation technology. OmniHuman-1 enables the creation of highly lifelike human videos using minimal input, such as a single image and motion cues like audio or video.

Robert Krzaczyński
on Feb 20, 2025
AI, ML & Data Engineering

Microsoft Introduces CoRAG: Enhancing AI Retrieval with Iterative Reasoning

Microsoft AI has introduced Chain-of-Retrieval Augmented Generation (CoRAG), a new AI framework designed to enhance Retrieval-Augmented Generation (RAG) models. Unlike traditional RAG systems, which rely on a single retrieval step, CoRAG enables iterative search and reasoning, allowing AI models to refine their retrievals dynamically before generating answers.

Robert Krzaczyński
on Feb 11, 2025
AI, ML & Data Engineering

Microsoft Research Unveils rStar-Math: Advancing Mathematical Reasoning in Small Language Models

Microsoft Research has unveiled rStar-Math, a framework showing that small language models (SLMs) can achieve mathematical reasoning capabilities comparable to, and in some cases exceeding, those of larger models such as OpenAI's o1-mini. It does so without relying on more advanced models, representing a novel approach to enhancing the inference capabilities of AI.

Robert Krzaczyński
on Jan 23, 2025
AI, ML & Data Engineering

HuatuoGPT-o1: Advancing Complex Medical Reasoning with AI

Researchers from The Chinese University of Hong Kong, Shenzhen, and the Shenzhen Research Institute of Big Data have introduced HuatuoGPT-o1, a medical large language model (LLM) designed to improve reasoning in complex healthcare scenarios.

Robert Krzaczyński
on Jan 14, 2025
AI, ML & Data Engineering

NVIDIA Unveils Hymba 1.5B: a Hybrid Approach to Efficient NLP Models

NVIDIA researchers have unveiled Hymba 1.5B, an open-source language model that combines transformer and state-space model (SSM) architectures to achieve unprecedented efficiency and performance. Designed with NVIDIA’s optimized training pipeline, Hymba addresses the computational and memory limitations of traditional transformers while enhancing the recall capabilities of SSMs.

Robert Krzaczyński
on Jan 03, 2025
AI, ML & Data Engineering

Qwen Team Unveils QwQ-32B-Preview: Advancing AI Reasoning and Analytics

The Qwen Team has introduced QwQ-32B-Preview, an experimental research model designed to improve AI reasoning and analytical capabilities. Featuring a 32,768-token context window and a cutting-edge transformer architecture, it excels on math, programming, and scientific benchmarks such as GPQA and MATH-500. The model is available on Hugging Face, where researchers are invited to explore its features and contribute to its development.

Robert Krzaczyński
on Dec 31, 2024
AI, ML & Data Engineering

Meta Releases Llama 3.3: a Multilingual Model with Enhanced Performance and Efficiency

Meta has released Llama 3.3, a multilingual large language model aimed at supporting a range of AI applications in research and industry. Featuring a 128k-token context window and architectural improvements for efficiency, the model demonstrates strong performance in benchmarks for reasoning, coding, and multilingual tasks. It is available under a community license on Hugging Face.

Robert Krzaczyński
on Dec 14, 2024
AI, ML & Data Engineering

Nexa AI Unveils Omnivision: a Compact Vision-Language Model for Edge AI

Nexa AI has unveiled Omnivision, a compact vision-language model tailored for edge devices. By reducing the number of image tokens from 729 to 81, a ninefold reduction, Omnivision lowers latency and computational requirements while maintaining strong performance in tasks like visual question answering and image captioning.

Robert Krzaczyński
on Dec 03, 2024
AI, ML & Data Engineering

Epoch AI Unveils FrontierMath: A New Frontier in Testing AI's Mathematical Reasoning Capabilities

Epoch AI, in collaboration with over 60 mathematicians from leading institutions worldwide, has introduced FrontierMath, a new benchmark designed to evaluate AI systems' capabilities in advanced mathematical reasoning.

Vinod Goje
on Nov 28, 2024
AI, ML & Data Engineering

Rhymes AI Unveils Aria: Open-Source Multimodal Model with Development Resources

Rhymes AI has introduced Aria, an open-source multimodal native Mixture-of-Experts (MoE) model capable of processing text, images, video, and code effectively. In benchmarking tests, Aria has outperformed other open models and demonstrated competitive performance against proprietary models such as GPT-4o and Gemini-1.5.

Robert Krzaczyński
on Oct 28, 2024
AI, ML & Data Engineering

Hugging Face Upgrades Open LLM Leaderboard v2 for Enhanced AI Model Comparison

Hugging Face has released Open LLM Leaderboard v2, an upgraded version of its benchmarking platform for large language models. Hugging Face created the Open LLM Leaderboard to provide a standardized evaluation setup for reference models, ensuring reproducible and comparable results.

Vinod Goje
on Oct 10, 2024
