InfoQ Homepage Generative AI Content on InfoQ
-
Harnessing AI-Generated Cloudformation with Application Composer
The AWS Toolkit for VS Code has recently extended its support to include AWS Application Composer, introduced a year ago in the AWS Management Console. This enhancement empowers users to seamlessly craft Infrastructure as Code (IaC) for a comprehensive range of over 1100 AWS CloudFormation resources.
-
Mistral AI's Open-Source Mixtral 8x7B Outperforms GPT-3.5
Mistral AI recently released Mixtral 8x7B, a sparse mixture of experts (SMoE) large language model (LLM). The model contains 46.7B total parameters, but performs inference at the same speed and cost as models one-third that size. On several LLM benchmarks, it outperformed both Llama 2 70B and GPT-3.5, the model powering ChatGPT.
-
Google Announces Video Generation LLM VideoPoet
Google Research recently published their work on VideoPoet, a large language model (LLM) that can generate video. VideoPoet was trained on 2 trillion tokens of text, audio, image, and video data, and in evaluations by human judges its output was preferred over that of other models.
-
OpenAI Adopts Preparedness Framework for AI Safety
OpenAI recently published a beta version of their Preparedness Framework for mitigating AI risks. The framework lists four risk categories and definitions of risk levels for each, as well as defining OpenAI's safety governance procedures.
-
OpenAI Publishes GPT Prompt Engineering Guide
OpenAI recently published a guide to Prompt Engineering. The guide lists six strategies for eliciting better responses from their GPT models, with a particular focus on examples for their latest version, GPT-4.
-
Amazon Q Code Transformation: Automating Java Application Upgrades
AWS has recently announced the preview of Amazon Q Code Transformation, a service designed to simplify the process of upgrading existing Java application code through generative artificial intelligence. The new feature aims to minimize legacy code and automate common language upgrade tasks required to move off older language versions.
-
Microsoft Announces Small Language Model Phi-2
Microsoft Research announced Phi-2, a 2.7 billion-parameter Transformer-based language model. Phi-2 is trained on 1.4T tokens of synthetic data generated by GPT-3.5 and outperforms larger models on a variety of benchmarks.
-
Microsoft's Orca 2 LLM Outperforms Models That Are 10x Larger
Microsoft Research released its Orca 2 LLM, a fine-tuned version of Llama 2 that performs as well as or better than models that contain 10x the number of parameters. Orca 2 uses a synthetic training dataset and a new technique called Prompt Erasure to achieve this performance.
-
Amazon Unveils Titan AI Image Generator
Amazon unveiled Titan Image Generator, currently in preview for AWS customers on Bedrock, Amazon's AI development platform. As a member of Amazon's Titan family of generative AI models, Titan Image Generator has the capability to generate new images based on a text description or customize existing images.
-
Stability AI Open-Sources Video Generation Model Stable Video Diffusion
Stability AI released the code and model weights for Stable Video Diffusion (SVD), a video generation AI model. When given an input image as context, the model can generate 25 video frames at a resolution of 576x1024 pixels.
-
Recap of AWS re:Invent 2023: Amazon Q, Frugal Architectures, Database Upgrades
The 12th edition of re:Invent has just ended in Las Vegas. As expected, artificial intelligence was a key topic of the conference, with Amazon Bedrock and Amazon Q, a new type of generative AI-powered assistant, the main focus of Adam Selipsky’s keynote.
-
AWS Announces Amazon Q, a New Generative AI–Powered Assistant
AWS has introduced Amazon Q, a new generative AI-powered assistant designed for professional applications. This assistant is configurable to align with your company's requirements, facilitating conversations, issue resolution, content generation, and action-taking through the utilization of information present in your code, enterprise systems, and data repositories.
-
Meta Announces Generative AI Models Emu Video and Emu Edit
Meta AI Research announced two new generative AI models: Emu Video, which can generate short videos given a text prompt, and Emu Edit, which can edit images given text-based instructions. Both models are based on Meta's Emu foundation model and exhibit state-of-the-art performance on several benchmarks.
-
Anthropic Announces Claude 2.1 LLM with Wider Context Window and Support for AI Tools
According to Anthropic, the newest version of Claude delivers many “advancements in key capabilities for enterprises—including an industry-leading 200K token context window, significant reductions in rates of model hallucination, system prompts and our new beta feature: tool use.” Anthropic also announced reduced pricing to improve cost efficiency for our customers across models.
-
KubeCon NA 2023: Ishan Sharma on Real-Time Generative AI for Gaming Apps Running on Kubernetes
Kubernetes provides a great platform for gaming applications using generative artificial intelligence (GenAI) for both game development and gameplay. Ishan Sharma from Google spoke at the recent KubeCon CloudNativeCon NA 2023 Conference about real-time GenAI inference integrated with distributed game servers running on Kubernetes.