Artificial Intelligence Content on InfoQ
-
Stability AI Releases 3D Model Generation AI Stable Video 3D
Stability AI recently released Stable Video 3D (SV3D), an AI model that can generate 3D mesh object models from a single 2D image. SV3D is based on the Stable Video Diffusion model and produces state-of-the-art results on 3D object generation benchmarks.
-
Mistral Large Foundation Model Now Available on Amazon Bedrock
AWS announced the availability of the Mistral Large Foundation Model on Amazon Bedrock during the recent AWS Paris Summit. This announcement comes days after the release of Mistral AI Models on Amazon Bedrock.
-
How to Do Sustainable Software Development
Software sustainability includes computing for environmental purposes and using resources appropriately. According to Coral Calero, software engineers need a holistic way of looking at software and should be aware of the environmental impact of software. Several tools and frameworks are available for software engineers to do sustainable software development.
-
Google Announces Agent Builder, Expanded Gemini 1.5, Open-Source Additions
At the Google Cloud Next 2024 event, Google announced the launch of Vertex AI Agent Builder, the public preview of Google's most advanced generative AI model, Gemini 1.5 Pro, and the addition of open-source language models to the Vertex AI platform.
-
Fix Your SDLC before Adopting Gen AI in Your Organisation: Bannon’s Call to Action at QCon London
During her keynote at QCon London, Tracy Bannon, architect and researcher at MITRE, argued that AI will be able to enhance the software development lifecycle, though it is currently at the “code completion” rather than the “code generation” phase. Throughout her presentation, she stressed the importance of keeping humans in the loop and fixing your company’s SDLC before embracing AI.
-
QCon London: Lessons Learned from Building LinkedIn’s AI/ML Data Platform
At the QCon London 2024 conference, Félix GV from LinkedIn discussed the AI/ML platform powering the company’s products. He specifically delved into Venice DB, the NoSQL data store used for feature persistence. The presenter shared the lessons learned from evolving and operating the platform, including cluster management and library versioning.
-
QCon London: gRPC Migration Automation at LinkedIn
At QCon London 2024, Karthik Ramgopal and Min Chen described how AI helped LinkedIn change the remote procedure call (RPC) protocol for 50,000 production endpoints from Rest.li to Google's gRPC. A planned 2-3 year manual migration turned into an AI-supported migration lasting 2-3 quarters. It changed 20 million lines of code across 2000 services – without business interruption.
-
Large Language Models for Code by Loubna Ben Allal at QCon London
At QCon London, Loubna Ben Allal discussed Large Language Models (LLMs) for code. She walked through the lifecycle of code completion models, which consists of pre-training on vast codebases, finetuning, and continuous adaptation. She focused on open-source models, which are powered by platforms like Hugging Face.
-
Efficient DevSecOps Workflows with a Little Help from AI: Q&A with Michael Friedrich
At QCon London, Michael Friedrich, senior developer advocate at GitLab, discussed how AI can help in DevSecOps workflows. His session was part of the Cloud-Native Engineering track on the first day of the conference. InfoQ interviewed Friedrich after the session.
-
Navigating LLM Deployment: Tips, Tricks and Techniques by Meryem Arik at QCon London
At QCon London, Meryem Arik discussed deploying Large Language Models (LLMs). While initial proofs of concept benefit from hosted solutions, scaling demands self-hosting to cut costs, enhance performance with tailored models, and meet privacy and security requirements. She emphasized understanding deployment limits, quantization for efficiency, and optimizing inference to fully use GPU resources.
-
Nvidia Announces Robotics-Oriented AI Foundational Model
At its recent GTC 2024 event, Nvidia announced a new foundational model to build intelligent humanoid robots. Dubbed GR00T, short for Generalist Robot 00 Technology, the model will understand natural language and be able to observe human actions and emulate human movements.
-
Microsoft AI-Driven Security Tool Copilot for Security is Now GA
Microsoft recently announced the general availability of Copilot for Security, a generative Artificial Intelligence (AI) security product designed to give security and IT teams the capabilities to protect their digital assets.
-
KubeCon EU Keynotes: a Call to Action to Innovate Responsibly with Generative AI
The KubeCon EU morning keynotes were a veritable call to action encouraging the cloud-native community's involvement in building the scalable infrastructure needed by generative AI. This call was balanced with encouragement to make a cloud-native platform’s “golden path” green and sustainable, ensuring that any innovation is also responsible.
-
Java News Roundup: Jakarta Data and Jakarta NoSQL Milestones, Class-File API Targeted for JDK 23
This week's Java roundup for March 25th, 2024, features news highlighting: JEP 466, Class-File API (Second Preview), targeted for JDK 23; milestone releases of Jakarta Data and Jakarta NoSQL specifications; the second release candidate for JobRunr 7.0.0; and point releases for Spring projects, Quarkus, Helidon and LangChain4j.
-
Databricks Announces DBRX, an Open Source General Purpose LLM
Databricks launched DBRX, a new open-source large language model (LLM) that aims to redefine the standards of open models and outperform well-known competitors on industry benchmarks.