PayPal Adds GenAI Support with LLMs to Its Cosmos.AI MLOps Platform

Oct 09, 2024 2 min read

PayPal extended its MLOps platform Cosmos.AI to support the development of generative AI applications using large language models (LLMs). The company incorporated support for vendor, open-source, and self-tuned LLMs and provided capabilities around retrieval-augmented generation (RAG), semantic caching, prompt management, orchestration, and AI application hosting.

PayPal conceived the Cosmos.AI platform around 2020 and made it generally available in mid-2022. The company decided to consolidate the many bespoke, fragmented solutions that teams had previously built independently into a single enterprise platform supporting the end-to-end Machine Learning Development Lifecycle (MLDLC). Since its launch, Cosmos.AI has become the de facto AI/ML platform for the company and has been used by thousands of data scientists, analysts, and developers.

The unified platform offers capabilities similar to those of cloud provider services such as Amazon SageMaker, Azure Machine Learning, and Google Cloud Vertex AI. Cosmos.AI also decouples platform capabilities from their implementations, so users can choose between bespoke in-house implementations and those provided by the open-source solutions or third-party vendors that the platform integrates with. In addition, the platform supports multi-tenancy and self-service and can operate in multi-cloud and hybrid-cloud environments.
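The decoupling of capabilities from implementations described above can be illustrated with a small sketch. This is not PayPal's code: the `ModelServing` capability, the two backend classes, and the `CapabilityRegistry` are hypothetical names chosen for illustration, assuming a simple registry that maps a capability and an implementation name to a concrete backend.

```python
from typing import Callable, Dict, Protocol, Tuple


class ModelServing(Protocol):
    """Hypothetical capability contract: what callers depend on."""

    def predict(self, payload: dict) -> dict: ...


class InHouseServing:
    """Hypothetical bespoke, in-house implementation."""

    def predict(self, payload: dict) -> dict:
        return {"backend": "in-house", "echo": payload}


class VendorServing:
    """Hypothetical implementation backed by a third-party vendor."""

    def predict(self, payload: dict) -> dict:
        return {"backend": "vendor", "echo": payload}


class CapabilityRegistry:
    """Maps (capability, implementation) pairs to factories (assumed design)."""

    def __init__(self) -> None:
        self._factories: Dict[Tuple[str, str], Callable[[], object]] = {}

    def register(self, capability: str, implementation: str,
                 factory: Callable[[], object]) -> None:
        self._factories[(capability, implementation)] = factory

    def resolve(self, capability: str, implementation: str) -> object:
        try:
            return self._factories[(capability, implementation)]()
        except KeyError:
            raise LookupError(
                f"no '{implementation}' implementation for '{capability}'"
            ) from None


registry = CapabilityRegistry()
registry.register("model-serving", "in-house", InHouseServing)
registry.register("model-serving", "vendor", VendorServing)

# A tenant picks an implementation by name; calling code only sees the contract.
serving = registry.resolve("model-serving", "vendor")
result = serving.predict({"text": "hello"})
```

In a design like this, swapping the in-house backend for a vendor one is a configuration change for the tenant rather than a change to the calling code.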

Architecture of Cosmos.AI MLOps Platform (Source: PayPal Technology Blog)

PayPal recognized the importance of generative AI and embraced it early on. Building on the flexible and extensible architecture of Cosmos.AI, the company invested in developing GenAI capabilities that leverage LLMs to foster product innovation. Jun Yang, engineering director at PayPal, describes the company's efforts to provide centralized support for GenAI:

Thanks to the solid foundations we have in place for platform with its remarkable extensibility, we were able to develop a Gen AI horizontal platform on PayPal Cosmos.AI in the span of a few months, allowing PayPal to fully tap into this technology and rapidly scale Gen AI application development across the company, while reducing costs by minimizing duplicated efforts on Gen AI adoptions among different teams.

The company augmented Cosmos.AI's training capabilities to allow fine-tuning of open-source and vendor-hosted LLMs. It also extended the model repository to enable easy onboarding of LLMs from public model gardens such as Hugging Face while supporting legal and licensing checks. Cosmos.AI now also provides LLMOps capabilities, including multi-GPU deployments, LLM optimizations, streaming interfaces, and enhanced logging and monitoring.

LLMOps Capabilities of Cosmos.AI (Source: PayPal Technology Blog)

To support GenAI applications, PayPal developed a range of new capabilities, including a RAG framework built on a vector database and Cosmos.AI pipelines, semantic caching for LLM inference, a prompt management framework, a platform orchestration framework, LLM evaluation tools, and application hosting. In the future, the team plans to evolve the AI/ML platform further and turn self-service and manual processes into autonomous workflows.
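To make the idea of semantic caching concrete, here is a minimal sketch of the general technique, not PayPal's implementation. It reuses a cached LLM answer when a new prompt is similar enough to an earlier one. The names `toy_embed`, `SemanticCache`, and `cached_completion`, as well as the 0.8 threshold, are assumptions for illustration; a real system would likely use an embedding model and a vector database instead of the bag-of-words stand-in used here.

```python
import math
from collections import Counter
from typing import Callable, List, Optional, Tuple


def toy_embed(text: str) -> Counter:
    """Hypothetical stand-in for an embedding model: bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SemanticCache:
    """Stores (embedding, answer) pairs and serves the closest match."""

    def __init__(self, threshold: float = 0.8) -> None:
        self.threshold = threshold  # assumed cut-off; tune per use case
        self.entries: List[Tuple[Counter, str]] = []

    def lookup(self, prompt: str) -> Optional[str]:
        query = toy_embed(prompt)
        best_score, best_answer = 0.0, None
        for vector, answer in self.entries:  # linear scan; fine for a sketch
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_answer = score, answer
        return best_answer if best_score >= self.threshold else None

    def store(self, prompt: str, answer: str) -> None:
        self.entries.append((toy_embed(prompt), answer))


def cached_completion(cache: SemanticCache, prompt: str,
                      call_llm: Callable[[str], str]) -> str:
    """Return a cached answer for similar prompts, else call the model."""
    hit = cache.lookup(prompt)
    if hit is not None:
        return hit
    answer = call_llm(prompt)
    cache.store(prompt, answer)
    return answer
```

With this sketch, a prompt such as "how do i reset my password please" would be served from the entry cached for "how do I reset my password", while an unrelated prompt would still reach the model. The threshold trades cost savings against the risk of returning an answer to a question that only looks similar.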

About the Author

Rafal Gancarz

Rafał is an experienced technology leader and expert. He's currently helping Starbucks make its Commerce Platform scalable, resilient and cost-effective. Previously, Rafał has been involved in designing and building large-scale, distributed and cloud-based systems for Cisco, Accenture, Capita, ICE, Callsign and others. His interests span architecture & design, continuous delivery, observability and operability, as well as sociotechnical and organisational aspects of software delivery.
