Google introduced an LLM inference engine, a library of reference diffusion models, and TPU optimizations for transformer models at Google Cloud Next ’24.

At Google Cloud Next ’24, Google unveiled three open source projects for building and running generative AI models. The company also added new large language models to MaxText, its collection of JAX-built LLMs.

The new models in MaxText include Gemma, GPT-3, Llama 2, and Mistral, all of which are supported on both Google Cloud TPUs and Nvidia GPUs, the company said. The newly unveiled open source projects are MaxDiffusion, JetStream, and Optimum-TPU.

MaxDiffusion is a collection of high-performance, scalable reference implementations of diffusion models such as Stable Diffusion. Like the MaxText models, the MaxDiffusion models are built on JAX, a framework for high-performance numerical computing and large-scale machine learning. JAX in turn is integrated with the OpenXLA compiler, which optimizes numerical functions and delivers high performance at scale, allowing model builders to focus on the math while the software finds the most effective implementation. “We’ve heavily optimized JAX and OpenXLA performance on Cloud TPU and partnered closely with Nvidia to optimize OpenXLA performance on large Cloud GPU clusters,” Google said.

The company also introduced JetStream, an open source, optimized LLM inference engine that supports XLA compilers. “As customers bring their AI workloads to production, there’s an increasing demand for a cost-efficient inference stack that delivers high performance. JetStream helps with this need and offers support for models trained with both JAX and PyTorch/XLA, and includes optimizations for popular open models such as Llama 2 and Gemma,” said Mark Lohmeyer, general manager of compute and ML infrastructure at Google Cloud.
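The JAX-plus-OpenXLA workflow described above can be illustrated with a minimal sketch: the model author writes only plain numerical math, and a one-line `jax.jit` call hands the function to the XLA compiler, which fuses and optimizes it for the target backend. This is an illustrative example, not code from MaxText, MaxDiffusion, or JetStream.

```python
import jax
import jax.numpy as jnp

# A plain numerical function: the author writes only the math.
def mse(w, x, y):
    pred = jnp.dot(x, w)              # linear prediction
    return jnp.mean((pred - y) ** 2)  # mean squared error

# jax.jit routes the function through the XLA compiler, which fuses
# and optimizes the operations for CPU, GPU, or TPU.
mse_fast = jax.jit(mse)

# jax.grad derives the gradient function automatically; it is
# compiled through XLA as well.
grad_mse = jax.jit(jax.grad(mse))

w = jnp.zeros(3)
x = jnp.array([[1.0, 2.0, 3.0]])
y = jnp.array([6.0])
print(float(mse_fast(w, x, y)))   # loss with zero weights: 36.0
print(grad_mse(w, x, y))          # gradient with respect to w
```

The same source runs unchanged across backends; JAX dispatches to whatever accelerator XLA finds, which is the portability the article's TPU/GPU support claims rest on.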
Finally, Google’s open source announcements included the launch of Optimum-TPU for PyTorch users in the Hugging Face community. Optimum-TPU brings Google Cloud TPU performance optimizations to both training and inference. It supports the Gemma 2B model now, with Llama and Mistral to follow, Google said.