Le Lézard
Classified in: Science and technology
Subjects: Contract/Agreement, Product/Service

Cerebras Selects Qualcomm to Deliver Unprecedented Performance in AI Inference


Cerebras Systems, a pioneer in accelerating generative artificial intelligence (AI), today announced the company's plans to deliver groundbreaking performance and value for production artificial intelligence (AI). By using Cerebras' industry-leading CS-3 AI accelerators for training with the AI 100 Ultra, a product of Qualcomm Technologies, Inc., for inference, production grade deployments can realize up to a 10x price-performance improvement.

"These joint efforts are aimed at ushering in a new era of high-performance low-cost inference and the timing couldn't be better. Our customers are focused on training the highest quality state-of-the-art models that won't break the bank at time of inference," said Andrew Feldman, CEO and co-founder of Cerebras. "Utilizing the AI 100 Ultra from Qualcomm Technologies, we can radically reduce the cost of inference ? without sacrificing model quality -- leading to the most efficient deployments available today."

Leveraging the latest cutting-edge ML techniques and world-class AI expertise, Cerebras will work with Qualcomm Technologies' AI 100 Ultra to speed up AI inference. Some of the advanced techniques to be used are as follows:

A combination of these and other advanced techniques are designed to allow the Cerebras and Qualcomm Technologies solutions to deliver an order of magnitude performance improvement while enabling it at model release, resulting in inference-ready models that can be deployed on Qualcomm cloud instances anywhere.

"The combination of Cerebras' AI training solution with the AI 100 Ultra helps deliver industry leading perf/TCO$ for AI Inference, as well as optimized and deployment-ready AI models to customers helping reduce time to deployment and time to RoI," said Rashid Attar, Vice President, Cloud Computing, Qualcomm Technologies, Inc.

By training on Cerebras, customers can now unlock massive performance and cost advantages with inference-aware training. Models trained on Cerebras are optimized to run inference on the AI 100 Ultra leading to friction-free deployments.

"AI has become a key part of pharmaceutical research and development, and the cost of operating models is a critical consideration in the research budget," said Kim Branson, Sr. Vice President and Global Head of AI/ML at GlaxoSmithKline. "Techniques like sparsity and speculative decoding that make inference faster while lowering operating costs are critical: this allows everyone to integrate and experiment with AI."

For more information on the Qualcomm Technologies and Cerebras AI training and inference solutions, please visit the Cerebras blog. The Cerebras CS-3 for AI training and Qualcomm AI 100 Ultra for inference at scale will be available in Q2/Q3 2024.

About Cerebras Systems

Cerebras Systems is a team of pioneering computer architects, computer scientists, deep learning researchers, and engineers of all types. We have come together to accelerate generative AI by building a new class of computer system. Our flagship product, the CS-2 system, is powered by the world's largest and fastest AI processor, our Wafer-Scale Engine. It makes training large models simple and easy by avoiding the complexity of distributed computing. Cerebras CS-2s are clustered together to make the largest AI supercomputers in the world, which are used by leading corporations for proprietary models, and to train open-source models with millions of downloads. Cerebras solutions are available through the Cerebras Cloud and on premise. For further information, visit https://www.cerebras.net.

Qualcomm Cloud AI and Qualcomm AI Stack are products of Qualcomm Technologies, Inc., and/or its subsidiaries.


These press releases may also interest you

at 19:15
The Boeing Company [NYSE: BA] announced today it closed an offering of $10.0 billion aggregate principal amount of fixed-rate senior unsecured notes (the "notes"), consisting of $1.0 billion aggregate principal amount of its 6.259% senior notes due...

at 19:05
Milrem Robotics, Europe's leading robotics and autonomous systems developer, is to introduce its most advanced autonomous combat support unmanned ground vehicle (UGV), THeMIS, at the Defense Services Asia (DSA) exhibition in Kuala Lumpur. The...

at 19:01
The latest insights briefing from the Energy Transitions Commission, Overcoming Turbulence in the Offshore Wind Sector, highlights the need for governments and the offshore wind industry to join forces to restore confidence in the market, drive down...

at 19:01
Infinitopes Precision Immunomics, an integrated cancer biotech combining world leading platforms in precision antigen discovery with vaccine vectors capable of durably stimulating protective immune responses, today announced the completion of a...

at 18:52
ApartmentLove Inc. ("ApartmentLove" or the "Company"), a leading provider of online home, apartment, and vacation rental marketing services to property managers, owners, renters, and vacationers from around the world has announced the Company's...

at 18:43
Temperatures will soar two to four degrees above the historical average across much of the United States this summer, leading to an increased demand for electricity to run air conditioners. More 90-degree days are expected in New York City, Boston,...



News published on and distributed by: