Le Lézard
Classified in: Science and technology
Subject: PDT

Labelbox partners with Google Cloud to offer LLM human evaluation services


SAN FRANCISCO, April 10, 2024 /PRNewswire/ -- As teams building generative AI applications transition from prototypes to production, evaluating the performance of large language models (LLMs) is becoming critical to their success. State-of-the-art techniques for evaluating LLMs and compound AI systems, like RAG, typically employ a hybrid strategy of automated and human evaluation. While optimizing LLMs for human preference judgment can improve their performance, human evaluation remains one of the most time-consuming and resource intensive parts of the process.

Labelbox and Google Cloud partner to provide an integrated solution for LLM human evaluation as a fully managed service.

To enable teams to evaluate and ship LLM applications confidently, Labelbox has partnered with Google Cloud to provide Vertex AI platform customers an integrated solution for LLM evaluation as a fully managed service.

Vertex AI LLM Evaluation 

With this LLM Evaluation solution, Vertex AI customers can go directly into the Vertex AI platform interface to launch an LLM evaluation job, set their desired evaluation type (e.g., single model or side-by-side comparison) and criteria (e.g, question-answer, multi-turn chat, summarization), and get quality reviewed results within days from skilled evaluation professionals.

The LLM Evaluation solution from Labelbox provides teams with easy access to human raters who will help evaluate the effectiveness of their organization's LLMs against a wide range of customisable criteria - from instruction following, verbosity, to relevance of any given response.

With integrated APIs customers can simply configure their task within the Vertex AI platform and everything else is taken care of by Labelbox before the QA process. Seamless visualization of the labeling team's responses within the Vertex AI platform also gives customers the ability to review and accept outputs, putting you in full control of the annotation quality.

A full suite of Labelbox products now available on the Google Cloud Marketplace

For teams looking to get the best of both worlds and combine a hybrid approach of AI-assistance with human evaluation, Google Cloud customers can now purchase a full suite of Labelbox products on the Google Cloud Marketplace. With native no-code integrations with Google Cloud's BigQuery, CloudSQL and Google Sheets, customers can integrate data pipelines with Labelbox in minutes.

With this offering, Labelbox provides a data-centric AI platform providing data curation, AI-assisted labeling, premium data labeling services, and model diagnostics to align task-specific models and build intelligent applications. The latest updates to Labelbox's products include model distillation, reinforcement learning with human feedback (RLHF) and LLM evaluation.

How to get started

As LLMs continue to power a broad array of everyday applications, they will continue to require nuanced supervision from humans to detect and mitigate errors, inconsistencies, or biases. The partnership between Google Cloud and Labelbox enables Vertex AI customers to receive a critical solution for enhancing how LLM products are built - by more easily injecting human evaluation and AI assistance directly into the process. With this technology, all the heavy lifting and manual effort is done for you, freeing up your organization's resources to focus on building and delivering AI products.

To learn more about the LLM evaluation solution, contact us and get early access here.

Media contact: 
David Mok 
6282527391
[email protected] 

SOURCE Labelbox


These press releases may also interest you

at 04:05
Forrester today announced the full conference agenda for its CX Summit EMEA event being held in London and digitally on June 24-26, 2024. Today, customers expect more personalised services that reflect their unique preferences and behaviours. With...

at 04:05
Merge, which provides unified APIs for B2B SaaS organizations to easily add hundreds of integrations to their products, has officially expanded to Europe with the establishment of its Berlin-based team. The expansion reinforces Merge's commitment to...

at 04:05
Gecko Robotics and Al Masaood Energy today announced a multi-year contract with ADNOC Gas, an integrated gas processing company and a subsidiary of Abu Dhabi National Oil Company (ADNOC), one of the world's largest integrated energy companies. The...

at 04:05
Alvotech , a global biotech company specializing in the development and manufacture of biosimilar medicines for patients worldwide, and Teva Pharmaceuticals, a U.S. affiliate of Teva Pharmaceutical Industries Ltd. , disclosed today that under the...

at 04:00
TEOCO, the leading provider of analytics and optimization solutions to over 300 communication service providers (CSPs) worldwide, announces that KPN, a premier provider of communications services in the Netherlands, is utilizing TEOCO's digital...

at 04:00
GOWIN Semiconductor Corporation, the world's fastest-growing FPGA manufacturer, today announced that its GOWIN EDA FPGA design environment has been certified compliant with the ISO 26262 and IEC 61508 functional safety standards by the...



News published on and distributed by: