AI Case Study: Model evaluation at Dropbox

The story

Context

Dropbox, a leading cloud storage platform, is developing a universal search tool that retrieves and organizes work across all of a user's connected applications.

Challenge

Behind the search interface runs a complex chain of retrieval and inference steps where a single prompt tweak can ripple unpredictably to cause...

Scope & timeline

Under 10 minutes for automated PR evaluations

Quotes

“With Braintrust, our science fiction writer can sit down, see something he doesn't like, test against it very quickly, and deploy his change to production. That's pretty remarkable.”
– Ameya Bhatawdekar, Director of Machine Learning, Dropbox

The company

Dropbox

dropbox.com

Cloud storage, file sharing, and collaboration platform for teams and individuals.

Industry: Software & Platforms

Location: San Francisco, CA, USA

Employees: 1K-5K

Founded: 2007

The vendor

Braintrust

braintrustdata.com

AI observability and evaluation platform that helps developers build, test, and monitor LLM-powered applications.

Industry: Software & Platforms

Location: San Francisco, CA

Employees: 11-50

Founded: 2020

Use case

Dropbox's model evaluation case study is part of this use case:

AI Infrastructure

70 case studies (+118% YoY)

Proven impact

Scale: Low, Moderate, Very Strong

Overall: 3.4 (Moderate)

Within Software & Platforms: 2.9 (Low)

Within Product Engineering: 3.3 (Moderate)
