Autonoma
Software testing
Model latency bottlenecked AI test generation. Faster inference now runs thousands of concurrent jobs, building tests in real time.
- Regression testing cut from 3 days to single-digit minutes for some customers
Self-hosting caused weekly outages and lag. Moving to Groq ended downtime and cut response times by 500ms, regardless of prompt length.
A voice productivity platform enabling users to dictate emails, Slack messages, and meeting summaries through real-time speech-to-text processing.
Self-hosting models on public GPUs caused weekly outages, forcing the team to repeatedly notify users of server downtime. Latency increased...
“Uptime is the lifeblood of our product. If the service goes down, even for a short time, we risk losing trust, and losing users.”
AI-powered voice dictation software for Mac, Windows, and iOS.
LPU hardware and cloud platform for high-speed AI inference.
Related implementations across industries and use cases
Model latency bottlenecked AI test generation. Faster inference now runs thousands of concurrent jobs, building tests in real time.
5-10s latency broke call momentum. Migrating to Groq cut response time to 200ms, allowing the AI to guide reps instantly.
Cached prompts failed dynamic voice chats. Rebuilding context every turn via GPT-5.1 cut memory misses 30% and lifted retention 20%.
One-hour videos took 20 minutes to transcribe. A new inference engine processes them in 15 seconds.
A custom pipeline struggled with overlapping speech. Replacing it cut maintenance and processes hour-long meetings in seconds.
Scattered spreadsheets couldn't catch AI hallucinations. Now, automated LLM judges evaluate every prompt change to block regressions.
Moderation couldn't keep pace with 600M users. AI agents now filter toxicity while models recognize 2.5B objects to refine search.
Hundreds of pages per board book slowed director prep. Now, isolated AI securely condenses sensitive materials into actionable briefs.
Experts spent 15 minutes pulling data from scattered systems. Natural language prompts now generate detailed reports instantly.