Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production (2312.14972v3)

Published 20 Dec 2023 in cs.SE, cs.AI, and cs.LG

Abstract: Many companies use LLMs offered as a service, like OpenAI's GPT-4, to create AI-enabled product experiences. Along with the benefits of ease of use and shortened time to solution, this reliance on proprietary services has downsides in model control, performance reliability, uptime predictability, and cost. At the same time, a flurry of open-source small LLMs (SLMs) has been made available for commercial use. However, their readiness to replace existing capabilities remains unclear, and a systematic approach to holistically evaluate these SLMs is not readily available. This paper presents a systematic evaluation methodology and a characterization of modern open-source SLMs and their trade-offs when replacing proprietary LLMs for a real-world product feature. We have designed SLaM, an open-source automated analysis tool that enables the quantitative and qualitative testing of product features utilizing arbitrary SLMs. Using SLaM, we examine the quality and performance characteristics of modern SLMs relative to an existing customer-facing implementation using the OpenAI GPT-4 API. Across 9 SLMs and their 29 variants, we observe that SLMs provide competitive results, significant performance consistency improvements, and a cost reduction of 5x to 29x when compared to GPT-4.

Authors (9)

Chandra Irugalbandara (3 papers)
Ashish Mahendra (4 papers)
Roland Daynauth (6 papers)
Tharuka Kasthuri Arachchige (1 paper)
Krisztian Flautner (6 papers)
Lingjia Tang (15 papers)
Yiping Kang (8 papers)
Jason Mars (21 papers)
Jayanaka Dantanarayana (2 papers)

Tweets

https://twitter.com/drjasonmars/status/1765798381937430858

https://twitter.com/drjasonmars/status/1781688232217907215

https://twitter.com/JaseciLabs/status/1781023954762699244

https://twitter.com/JaseciLabs/status/1776305110525645082

https://twitter.com/JaseciLabs/status/1781024737055117448

https://twitter.com/JaseciLabs/status/1781018958352576557
