back

Using OSS models to build AI apps with millions of users

Building AI apps that scale with open source models

In a world increasingly dominated by proprietary AI systems, Hassan El Mghari offers a refreshing counternarrative. His recent talk explores how open source models can power applications serving millions of users while maintaining sustainable economics. As businesses contemplate their AI strategy, El Mghari's insights provide a compelling case for considering open source alternatives alongside the dominant proprietary options from OpenAI and Anthropic.

El Mghari draws from his experience at Replit, where their AI coding assistant "Ghostwriter" serves millions of developers. His presentation walks through the practical considerations of building AI applications at scale, challenging the conventional wisdom that only proprietary models can deliver sufficient performance for production applications. The economics he presents are particularly striking: serving millions of users with proprietary models could bankrupt many startups, while open source alternatives offer a path to sustainability.

  • Open source models have rapidly improved, now matching or exceeding proprietary alternatives for many specific use cases while offering significantly better economics
  • Fine-tuning smaller models on domain-specific data often outperforms larger general-purpose models, especially for specialized tasks
  • Thoughtful system design—combining retrieval augmented generation, multiple specialized models, and effective prompting—can create exceptional user experiences without relying on the largest models

The economics revolution of open source AI

Perhaps the most compelling insight from El Mghari's talk is the stark economic contrast between proprietary and open source models. When he demonstrates that serving 5 million monthly users with GPT-4 could cost a staggering $10 million per month versus just $50,000 with an open source alternative, the business implications become impossible to ignore.

This matters enormously in our current business climate. As companies race to integrate AI capabilities, many are building their strategies around expensive proprietary APIs without fully exploring alternatives. For startups, this decision could mean the difference between sustainability and burning through venture funding at an alarming rate. For enterprises, it represents potential cost savings of millions while maintaining control over their AI infrastructure.

The economic advantages extend beyond direct costs. El Mghari notes that open source models allow for local deployment, reducing latency and eliminating data privacy concerns that come with sending information to third-party APIs. This infrastructure flexibility creates strategic advantages that purely API-based approaches cannot match.

Beyond the obvious:

Recent Videos

May 6, 2026

Hermes Agent Master Class

https://www.youtube.com/watch?v=R3YOGfTBcQg Welcome to the Hermes Agent Master Class — an 11-episode series taking you from zero to fully leveraging every feature of Nous Research's open-source agent. In this first episode, we install Hermes from scratch on a brand new machine with no prior skills or memory, walk through full configuration with OpenRouter, tour the most important CLI and slash commands, and run our first real task: a competitor research report on a custom children's book AI business idea. Every future episode will build on this fresh install so you can see the compounding value of the agent in real time....

Apr 29, 2026

Andrej Karpathy – Outsource your thinking, but you can’t outsource your understanding

https://www.youtube.com/watch?v=96jN2OCOfLs Here's what Andrej Karpathy just figured out that everyone else is still dancing around: we're not in an era of "better models." We're in a different era of computing altogether. And the difference between understanding that and not understanding it is the difference between being a vibe coder and being an agentic engineer. Last October, Karpathy had a realization. AI didn't stop being ChatGPT-adjacent. It fundamentally shifted. Agentic coherent workflows started to actually work. And he's spent the last three months living in side projects, VB coding, exploring what's actually possible. What he found is a framework that explains...

Mar 30, 2026

Andrej Karpathy on the Decade of Agents, the Limits of RL, and Why Education Is His Next Mission

A summary of key takeaways from Andrej Karpathy's conversation with Dwarkesh Patel In a wide-ranging conversation with Dwarkesh Patel, Andrej Karpathy — former head of AI at Tesla, founding member of OpenAI, and creator of some of the most popular AI educational content on the internet — shared his views on where AI is headed, what's still broken, and why he's now pouring his energy into education. Here are the key takeaways. "It's the Decade of Agents, Not the Year of Agents" Karpathy's now-famous quote is a direct pushback on industry hype. Early agents like Claude Code and Codex are...