2025-09-12

The Real State of AI for Startups in 2025: What Nobody's Telling You

By Thomas M. Jumper

Why 90% of AI Implementations Fail—and the Battle-Tested Framework Smart Founders Use to Build Real Moats, Not ChatGPT Wrappers

Let me start with something that might piss off the AI evangelists: most startups are using AI wrong.

I’ve spent the last month interviewing founders in my network and digging through case studies of companies actually shipping AI features. The pattern is depressing. Everyone’s slapping ChatGPT onto their product like it’s 2010 and they’re adding "social features." One founder proudly walked me through their "AI-powered" expense tracker on a Zoom call. It was literally just GPT-4 categorizing receipts—something a regex could do better and cheaper.

But here’s the thing: the founders who get it are quietly building empires while everyone else is playing with toys. And the numbers back this up—78% of organizations are now using AI according to Stanford’s latest AI Index, but most are getting mediocre results. The winners? They’re seeing 70% revenue increases.

The Landscape Has Changed (And Most People Haven’t Noticed)

Remember when everyone freaked out about ChatGPT in late 2022? That feels like ancient history now. The real shift happened when nobody was looking, sometime around March 2024, when open-source models got scary good.

I’ll give you a concrete example. Databricks recently showcased a customer service chatbot handling 120 requests per minute, processing 3,500 input tokens and generating 300 output tokens per interaction. Using Llama 3.3 70B, the monthly cost of running this chatbot would be 88% lower compared to Llama 3.1 405B and 72% more cost-effective compared to leading proprietary models.

This tracks with what’s happening industry-wide: the cost of running GPT-3.5-level performance has dropped 280x in just 18 months. The performance gap between proprietary and open-source models? It’s shrunk from 8% to 1.7%.

This is the part the big AI companies don’t want you to know: you probably don’t need them anymore.

The Open Source Revolution Nobody’s Talking About

Meta did something wild. They basically gave away the farm with Llama, and now every hacker with a GPU is building models that would’ve cost millions to develop two years ago.

I was skeptical at first. Really skeptical. But look at what’s happening: WizardCoder (an open-source model) is matching GPT-4 on certain coding tasks. Mistral’s models are being used by startups to build medical tools that rival products from companies with 100x the funding. A developer named Teknium fine-tuned models in his bedroom that compete with corporate efforts.

The numbers are insane: global private investment in AI hit $252 billion in 2024 (up 26% year-over-year), but increasingly that money isn’t going to closed models—it’s funding companies building on top of open infrastructure.

The gap between OpenAI’s best and open-source isn’t 10% anymore. On most tasks, it’s basically zero. On some specific tasks where you can fine-tune? Open source wins.

A Framework That Actually Works (Because I’ve Used It)

Forget the consultant-speak. Here’s how successful founders are actually thinking about AI:

1. Start Where It Hurts Most

Every startup has that one stupid task that eats hours. For my team, it was manually categorizing user feedback. We had someone spending 20 hours a week on it.

We didn’t need some fancy AI strategy. We needed that person doing literally anything else.

So we built the dumbest possible solution: a Python script that ran feedback through a local LLM and categorized it. Took an afternoon. Saved 20 hours a week.

That’s not just my experience, the data backs it up. McKinsey found that 42% of companies using AI this way are already seeing cost reductions. Not revolutionary, just sensible.

2. The "10x Test"

Here’s my rule: if AI doesn’t make something 10x better or faster, don’t bother.

A 20% improvement? Nobody cares. Your users won’t notice. Your team won’t feel it. But 10x? That changes behavior.

Take my own experience. We built an AI Trade Recommender System that pulls data from more than a dozen APIs and generates daily trade ideas. At first, it just scraped headlines and spit out a basic summary. Technically helpful, but it wasn’t much better than skimming the Wall Street Journal yourself. Nobody used it.

So we rebuilt it. The system now generates sector-level insights, runs company-level filters, and delivers pre-packaged trade scenarios with performance recaps, all in under a minute. It’s gone from a novelty to something that I can rely on. That’s the 10x effect.

Research confirms this—a study on customer support agents found that AI assistance increased productivity by 34% on average, but the gains were concentrated in specific high-leverage use cases. The lesson? Go big or go home.

3. Build Moats, Not Features

This is crucial and almost everyone gets it wrong. AI features aren’t moats. Any competitor can call the same API you’re calling.

Real moats come from: your data—the stuff only you have; your workflow—how AI fits into your specific user journey; and your feedback loops—how you’re improving based on usage.

Harvey AI (the legal tech startup) gets this. They didn’t just plug in GPT-4 and call it a day. They’ve trained on millions of legal documents their law firm partners provided exclusively. That training data is their moat. A competitor can’t just spin up the same thing overnight.

What AI Actually Looks Like Inside Real Startups

Let me walk you through how actual companies are using AI, based on public information and founder interviews:

Cursor (Developer Tools)

Cursor isn’t trying to replace programmers. They built an IDE where AI reviews your code in real-time, spots bugs, and suggests fixes. Not architectural decisions, just the "you forgot to handle this edge case" stuff.

The result? Developers using Cursor report reviewing 3x more code because they’re not hunting for trivial bugs. Junior developers learn faster because the AI explains why something might break.

They’re running models locally when possible. No data has to leave your machine for basic features. Enterprise customers love that, especially since 56% of organizations cite data privacy as their top AI concern.

Descript (Content Creation Platform)

Descript started as a podcast editing tool but found their killer AI feature: removing filler words and awkward pauses automatically. Not sexy, but it saves editors hours.

Then they added something clever—AI that identifies which customers are struggling based on usage patterns. Instead of automated help emails, it alerts the support team for personal outreach.

Conversion rate on those personal touches? Way higher than any automated campaign. The AI enables human connection at scale.

Jasper (Content Platform)

Jasper tried being an "AI writer" at first. Realized everyone else was doing the same thing. They pivoted to something smarter: AI that helps you plan content strategy, not just write.

Their AI analyzes what’s working in your industry, identifies content gaps, and suggests topics with traffic potential. Writers still write, they just write smarter things.

They went from near-death to $125M ARR. The lesson? Using AI for ideation beats using it for generation.

The Hiring Reality Nobody Admits

You know what I’m tired of? "Prompt engineer" job postings. It’s like hiring a "Google search specialist" in 2005.

The bad news: junior roles are evaporating. Startups have been replacing three junior analysts with one senior analyst plus AI tools. The senior analyst is more productive than all four would’ve been together. MIT research found that "AI tools can boost junior worker productivity by up to 43%, effectively erasing the experience gap."

The good news: if you’re smart about AI, you can hire differently. Companies can now hire brilliant people straight from college who would’ve typically needed three years of experience. But they are scary good with AI tools. They perform like a senior employee at junior salary.

The reality: your best AI hire isn’t an AI expert. It’s someone who deeply understands your problem space and can leverage AI to solve it.

Only 44% of companies have formal AI governance, but the ones that do report that business-side ownership beats tech-side ownership for ROI.

Product Design in the AI Era (The Stuff That Works)

Here’s the key takeaways from my research on AI product design from dozens of launches:

Make AI invisible when possible. The best AI features don’t scream "POWERED BY AI!" They just work. Superhuman doesn’t advertise their AI. They just surface the important emails. Notion’s AI isn’t the hero—it’s a helpful assistant that appears when needed.

Always provide an escape hatch. Every AI feature needs an "actually, let me do this myself" button. Users need control, especially when AI inevitably does something weird.

Design for AI failure. Your AI will fail. It will hallucinate. It will confidently give wrong answers. Design for this reality. Good example: Casetext (before their $650M exit to Thomson Reuters) always showed confidence scores and linked to source documents. Users could verify everything. Trust through transparency.

The number of reported AI incidents increased 56% in 2024 alone. The companies that survived? They planned for failure from day one.

The Next 18 Months (Buckle Up)

I’m not a fortune teller, but here’s what’s obvious if you’re paying attention:

1. AI Agents Are Coming (And They’re Terrifying)

We’re about to have AI that can actually do things—not just chat. I’m talking about AI that can navigate websites, fill out forms, make purchases, send emails—real actions.

Companies like Adept and Lindy are already building this. Lindy’s AI can manage your calendar, triage emails, and handle customer support tickets autonomously. It’s not replacing workers—it’s giving them superpowers.

Gartner calls this "Agentic AI" and predicts it’ll be mainstream by 2026. I think they’re being conservative.

2. The Commodity Trap

Basic AI features are becoming commodities. If your differentiation is "we have a chatbot," you’re already dead.

The winners will be whoever builds the best workflow around AI, not whoever has the best AI. Think Uber vs. taxis—Uber didn’t invent cars or GPS, they just packaged them better.

Spending on genAI will hit $644B this year, but most of it will burn on me-too features. Don’t be part of that statistic.

3. Regulation Is Coming (And It’s Going to Be Messy)

The EU is already moving. The US doubled its AI-related regulations in 2024 alone. If you’re touching anything in healthcare, finance, or hiring with AI, start documenting your decision processes now.

Scale AI (the data labeling company) built an entire compliance product because they saw this coming. They’re now selling "AI audit trails" to enterprises terrified of regulatory risk.

In healthcare specifically, 95% of executives expect AI to transform the industry, but only half are seeing ROI, mostly because they’re paralyzed by regulatory uncertainty. The startups that figure out compliance will clean up.

The Uncomfortable Truth

Every hype cycle is the same—cloud, mobile, crypto. The ones who win aren’t the first to experiment—they’re the first to solve something real.

Most startups using AI will fail. Not because the technology doesn’t work, but because they’re solving the wrong problems.

I see founders building AI tools to write tweets faster. Meanwhile, their customer onboarding takes 45 minutes. That’s the problem to solve.

The opportunity isn’t in doing AI things. It’s in doing real things better with AI.

Look at the data: 61% of organizations using AI for supply chain are seeing cost reductions. 70% using it for strategic planning report revenue gains. But social media post generators? Nobody’s getting rich off those.

Your Move

Stop reading Medium articles about AI trends. Stop going to AI meetups where everyone’s building the same chatbot.

Instead, do this: list your three biggest operational headaches. Pick the simplest one. Spend a weekend trying to automate it with the dumbest possible AI solution. If it works, great. If not, you learned something.

The founders who win in the AI era won’t be the ones with the fanciest models or the biggest GPU clusters. They’ll be the ones who figure out how to solve real problems for real people—and happen to use AI along the way.

One last thing, and this is important. Everyone’s acting like AI changes everything. It doesn’t. Customers still want their problems solved. They still want to feel heard. They still want products that work.

AI is just a really powerful tool. Like cloud computing was. Like mobile was. Like the internet was.

Use it wisely. Or don’t. But please, for the love of everything holy, stop adding chatbots to everything.

← Back to blog