Forbes Top 50 AI 2024 Dataset
A reference dataset built from the Forbes 2024 Top 50 AI Companies list — primary focus, funding, and target user, with a quick read on where the money and the product categories are clustering.
Forbes’ 2024 Top 50 AI companies list is a useful snapshot of where venture capital, applied research, and product attention are landing in commercial AI. I’ve cleaned and reorganised the list here as a quick-reference dataset — primary focus, total funding (where disclosed), and the dominant target user for each company.
The dataset
| Company | Primary Focus | Funding | Target User |
|---|---|---|---|
| OpenAI | General-purpose foundation models, ChatGPT | $11.3B | Developers, Enterprise, Consumer |
| Anthropic | Foundation models with safety focus (Claude) | $7.7B | Developers, Enterprise |
| xAI | Foundation models (Grok), integration with X | $6.0B | Consumer, Developers |
| Databricks | Data + AI platform, MosaicML | $4.2B | Enterprise data teams |
| Anduril Industries | AI for defence and autonomous systems | $3.7B | Defence & Government |
| Mistral AI | Open-weight foundation models from Europe | $1.3B | Developers, Enterprise |
| Cohere | Enterprise LLM platform | $970M | Enterprise developers |
| Glean | Enterprise search and assistants | $618M | Knowledge workers |
| Hugging Face | Open-source models & community hub | $395M | Developers, Researchers |
| Together AI | Open-source AI cloud and fine-tuning | $228M | Developers |
| Sakana AI | Research-led model architectures | $244M | Researchers, Enterprise |
| Perplexity | AI-powered search engine | $165M | Consumer, Knowledge workers |
| Runway | Generative video and creative tools | $237M | Creators, Studios |
| ElevenLabs | Voice cloning and generation | $101M | Creators, Enterprise |
| Suno | Generative music creation | $125M | Consumer, Creators |
| Cresta | AI for contact centres | $271M | Customer service teams |
| Sierra | AI customer service platform | $110M | Enterprise CX teams |
| Writer | Enterprise generative AI writing platform | $326M | Enterprise marketing & content |
| Jasper | Marketing-focused generative AI | $131M | Marketing teams |
| Harvey | AI for legal work | $206M | Law firms, Legal departments |
| EvenUp | AI for personal injury claims | $135M | Plaintiff law firms |
| Hippocratic AI | Healthcare-safe generative AI | $278M | Healthcare providers |
| Abridge | Clinical conversation transcription | $212M | Hospitals & clinicians |
| Tempus AI | AI-driven precision medicine | $1.3B | Clinicians, Pharma |
| OpenEvidence | Medical knowledge assistant | $50M | Clinicians |
| Photoroom | AI photo editing | $63M | Small business, Creators |
| Captions | AI video editing for creators | $100M | Creators |
| Pika | Generative video for creators | $135M | Creators, Consumer |
| Mercor | AI-driven talent matching | $35M | Recruiters, Candidates |
| Cognition Labs | Autonomous software engineering (Devin) | $196M | Engineering teams |
| Magic | Code generation and agentic coding | $465M | Engineering teams |
| Codeium | AI coding assistant | $243M | Developers |
| Cursor (Anysphere) | AI-native code editor | $173M | Developers |
| Replit | Cloud IDE + AI coding agent | $239M | Developers, Learners |
| Poolside | Foundation models for software engineering | $626M | Developers, Enterprise |
| Crusoe Energy | AI cloud built on stranded energy | $1.7B | AI labs, Enterprise |
| CoreWeave | GPU-cloud infrastructure | $12.6B | AI labs, Enterprise |
| Lambda | GPU cloud and workstations | $863M | AI teams, Researchers |
| Vast Data | Storage built for AI workloads | $381M | Enterprise data teams |
| Pinecone | Vector database | $138M | Developers, Enterprise |
| Sambanova | AI compute systems | $1.1B | Enterprise, Government |
| Cerebras | Wafer-scale AI hardware | $720M | AI labs, Enterprise |
| Groq | Inference-focused AI hardware | $640M | AI labs, Enterprise |
| Etched | Transformer-specific ASICs | $125M | AI labs, Enterprise |
| World Labs | Spatial intelligence and 3D worlds | $230M | Game studios, Robotics |
| Skild AI | Foundation models for robotics | $300M | Robotics, Industrial |
| Figure AI | Humanoid robots | $854M | Industrial, Logistics |
| Physical Intelligence | Generalist robotic intelligence | $470M | Robotics, Industrial |
| Waabi | Autonomous trucking | $283M | Logistics & freight |
| Imbue | Agentic reasoning systems | $220M | Enterprise |
Funding figures are rounded totals as reported by Forbes, Pitchbook and Crunchbase at the time of the 2024 list. They are a directional signal of capital scale, not a precise live figure.
Key trends
A few patterns jump out of the list:
- Foundation models still dominate at the top end of capital. OpenAI, Anthropic, xAI and Databricks together represent the bulk of disclosed funding.
- Infrastructure is the second-largest cluster. CoreWeave, Crusoe, Lambda, Sambanova, Cerebras, Groq, and Etched are all infrastructure plays — capital is flowing as much to the picks and shovels as to the model builders.
- Vertical AI is the most diverse category. Healthcare (Hippocratic, Abridge, Tempus, OpenEvidence), legal (Harvey, EvenUp), and customer experience (Cresta, Sierra) are all attracting hundreds of millions, with very different go-to-market motions to the general-purpose model providers.
- Developer tools are an arms race. Cursor, Codeium, Magic, Cognition Labs, Replit, and Poolside are competing across IDE, completion, and agentic coding — a market that didn’t exist commercially three years ago.
- Creative AI is consolidating around video. Runway, Pika, Captions, ElevenLabs, and Suno are pushing the modality from text and image into video and audio, with consumer-grade quality emerging in 2024.
Market insights
Three observations from sitting with the dataset:
- The Top 50 list is bimodal. A handful of foundation-model and infrastructure companies own most of the dollars; the rest of the list is smaller vertical plays. The “middle” of the market is thinner than the headlines suggest.
- Capital efficiency varies wildly. Some companies on the list have raised ten times what others have for similar product surface area, often because the underlying compute or research cost is structurally higher.
- The “AI for X” pattern is winning. Most of the new entrants on the list are not horizontal model builders — they are AI-native applications for a specific industry, with proprietary data and workflow as the defensible asset.
Useful to come back to whenever there’s a “where is AI investment going?” conversation in a planning meeting.