All work
Article

Forbes Top 50 AI 2024 Dataset

A reference dataset built from the Forbes 2024 Top 50 AI Companies list — primary focus, funding, and target user, with a quick read on where the money and the product categories are clustering.

AI Documentation
Forbes Top 50 AI 2024 Dataset cover image

Forbes’ 2024 Top 50 AI companies list is a useful snapshot of where venture capital, applied research, and product attention are landing in commercial AI. I’ve cleaned and reorganised the list here as a quick-reference dataset — primary focus, total funding (where disclosed), and the dominant target user for each company.

The dataset

CompanyPrimary FocusFundingTarget User
OpenAIGeneral-purpose foundation models, ChatGPT$11.3BDevelopers, Enterprise, Consumer
AnthropicFoundation models with safety focus (Claude)$7.7BDevelopers, Enterprise
xAIFoundation models (Grok), integration with X$6.0BConsumer, Developers
DatabricksData + AI platform, MosaicML$4.2BEnterprise data teams
Anduril IndustriesAI for defence and autonomous systems$3.7BDefence & Government
Mistral AIOpen-weight foundation models from Europe$1.3BDevelopers, Enterprise
CohereEnterprise LLM platform$970MEnterprise developers
GleanEnterprise search and assistants$618MKnowledge workers
Hugging FaceOpen-source models & community hub$395MDevelopers, Researchers
Together AIOpen-source AI cloud and fine-tuning$228MDevelopers
Sakana AIResearch-led model architectures$244MResearchers, Enterprise
PerplexityAI-powered search engine$165MConsumer, Knowledge workers
RunwayGenerative video and creative tools$237MCreators, Studios
ElevenLabsVoice cloning and generation$101MCreators, Enterprise
SunoGenerative music creation$125MConsumer, Creators
CrestaAI for contact centres$271MCustomer service teams
SierraAI customer service platform$110MEnterprise CX teams
WriterEnterprise generative AI writing platform$326MEnterprise marketing & content
JasperMarketing-focused generative AI$131MMarketing teams
HarveyAI for legal work$206MLaw firms, Legal departments
EvenUpAI for personal injury claims$135MPlaintiff law firms
Hippocratic AIHealthcare-safe generative AI$278MHealthcare providers
AbridgeClinical conversation transcription$212MHospitals & clinicians
Tempus AIAI-driven precision medicine$1.3BClinicians, Pharma
OpenEvidenceMedical knowledge assistant$50MClinicians
PhotoroomAI photo editing$63MSmall business, Creators
CaptionsAI video editing for creators$100MCreators
PikaGenerative video for creators$135MCreators, Consumer
MercorAI-driven talent matching$35MRecruiters, Candidates
Cognition LabsAutonomous software engineering (Devin)$196MEngineering teams
MagicCode generation and agentic coding$465MEngineering teams
CodeiumAI coding assistant$243MDevelopers
Cursor (Anysphere)AI-native code editor$173MDevelopers
ReplitCloud IDE + AI coding agent$239MDevelopers, Learners
PoolsideFoundation models for software engineering$626MDevelopers, Enterprise
Crusoe EnergyAI cloud built on stranded energy$1.7BAI labs, Enterprise
CoreWeaveGPU-cloud infrastructure$12.6BAI labs, Enterprise
LambdaGPU cloud and workstations$863MAI teams, Researchers
Vast DataStorage built for AI workloads$381MEnterprise data teams
PineconeVector database$138MDevelopers, Enterprise
SambanovaAI compute systems$1.1BEnterprise, Government
CerebrasWafer-scale AI hardware$720MAI labs, Enterprise
GroqInference-focused AI hardware$640MAI labs, Enterprise
EtchedTransformer-specific ASICs$125MAI labs, Enterprise
World LabsSpatial intelligence and 3D worlds$230MGame studios, Robotics
Skild AIFoundation models for robotics$300MRobotics, Industrial
Figure AIHumanoid robots$854MIndustrial, Logistics
Physical IntelligenceGeneralist robotic intelligence$470MRobotics, Industrial
WaabiAutonomous trucking$283MLogistics & freight
ImbueAgentic reasoning systems$220MEnterprise

Funding figures are rounded totals as reported by Forbes, Pitchbook and Crunchbase at the time of the 2024 list. They are a directional signal of capital scale, not a precise live figure.

A few patterns jump out of the list:

  • Foundation models still dominate at the top end of capital. OpenAI, Anthropic, xAI and Databricks together represent the bulk of disclosed funding.
  • Infrastructure is the second-largest cluster. CoreWeave, Crusoe, Lambda, Sambanova, Cerebras, Groq, and Etched are all infrastructure plays — capital is flowing as much to the picks and shovels as to the model builders.
  • Vertical AI is the most diverse category. Healthcare (Hippocratic, Abridge, Tempus, OpenEvidence), legal (Harvey, EvenUp), and customer experience (Cresta, Sierra) are all attracting hundreds of millions, with very different go-to-market motions to the general-purpose model providers.
  • Developer tools are an arms race. Cursor, Codeium, Magic, Cognition Labs, Replit, and Poolside are competing across IDE, completion, and agentic coding — a market that didn’t exist commercially three years ago.
  • Creative AI is consolidating around video. Runway, Pika, Captions, ElevenLabs, and Suno are pushing the modality from text and image into video and audio, with consumer-grade quality emerging in 2024.

Market insights

Three observations from sitting with the dataset:

  1. The Top 50 list is bimodal. A handful of foundation-model and infrastructure companies own most of the dollars; the rest of the list is smaller vertical plays. The “middle” of the market is thinner than the headlines suggest.
  2. Capital efficiency varies wildly. Some companies on the list have raised ten times what others have for similar product surface area, often because the underlying compute or research cost is structurally higher.
  3. The “AI for X” pattern is winning. Most of the new entrants on the list are not horizontal model builders — they are AI-native applications for a specific industry, with proprietary data and workflow as the defensible asset.

Useful to come back to whenever there’s a “where is AI investment going?” conversation in a planning meeting.

Like to have a chat?

Always happy to talk about product, design systems, conversion, or building teams that ship.