Positron Is Betting on Large Models. The Market Is Thinking Small

As enterprises embrace small language models, Positron’s chips may face a shrinking future
Positron’s message to the AI world is this: the language models of the future will be massive, and they’ll need inference chips like Atlas to run. The startup, founded in 2023, makes an inference accelerator it says blows past Nvidia’s GPUs in both energy and cost efficiency: “2x to 5x performance per watt and dollar,” according to CEO Mitesh Agrawal. Atlas is air-cooled, runs Nvidia-trained models without any code rewrites, and was shipped into production just 15 months after launch. Early customers include Cloudflare and Parasail. That’s fast execution. Investors seem impressed: Positron just closed a $51.6 million Series A led by Valor Equity, Atreides, and DFJ. Next on the roadmap is Titan, a 2026 system built around Positron’s custom “Asimov” ASICs. The claim?
Subscribe or log in to Continue Reading

Uncompromising innovation. Timeless influence. Your support powers the future of independent tech journalism.

Already have an account? Sign In.

📣 Want to advertise in AIM Media House? Book here >

Picture of Mukundan Sivaraj
Mukundan Sivaraj
Mukundan is a writer and editor covering the AI startup ecosystem at AIM Media House. Reach out to him at mukundan.sivaraj@aimmediahouse.com.
25 July 2025 | 583 Park Avenue, New York
The Biggest Exclusive Gathering of CDOs & AI Leaders In United States

Subscribe to our Newsletter: AIM Research’s most stimulating intellectual contributions on matters molding the future of AI and Data.