Inference Startup Inferact Lands $150M To Commercialize vLLM

Inferact, the startup behind open source AI inference engine vLLM, secures $150M at an $800M valuation to accelerate real-world AI deployment
Matilda
Inferact Raises $150M to Power AI Inference with vLLM

What happens when one of the most widely used open source tools for running AI models becomes a company? The answer is Inferact, a new startup founded by the original creators of vLLM, which just raised $150 million in seed funding at a staggering $800 million valuation. With backing from top-tier investors like Andreessen Horowitz and Lightspeed Venture Partners, Inferact is poised to reshape how businesses deploy large language models (LLMs) in production: faster, cheaper, and more efficiently.

Credit: Askold Romanov/iStock

As AI shifts from flashy demos to real-world applications, the bottleneck isn’t training anymore; it’s inference: the process of actually using trained models to generate responses, analyze data, or power user-facing features. That’s where vLLM, and now Inferact, come in.

Why Inference Is the New Battleground in AI

For years, headlines focused on who could train the biggest model. But in 2026, the industry’s attention…