Published on August 5, 2025
By Anshika Mathews
In Enterprise AI

2025 Is The Year Of AI Video For Fal

We suspected it was going to be big, but we had no idea it would be this big.

In AI, timing is everything. Large language models have already reshaped software, search, and customer service. Video, meanwhile, remains a technical frontier which is highly anticipated, but still limited by latency, infrastructure, and accessibility. Fal, a generative media infrastructure company founded in 2021, is preparing for what it calls the “ChatGPT moment” for video and just raised $125 million to make sure it’s ready when it arrives.

“We don’t think the ChatGPT moment for video has happened yet,” said CEO Burkay Gur. “But we’re close.”

That anticipation has shaped nearly every decision Fal has made over the past two years from its early focus on inference infrastructure to its refusal to pivot toward large language models, even when growth temporarily stalled. Instead, the company concentrated on performance, speed, and usability across image, video, audio, and 3D workloads. The result: over 1 million developers, more than 100 enterprise customers, and billions of AI-generated assets served monthly.

Fal’s Series C round, led by Meritech Capital with participation from Salesforce Ventures, Shopify Ventures, and Google’s AI Futures Fund, values the company at $1.5 billion. The funds will be used to expand GPU capacity, grow the team, and scale Fal Serverless, a new platform for running custom generative media models.

“2025 is the year of AI video,” said Gur. “We suspected it was going to be big, but we had no idea it would be this big.”

From Python Optimization to Real-Time Generative Infrastructure

Fal was founded by Gur and longtime friend Gorkem Yurtseven. The original focus was machine learning tooling and compute optimization for Python-based workloads. Gur, who had previously worked on ML systems at Coinbase, believed there was a gap in infrastructure that could handle high-throughput, low-latency inference.

That thesis evolved rapidly. As diffusion models like DALL-E and Stable Diffusion gained popularity in late 2022, Fal began experimenting with inference-level optimization. The team built its own inference engine and began benchmarking performance across image models.

“We got pretty obsessed about how do we optimize these models? How do you run them more efficiently? And that’s kind of how we entered this market,” Gur said.

Performance became a differentiator. Fal’s platform could run diffusion models two to three times faster than standard solutions. This speed attracted early users building generative design tools, image editors, and creative platforms. By mid-2023, Fal’s infrastructure was powering real-time media generation for customers like Adobe, Canva, and Shopify.

“Latency kills creativity and productivity,” Gur said. “That’s why we focus on performance.”

Staying Focused While Others Chased LLMs

When the market pivoted sharply toward LLMs in 2023, Fal made a deliberate decision not to follow. Despite a period of flat revenue, the company stayed focused on visual and multimodal models.

“We could have pivoted to LLM inference but we decided to focus on image and video, where we already had technical advantages,” Gur said. “Staying specific would help us stay closer to our users and understand their problems better.”

That strategy aligned with customer usage. All of Fal’s early users were building similar types of creative applications, giving the company tighter feedback loops and domain expertise. The platform now hosts video models like Wan Pro and Kling, image models like Flux, and audio models from providers like Minimax.

Fal doesn’t host every model. Each one is benchmarked and tested for real-world performance before being added to the platform. “Many models from research labs or even big companies are cherry-picked—they only show the best results in demos,” said Gur. Fal runs internal benchmarks to verify quality, demand, and latency before rollout.

From Developer Tool to Creative Infrastructure

Fal’s platform was built with developers in mind, but the team quickly realized that performance and ease-of-use could serve a broader audience. The company now offers both API integrations and a web-based frontend that allows developers and non-technical users to chain models, experiment, and test workflows without friction.

“Turns out, if you build a really, really good developer product, you’ve also built a good product for non-technical people and those definitions are even changing now,” Gur said.

The company has also built its own CDN and serverless architecture for model deployment, allowing users to run open-source, private, or commercial models through a single interface without DevOps complexity.

Fal Serverless and the Path Forward

The newly announced Fal Serverless offering will allow enterprise teams and startups to deploy their own generative media models using Fal’s infrastructure stack. With GPU demand continuing to rise, the company is investing heavily in infrastructure expansion and operational scale.

“To be able to handle this tremendous growth, we are doubling down in all areas of our business,” Gur said. “More support, more go-to-market, and a larger GPU fleet.”

In the past year, Fal’s revenue has grown 60x, fueled by increased usage across image and video workloads. Despite the scale, Gur maintains that the market is still in its early stages.

“We are very certain that it is still early for generative media models,” he said.

Creation Is Cheap. Creativity Is Not.

While generative AI has sparked resistance from creative communities, Gur believes adoption is accelerating. Rather than replacing artists, he sees tools like Fal as reducing the technical and financial barriers to production.

“These models really just reduce the cost of creation by insane amounts,” Gur said. “But the cost of creativity is still high.”

That distinction between creation and creativity is Fal’s long-term strategy. Rather than building end-user apps, it aims to be the platform that makes real-time, personalized content generation possible across industries. For now, the ChatGPT moment for video remains a future event. But they want to be synonymous with generative media as Gur said.

📣 Want to advertise in AIM Media House? Book here >

Anshika Mathews

Anshika is the Global Media Lead for AIM Media House. She holds a keen interest in technology and related policy-making and its impact on society. She can be reached at anshika.mathews@aimmediahouse.com

Global leaders, intimate gatherings, bold visions for AI.

CDO Vision World Series

CDO Vision is a premier, year-round networking initiative connecting top Chief
Data Officers (CDOs) & Enterprise AI Leaders across major cities worldwide.

2025 Is The Year Of AI Video For Fal

From Python Optimization to Real-Time Generative Infrastructure

Staying Focused While Others Chased LLMs

From Developer Tool to Creative Infrastructure

Fal Serverless and the Path Forward

Creation Is Cheap. Creativity Is Not.

Microsoft Deepens Lambda Partnership With Massive AI Infrastructure Deployment

Vertiv’s Not Here for the Heat

Hippocratic AI Raises $126 Million, Valuing the Company at $3.5 Billion

AUI Raises $20 Million and Raises Questions About the Transformer Era

What Kroger Is Doing With AI Amid CEO Changes and Layoffs

Before We Launch AI Into Space, Let’s Fix It on Earth

Does the Palantir Earnings Report Mask a Valuation Bubble?

The Edge Is the New Data Center, According to Cisco

Who’s Making Sure AI Gets It Right? Campbell Brown Says Humans

Inside Steve Madden’s AI-Driven Makeover

At Albertsons, the Chatbot Helps You Shop and the AI Keeps It Fresh

This Week’s AI-Driven Corporate Layoffs Explained

Perplexity Patents Turns IP Search Into a Conversation

Etsy Names Kruti Patel Goyal as New CEO

IBM Consulting Appoints Yogendra (Yogi) Goyal to Lead Its Global AI-First Business Operations

Explore our year-round AI events across U.S. cities >>