Funding
Tripo AI Raises $150M Series A3 as 3D Foundation Models Move Toward Interactive Worlds

Tripo AI has raised more than $150 million in Series A3 financing, adding a broad mix of strategic and financial investors across the automotive, gaming, internet, and technology sectors. The round signals growing interest in AI systems that can generate, edit, simulate, and eventually sustain 3D environments, a category that is beginning to move beyond creator tools and into infrastructure for interactive entertainment, intelligent manufacturing, virtual reality, and embodied AI.
A Cross-Sector Round for a Cross-Sector Technology
The financing brought together investors with very different commercial interests. Automotive backers included Geely Capital, while gaming companies 4399 Network, Tanwan, and Giant Network also participated. Strategic investors Fosun Capital and Orinno Capital joined the round, alongside financial investors including CoStone Capital, Addor Capital, T-Capital, and Muhua Tech Ventures. Existing shareholders INCE Capital and Genesis Capital also increased their investments.
That investor mix matters because 3D generation is not confined to one market. For gaming companies, faster 3D asset creation can reshape production pipelines. For automotive and manufacturing investors, AI-generated 3D models point toward design iteration, simulation, training environments, and digital twins. For internet and entertainment companies, the long-term opportunity is more interactive content, where users do not simply consume flat media but enter, modify, and share persistent digital spaces.
Tripo’s Core Wedge: Turning Ideas Into 3D Assets
Tripo’s product ecosystem is built around turning text, images, or sketches into 3D assets through an AI workspace. The company’s tools cover text-to-3D, image-to-3D, AI texturing, model segmentation, auto rigging, stylization, Smart Mesh, and high-detail model generation, with integrations for workflows involving Blender, Unity, Unreal Engine, ComfyUI, Cocos, and Godot.
That places Tripo in one of the more technically difficult corners of generative AI. Image and video models operate in two-dimensional outputs, while 3D models need usable geometry, topology, texture, scale, export formats, and downstream compatibility. A 3D asset that looks good in a preview is not necessarily ready for a game engine, animation pipeline, 3D printer, VR environment, or simulation system.
Tripo’s developer platform extends that model into applications and services, giving teams API access for converting text and image inputs into 3D models. This is an important part of the company’s strategy because the market for 3D generation will likely depend not only on standalone tools, but also on how deeply these models can be embedded into creative software, commerce platforms, gaming workflows, industrial design tools, and enterprise systems.
From Faster 3D Generation to More Usable Outputs
The company says it has released a series of advanced 3D foundation models over the past six months, including Tripo H3.1 and Tripo P1.0. It has also introduced capabilities such as 8K Texture Generation and Segmentation V2, both of which point to a larger shift in the category: the goal is no longer just to generate a recognizable 3D object, but to create something editable, detailed, and useful inside professional workflows.
This is where features like segmentation become important. Splitting complex models into structured, editable parts can make a generated model easier to refine, repurpose, or integrate into a larger project. For creators, that reduces manual cleanup. For games, animation, and product design, it can help move AI-generated assets from rough concepts toward assets that can be rigged, animated, textured, assembled, or printed.
The next stage of AI 3D competition will likely be defined less by novelty and more by reliability: whether generated assets can survive the handoff into real production environments.
Project Eden Pushes Tripo Toward World Models
The more ambitious part of Tripo’s roadmap is Project Eden, its world model research preview. Rather than treating “world models” as another name for video generation, Tripo frames Eden around persistent, editable environments that maintain state over time.
The technical distinction is important. A video model can generate the next frame, but it does not necessarily understand that an object should still exist after it leaves the camera view. A static 3D scene can provide structure, but it does not necessarily support change, memory, physics, or multiplayer interaction. Eden is designed around separating the underlying world state from rendering, so the environment exists before any camera looks at it and can be updated by actions, rules, and feedback.
That direction could eventually connect Tripo’s 3D asset generation work with embodied AI, simulation, robotics, multiplayer games, and training environments. The company has been careful to frame Eden as an early research preview rather than a finished general-purpose world model, but the strategic direction is clear: Tripo is trying to move from generating 3D objects to generating and maintaining interactive worlds.
A Sign of the Next Generative AI Layer
Tripo’s new financing is not just another AI fundraise. It reflects a broader market belief that the next major layer of generative AI may be spatial. The internet has spent decades moving from text to images to video. The next step is likely to involve interactive 3D environments that users, developers, and AI agents can manipulate in real time.
The challenge for Tripo will be execution. AI-generated 3D needs to be fast, but speed alone is not enough. The assets must be structurally sound, visually detailed, editable, exportable, and dependable across professional tools. World models face an even harder test: persistence, physics, consistency, and multi-agent coordination.
With more than $150 million in new Series A3 financing, Tripo AI now has fresh capital to pursue both sides of that problem: making 3D creation easier today, while building toward interactive worlds that behave less like generated media and more like persistent digital environments.












