UniversalAGI

Highlights

Generalization. SUV-PT generalizes across SUVs of different body types. It demonstrates impressive zero-shot generalization without fine-tuning on SUV geometries that the model has not seen.
Fully automated large scale data collection. We created an automated, expert-validated simulation pipeline creating millions of geometries and simulations.
Multiple global conditions.¹ SUV-PT supports a wide range of air density and inlet velocity conditions.
Built-in uncertainty quantification. Alongside every prediction, SUV-PT outputs a per-point uncertainty quantification (see Figure 2).²
Days → Seconds. Expert level knowledge and days of High-Performance Computing (HPC) compute for meshing + solving in traditional CFD are collapsed into seconds at a fraction of cost, and accessible to everyone.
Access. Try out SUV-PT on platform.universalagi.com, or programmatically through our API (see Appendix A4). Upload any SUV geometry and get accurate results in seconds (see Figure 1).

1 Simulation is the bottleneck in the design loop

Designing a vehicle body is an iterative loop. A designer proposes a shape. Engineers evaluate its aerodynamic performance. The result informs the next shape. The loop repeats for weeks to months, and almost all of that time goes to a single step: evaluating the design.

The most accurate way to measure a vehicle's aerodynamics is a physical wind tunnel. But that needs a built vehicle, exactly what does not yet exist during design. Modern design relies on CFD simulation instead. CFD approximates the flow around a body by solving the governing equations of fluid dynamics over a discretized model of the vehicle and the surrounding air (see Appendix A1). Accurate enough to guide design decisions without building the car, CFD has become the industry standard. Running it correctly is difficult and slow, for four main reasons.

Geometry cleanup. Raw CAD geometries are rarely simulation-ready. Surfaces have gaps, overlaps, and non-watertight regions that a solver cannot mesh, and removing them takes manual repair and engineering judgment before any meshing can begin.
Meshing. The vehicle and the space around it must be discretized into a volume of millions of cells, called a mesh. Meshing requires domain expertise, as it is error prone, and often takes several iterations of its own.
Configuration. A simulation cannot be set up from a fixed template. Driving it to a converged, trustworthy solution takes specialist judgment, monitoring, and revision.
Solve. The computation is distributed across the nodes of a high-performance computing (HPC) cluster, and even then a single high-fidelity run takes hours to days, depending on fidelity and compute.

Compute and expert time are the visible costs. In a competitive environment, the iteration cycles they impose are even more expensive. Because each evaluation takes hours to days and a trained specialist to set up, only a handful of shapes can be afforded to run simulations on. Design cycles take months, the design space is sampled sparsely, and the most efficient shape might not be among the few that are evaluated.

An ideal process would invert this. It would be accurate enough to trust, return results in seconds, and run without geometry cleanup, meshing or solver expertise. A designer could then extensively search the design space. Rather than at a few points, they can compare thousands of variants in seconds, and select the most efficient one that still meets brand identity and customer needs. Because each evaluation takes seconds, aerodynamic performance can guide the shape as it is designed, instead of being measured only after the design is finished.

2 SUV-PT predicts a vehicle's aerodynamics in seconds

As a foundational model that understands a vehicle's input geometry and its aerodynamics, SUV-PT sidesteps the need for geometry cleanup, meshing, configuration, and managed HPC. SUV-PT produces velocity fields, surface pressure, wall shear, and the integrated drag and lift in seconds. It also reports a per-point uncertainty quantification (UQ) estimate alongside each field. Figure 2 shows the surface fields for SUVs that the model has not seen. Each row compares the reference high-fidelity CFD field, SUV-PT's prediction, their difference, and SUV-PT's uncertainty quantification.

SUV surface field in four labeled columns: reference CFD, SUV-PT prediction, their difference (Error), and SUV-PT uncertainty quantification. — Figure 2: SUV-PT accurately predicts external aerodynamics on unseen geometries. Each row is one SUV vehicle, with the reference, high-fidelity CFD field (column 1), SUV-PT's prediction (column 2), their difference (column 3), and SUV-PT's uncertainty quantification results (column 4).

A second SUV in the same four columns: reference CFD, SUV-PT prediction, difference, and uncertainty quantification. — Figure 2: SUV-PT accurately predicts external aerodynamics on unseen geometries. Each row is one SUV vehicle, with the reference, high-fidelity CFD field (column 1), SUV-PT's prediction (column 2), their difference (column 3), and SUV-PT's uncertainty quantification results (column 4).

SUV-PT estimates uncertainty during inference. The resulting field is physically interpretable across the vehicle surface (see Figure 2). In the front half of the vehicle, before major separation, model uncertainty and accuracy follow the same pattern over various designs. After flow separation, the uncertainty shifts from shared-aerodynamic patterns to design-specific features, highlighting wake-sensitive components of each geometry. The uncertainty field reflects meaningful aerodynamic structure, both in attached flow regions where the flow is more constrained and in separated regions where each design produces distinct wake behavior.

2.1 SUV-PT outputs high-fidelity full field CFD

We evaluate on three benchmarks of increasing difficulty: DrivAerML [4], an open-source reference dataset; SUV-Bench Medium; and SUV-Bench Hard (see Appendix A3). We measure agreement with high-fidelity CFD using three metrics: $C_{d} A$ MAPE, the mean error on drag area (lower is better); $C_{d} A$ Max APE, the worst-case error on any single design (lower is better); and the Spearman rank correlation (higher is better), which captures whether the model ranks designs correctly (see Appendix A2).

Benchmark	CdA MAPE	CdA Max APE	Spearman Rank
DrivAerML	1.1%	3.4%	0.97
SUV-Bench Medium	2.0%	5.6%	0.96
SUV-Bench Hard	16.8%	49.1%	0.54

Table 1: SUV-PT drag-area accuracy against high-fidelity CFD across three benchmarks of increasing difficulty. We report mean and worst-case

C_{d} A

percentage error, and Spearman rank correlation (lower is better for the error columns, higher for rank).

2.2 SUV-PT is trained in stages

We train SUV-PT in a similar way modern language models are trained. Training undergoes successive stages that move from large volumes of lower-fidelity data to volumes of high-fidelity data.

Lower-fidelity pre-training. The model first learns the general structure of flow over vehicle bodies from a large volume of approximate simulations.
High-fidelity pre-training. It is then refined on accurate, expensive simulations that resolve flow details.
Domain fine-tuning (optional). Finally, it can be specialized to a customer's vehicle class or operating conditions.

Across these stages, SUV-PT is trained on millions of geometries and simulations. These simulations consist of both steady-state and time series of transient-state flow fields. The model can predict both surface properties as well as volume flow distribution.³

2.3 Generating the training data is challenging

As in most of machine learning, data is a binding constraint. High-fidelity aerodynamic data does not exist publicly at the scale, diversity, and fidelity a physics model requires. So we generate it ourselves. This introduces three main challenges.

The data must be trustworthy. Our in-house expert simulation team produces accurate, high-fidelity, and validated data that serve as ground truth.
The data itself is expensive, because each run takes hours to days of compute.
The data is large. The full corpus spans multiple petabyte.

2.4 Training at this scale is a systems problem as much as a modeling one

A useful comparison is to modern language models. The largest corpora with trillions of tokens hardly reach the scale of petabytes. In CFD, a single training sample is a complete simulation field in space, reaching a hundred million floating point vectors, on the order of gigabytes. And because the samples are so large, the bottleneck is not only the computation. It is transferring data from storage to the GPUs fast enough to keep them utilized. The model itself adds another source of complexity. To work across, for example, vehicles, it has to handle geometries of different sizes, meshes with millions of points and different fidelities, and multiple global flow conditions.

SUV-PT is an achievement of our creative cloud infrastructure and training platform optimized for both throughput and resilience, churning through petabytes of data across multiple GPU nodes.

3 LIFT is faster and more accurate than existing state-of-the-art physics models

To place our LIFT architecture in context, we compared it against the existing state-of-the-art physics models GeoTransolver [1] and Transolver [2]. Every model was given the same number of parameters, so the comparison reflects the architecture rather than its scale. A training epoch is one pass over each sample at a downsampled input resolution, while inference is run on the full sample.

On a single Nvidia A100 GPU, LIFT is the fastest to train and to serve, the former measured on a reduced training set, and it has the smallest memory footprint (see Table 2). Because GPU memory is a binding constraint, we also report training throughput per unit of peak memory, where LIFT is 5x more efficient than state-of-the-art architectures.

Architecture	P50 Inference latency	Training throughput	Peak GPU memory	Throughput / memory
LIFT	10.2 s	0.93 (samples/s)	13.9 GiB	0.067 samples/s/GiB
Transolver [2]	19.7 s	0.45 (samples/s)	21.1 GiB	0.021 samples/s/GiB
GeoTransolver [1]	23.6 s	0.41 (samples/s)	23.0 GiB	0.018 samples/s/GiB

Table 2: P50 inference latency (lower is better), training throughput (higher is better), peak GPU memory (lower is better), and throughput / memory (higher is better) by architecture, measured at matched parameter count on a single A100 GPU.

3.1 Training time: LIFT reaches the same loss in less wall-clock time

Holding the model size fixed, we trained each architecture on a reduced set to the same loss target and recorded wall-clock time. At a batch size of one, LIFT achieves 2.3x and 2.1x the throughput of GeoTransolver [1] and Transolver [2], respectively (see Figure 3).

Two bar charts comparing GeoTransolver, Transolver, and LIFT: median training time per epoch (left, lowest for LIFT) and training throughput in samples per second (right). — Figure 3: Training cost at a matched model size. Median wall-clock time per epoch (left, lower is better) and training throughput (right, higher is better) for each architecture.

3.2 Inference latency: LIFT outperforms the state-of-the-art

We measured single-sample inference latency on DrivAerML [4] surface data, using one NVIDIA A100 GPU per model and timing the forward pass only (see Figure 4). We report the latency distribution with its median (P50) and tails (P90, P95). LIFT has the lowest latency at every percentile, with a median of 10.2 s against 19.7 s for Transolver [2] and 23.6 s for GeoTransolver [1].

Inference-latency violin plots for GeoTransolver, Transolver, and LIFT with P50, P90, and P95 markers; LIFT has the lowest latency. — Figure 4: Inference latency (lower is better). Violin plots in three columns (GeoTransolver, Transolver, LIFT) with P50, P90, and P95.

4 SUV design in seconds

SUV-PT is served through a project-based platform and API. Each project holds a set of geometries and the conditions they are evaluated under. Within a project, a user can run a parameter sweep¹ across multiple boundary conditions and geometries and obtain results in seconds. The platform supports the comparisons a CAD workflow relies on. For example, variants placed side by side and profile distributions along the body. Results can be exported for downstream use. The same workflow is available through our API, so a geometry optimization procedure can infer SUV-PT directly and evaluate thousands of variants in seconds (see Appendix A4).

5 Availability

SUV-PT is live on our platform. Upload an SUV geometry and get its aerodynamics in under a minute. Open the platform to evaluate your own designs, and contact us to discuss deployments or domain customization for your use case.

The current public preview exposes a single condition set. The entire condition set is included with full model access.
UQ is included with full model access.
This release exposes the surface fields. Surface and volume fields are included with full model access.

Appendix

A1 CFD and meshing

CFD is used across many engineering and biomedical domains, including aircraft design, turbomachinery, building ventilation, and cardiovascular flow modeling. Here, we focus on vehicle aerodynamics, where CFD provides a numerical alternative to wind-tunnel testing. Rather than measuring airflow around a physical prototype, CFD solves discretized governing equations over a computational mesh to approximate the velocity and pressure fields around the vehicle.

At typical road speeds, vehicle aerodynamics is well described by the incompressible-flow approximation. Under this assumption, the fluid motion is governed by the incompressible Navier-Stokes equations:

\nabla \cdot u = 0,

ρ (\frac{\partial u}{\partial t} + (u \cdot \nabla) u) = - \nabla p + μ \nabla^{2} u + f .

Here $u$ denotes the velocity field, $p$ the pressure, $ρ$ the fluid density, $μ$ the dynamic viscosity, and $f$ an external body-force field. The first equation enforces mass conservation. For an incompressible fluid, air neither accumulates nor vanishes within the domain. The second equation enforces momentum conservation, balancing acceleration with pressure gradients, viscous stresses, and external body forces. Once the solver approximates the pressure and shear-stress distributions on the vehicle surface, drag can be computed by integrating those surface forces.

The governing equations are continuous, but a computer can only solve a finite-dimensional approximation. A CFD solver discretizes the air volume surrounding the vehicle into a mesh and computes the flow variables. Mesh design is one of the central engineering decisions in high-fidelity CFD. Cells must be fine enough near the vehicle surface, across the boundary layer, around wheel wakes, and in separated-flow regions behind the vehicle. If critical regions are under-resolved, the solver may converge to a numerically precise but physically inaccurate solution. If the mesh is refined everywhere, the simulation becomes unnecessarily expensive. Equally important is mesh quality, as poorly shaped or highly skewed cells can degrade numerical accuracy and compromise solver stability.

Even with a well-designed mesh, directly resolving the Navier-Stokes equations across all relevant turbulent scales is usually too expensive for full-vehicle aerodynamics. The smallest turbulent structures can be orders of magnitude smaller than the vehicle-scale flow features, so resolving them everywhere would require a prohibitive number of cells and time steps. Turbulence models address this gap by modeling part of the turbulent motion rather than resolving every scale directly. This choice strongly affects the mesh and compute budget. Reynolds-averaged Navier-Stokes models reduce cost by modeling most turbulent fluctuations and can often use coarser meshes, especially away from walls. Large eddy simulation and detached eddy simulation resolve more unsteady turbulent structures, but they require finer grids, smaller time steps, and greater computational cost. Every modeling choice therefore balances accuracy, cost, and the physics captured.

A2 Evaluation metrics

We report three metrics on drag area, $C_{d} A$ . Let $y_{i}$ be the CFD value for design $i$ , ${\hat{y}}_{i}$ be the model prediction, and $N$ be the number of designs.

Mean absolute percentage error (lower is better) measures average relative error:

MAPE = \frac{100}{N} \sum_{i = 1}^{N} | \frac{y_{i} - {\hat{y}}_{i}}{y_{i}} | .

Max absolute percentage error (lower is better) measures the worst prediction:

max APE = 100 \cdot max_{i} | \frac{y_{i} - {\hat{y}}_{i}}{y_{i}} | .

Spearman rank correlation (higher is better) measures whether the model orders designs accurately:

ρ = 1 - \frac{6 \sum_{i = 1}^{N} d_{i}^{2}}{N (N^{2} - 1)}, d_{i} = rank ({\hat{y}}_{i}) - rank (y_{i}) .

A3 Benchmarks

We use three benchmarks:

DrivAerML [4] is a public and well known, high-fidelity CFD dataset built around the DrivAer reference body.
SUV-Bench medium contains production-like SUV geometries with moderate variation.
SUV-Bench hard contains production-like SUV geometry outliers with unrealistically mutated geometry features (see Figure 5).

2D PCA scatter of ULIP-2 embeddings for vehicles across the three benchmarks, colored by source. — Figure 5: Geometry diversity across the three benchmarks. Each point is a vehicle, embedded with ULIP-2 [3] and projected to 2D with a Principal Component Analysis (PCA), colored by source. Each benchmark is downsampled to 30 geometries for illustration.

A4 API and CAD software access for geometry optimization

The API is useful when an evaluation becomes a search over thousands of geometries and operating conditions. A geometry optimization procedure can generate a batch of variants, infer SUV-PT on each one, and rank the candidates by drag without manual geometry cleanup, meshing, solver setup, or managing the HPC. The operating conditions can be swept, for example multiple inlet velocities, or fixed with a single value.

from universalagi import UniversalAGI, DesignScores, GeometrySet, OperatingConditions, Range, Value

client = UniversalAGI()  # reads UAGI_API_KEY from the environment

geometries = GeometrySet.from_paths([
    "variants/suv_001.stl",
    "variants/suv_002.stl",
    "variants/suv_003.stl",
    "...",
    "variants/suv_n.stl",
])

conditions = OperatingConditions(
    velocity=Range(25.0, 40.0, steps=1000),
    air_density=Value(1.225),
)

simulation = client.simulate(
    model="suv-pt-1",
    geometries=geometries,
    conditions=conditions,
)

scores = DesignScores.from_simulation(simulation, metric="CdA")
best_geometry = scores.best()
print(best_geometry.path, best_geometry.value)

Each prediction returns integrated aerodynamic quantities such as $C_{d} A$ , $C_{D}$ , and $C_{L}$ . The full surface and volume fields can also be exported to standard mesh formats (e.g., vtu, vtp, stl, cgns) for inspection in tools such as ParaView. The same API can be called from extensions or automation scripts inside CAD environments such as CATIA, Autodesk, FreeCAD, or SolidWorks. This enables designers to make aerodynamically informed decisions without leaving their workflow (see Figure 6).

SUV-PT called from inside FreeCAD: a designer evaluates an SUV geometry's aerodynamics without leaving the CAD environment. — Figure 6: Access UniversalAGI's high-fidelity physics models in existing workflows (here, SUV-PT running inside FreeCAD).

References

Adams, C., Ranade, R., Cherukuri, R., & Choudhry, S. (2025). GeoTransolver: Learning Physics on Irregular Domains Using Multi-scale Geometry Aware Physics Attention Transformer. arXiv preprint arXiv:2512.20399.
Wu, H., Luo, H., Wang, H., Wang, J., & Long, M. (2024). Transolver: A fast transformer solver for pdes on general geometries. arXiv preprint arXiv:2402.02366.
Xue, L., Yu, N., Zhang, S., Panagopoulou, A., Li, J., Martín-Martín, R., ... & Savarese, S. (2024). Ulip-2: Towards scalable multimodal pre-training for 3d understanding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 27091-27101).
Ashton, N., Mockett, C., Fuchs, M., Fliessbach, L., Hetmann, H., Knacke, T., ... & Maddix, D. (2024). DrivAerML: High-fidelity computational fluid dynamics dataset for road-car external aerodynamics. arXiv preprint arXiv:2408.11969.

Try SUV-PT