RoboFlow4D | PIRLab

01

Planning in 3D Space

Interactive task examples show flow-conditioned planning for manipulation scenes.

02

Pipeline

RGB, points, and language are fused to predict future 3D flow and guide action.

03

Videos and Results

Simulation videos, real-robot videos, and benchmark tables are reproduced below.

Planning in 3D Space

Predicted flow becomes a spatial plan.

RoboFlow4D treats world modelling as a closed loop between observation, prediction, and execution. Given a visual sequence and language instruction, the model predicts future multi-frame 3D flow that describes how task-relevant geometry should move.

The original project page includes interactive 3D demos for household manipulation tasks such as Moka Pot, Drawer, Book to Caddy, and Push Cube. The local page keeps the task structure and links back to the live project resources.

Interactive demo

Moka Pot

Turn on the stove and put the moka pot on it.

Interactive demo

Drawer

Open a drawer, place an object inside, and complete the action sequence.

Interactive demo

Push Cube

Use predicted 3D motion to support goal-directed pushing.

Pipeline

A lightweight flow world model for real-time manipulation.

FlowDiT

RGB, point, and text tokens to multi-frame 3D flow

The model encodes observations and task instructions, predicts future 3D flow, and feeds that flow into a policy for action generation.

Closed loop

Slow planner, fast executor

RoboFlow4D acts as a predictive planner, while the action policy executes conditioned on both robot state and explicit flow.

+6.2 / +11.0

Average success-rate gains reported over base policies on LIBERO and ManiSkill3.

120x

Reported planning speedup compared with modular flow-planning pipelines.

< 1s

Goal-oriented planning latency aimed at real-time robot deployment.

Simulation Videos

Flow-conditioned policy rollouts in benchmark tasks.

LIBERO Object

Cream cheese to basket

LIBERO Object

Milk to basket

LIBERO Spatial

Bowl on cabinet to plate

LIBERO Spatial

Bowl on ramekin to plate

LIBERO Goal

Open drawer and place bowl

LIBERO Goal

Cream cheese to bowl

LIBERO Long

Both moka pots on stove

LIBERO Long

Two mugs to two plates

Real-World Videos

Robot manipulation with predicted flow.

Real robot

Cup insertion

Pick up the brown cup and insert it into the black cup.

Real robot

Pick-and-place assembly

Place an object into the target workspace with flow-guided control.

Real robot

Drawer manipulation

Open the top drawer, place the red cube inside, and close it.

Real robot

Stacking

Pick up the red cube and place it on the blue cube.

Quantitative Results

Benchmark gains with RoboFlow4D guidance.

Method	Spatial	Object	Goal	Long	Average
Octo	78.9	85.7	84.6	51.1	75.1
SpatialVLA	88.2	89.9	78.6	55.5	78.1
4D-VLA	88.9	95.2	90.9	79.1	88.6
DP	81.6	91.5	78.4	64.0	78.9
DP + RoboFlow4D	89.8	93.2	85.2	72.0	85.1
DiT	84.2	96.3	85.4	68.8	83.7
DiT + RoboFlow4D	90.2	97.0	88.4	75.2	87.7

Real-world DP gain

+12.5 average success

RoboFlow4D improves DP real-robot average success while reducing average completion time in the reported tasks.

Real-world DiT gain

+11.3 average success

The same flow guidance improves DiT across pick-and-place, stack, assemble, and drawer scenarios.

Deployment

Lightweight world modelling

The system is designed to make predictive 3D motion practical inside a robot control loop.

RoboFlow4D.

Planning in 3D Space

Pipeline

Videos and Results

Predicted flow becomes a spatial plan.

Moka Pot

Drawer

Push Cube

A lightweight flow world model for real-time manipulation.

RGB, point, and text tokens to multi-frame 3D flow

Slow planner, fast executor

Flow-conditioned policy rollouts in benchmark tasks.

Cream cheese to basket

Milk to basket

Bowl on cabinet to plate

Bowl on ramekin to plate

Open drawer and place bowl

Cream cheese to bowl

Both moka pots on stove

Two mugs to two plates

Robot manipulation with predicted flow.

Cup insertion

Pick-and-place assembly

Drawer manipulation

Stacking

Benchmark gains with RoboFlow4D guidance.

+12.5 average success

+11.3 average success

Lightweight world modelling