3D/4D Tracking

Dense point tracking across frames, scene flow estimation, and unified geometry-plus-tracking models — from pixel-level correspondence to world-centric 3D trajectories.

⌘K

3D Point Tracking 15+ papers

Model / Method	Full Title	Venue	Year
OmniMotion	Tracking Everything Everywhere All at Once	ICCV	2023
OmniTrackFast	Track Everything Everywhere Fast and Robustly	ECCV	2024
SpatialTracker	Tracking Any 2D Pixels in 3D Space	CVPR	2024
EgoPoints	Advancing Point Tracking for Egocentric Videos	WACV	2025
SceneTracker	Long-term Scene Flow Estimation Network	T-PAMI	2025
DELTA	Dense Efficient Long-range 3D Tracking for any video	ICLR	2025
Seurat	From Moving Points to Depth	CVPR	2025
TAPIP3D	Tracking Any Point in Persistent 3D Geometry	arXiv	2025
TrackingWorld	World-centric Monocular 3D Tracking	NeurIPS	2025
SyncTrack4D	SyncTrack4D: Cross-Video Motion Alignment and Video Synchronization for Multi-Video 4D Gaussian Splatting	arXiv	2025
V-DPM	V-DPM: 4D Video Reconstruction with Dynamic Point Maps	arXiv	2026
DePT3R	DePT3R: Joint Dense Point Tracking and 3D Reconstruction of Dynamic Scenes	arXiv	2025

Unified Depth, Pose & Tracking 23+ papers

Model / Method	Full Title	Venue	Year
TracksTo4D	Fast Encoder-Based 3D from Casual Videos	NeurIPS	2024
Uni4D	Unifying Visual Foundation Models for 4D Modeling	CVPR	2025
Stereo4D	Learning How Things Move in 3D from Internet Stereo Videos	CVPR	2025
VGGT	Visual Geometry Grounded Transformer	CVPR	2025
Zero-MSF	Zero-Shot Monocular Scene Flow Estimation	CVPR	2025
St4RTrack	Simultaneous 4D Reconstruction and Tracking in the World	ICCV	2025
SpatialTrackerV2	3D Point Tracking Made Easy	ICCV	2025
MVTracker	Multi-View 3D Point Tracking	ICCV	2025
DPM	Dynamic Point Maps: Versatile Representation for Dynamic 3D Reconstruction	arXiv	2025
POMATO	Marrying Pointmap Matching with Temporal Motion	arXiv	2025
D²USt3R	Enhancing 3D Reconstruction with 4D Pointmaps	arXiv	2025
BA-Track	Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction	arXiv	2025
Trace Anything	Representing Any Video in 4D via Trajectory Fields	arXiv	2025
PointSt3R	Point Tracking through 3D Grounded Correspondence	WACV	2025
Dens3R	A Foundation Model for 3D Geometry Prediction	arXiv	2025
FlashVGGT	Efficient Visual Geometry Transformers with Compressed Descriptor Attention	arXiv	2025
Fin3R	Fine-tuning Feed-forward 3D Reconstruction Models	NeurIPS	2025
AMB3R	Accurate Feed-forward Metric-scale 3D Reconstruction	arXiv	2025
HTTM	HTTM: Head-wise Temporal Token Merging for Faster VGGT	arXiv	2025
Selfi	Selfi: Self Improving Reconstruction Engine via 3D Geometric Feature Alignment	arXiv	2025
UniPR-3D	UniPR-3D: Towards Universal Visual Place Recognition with Visual Geometry Grounded Transformer	arXiv	2025