| TeleBoost | TeleBoost: A Systematic Alignment Framework for High-Fidelity, Controllable, and Robust Video Generation | arXiv | 2026 |
| MonarchRT | MonarchRT: Efficient Attention for Real-Time Video Generation | arXiv | 2026 |
| SLA2 | SLA2: Sparse-Linear Attention with Learnable Routing and QAT | arXiv | 2026 |
| Flow-Factory | Flow-Factory: A Unified Framework for Reinforcement Learning in Flow-Matching Models | arXiv | 2026 |
| Light Forcing | Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention | arXiv | 2026 |
| ScripterAgent | The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation | arXiv | 2026 |
| CamPilot | CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback | arXiv | 2026 |
| OmniTransfer | OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer | arXiv | 2026 |
| Streaming-dLLM | Streaming-dLLM: Accelerating Diffusion LLMs via Suffix Pruning and Dynamic Decoding | arXiv | 2026 |
| SpargeAttn | Accurate Sparse Attention Accelerating Any Model Inference | arXiv | 2025 |
| SageAttention2 | Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization | arXiv | 2025 |
| FlashVideo | Flowing Fidelity to Detail for Efficient High-Resolution Video Generation | arXiv | 2025 |
| Sparse VideoGen | Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity | arXiv | 2025 |
| Fast Sliding Tile Attention | Fast Video Generation with Sliding Tile Attention | arXiv | 2025 |
| Diffusion Adversarial Post-Training | One-Step Video Generation | arXiv | 2025 |
| Turbo2K | Towards Ultra-Efficient and High-Quality 2K Video Synthesis | arXiv | 2025 |
| T2V-Turbo-v2 | Enhancing Video Generation Model Post-Training | arXiv | 2024 |
| Real-Time PAB | Real-Time Video Generation with Pyramid Attention Broadcast | arXiv | 2024 |
| xGen-VideoSyn-1 | High-fidelity Text-to-Video Synthesis with Compressed Representations | arXiv | 2024 |
| SageAttention | Accurate 8-Bit Attention for Plug-and-play Inference Acceleration | arXiv | 2024 |
| From Slow to Fast | From Slow Bidirectional to Fast Causal Video Generators | arXiv | 2024 |
| MotionStream | MotionStream: Real-Time Video Generation with Interactive Motion Controls | arXiv | 2025 |
| Delta-DiT | Delta-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers | arXiv | 2025 |
| TeaCache | TeaCache: Training-Free Input-Aware Cache for Accelerating Diffusion Models | arXiv | 2025 |