FlowCache is a caching framework for autoregressive video models, using chunkwise policies and KV cache compression. Achieves 2.38x speedup on MAGI-1 and 6.7x on SkyReels-V2 with minimal quality loss. Code available on GitHub.
Key Points
- 1.Chunkwise caching adapts to varying chunk similarities
- 2.Importance-redundancy KV compression maintains memory bounds
- 3.Enables real-time ultra-long video generation
Impact Analysis
Unlocks scalable, efficient autoregressive video synthesis, setting new benchmarks for speed and quality.
Technical Details
Dynamic recomputation control per chunk. Fixed memory with joint optimization; tested on Transformer-based models.