FramePack – Packing Input Frame Context in Next-Frame Prediction Models for Offline Video Generation With Low Resource Requirements

https://lllyasviel.github.io/frame_pack_gitpage/

  • Diffuse thousands of frames at a full 30 fps with 13B models using only 6 GB of laptop GPU memory (see the packing sketch after this list).
  • Finetune a 13B video model at batch size 64 on a single 8×A100/H100 node for personal or lab experiments.
  • A personal RTX 4090 generates at 2.5 seconds/frame (unoptimized) or 1.5 seconds/frame (with TeaCache).
  • No timestep distillation.
  • Video diffusion, but feels like image diffusion.
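
The flat-memory claim comes from how the input frame context is packed: each next-frame prediction sees recent frames at high resolution and compresses older frames progressively harder, so the total context length converges to a fixed bound no matter how long the video grows. The sketch below illustrates that scheduling idea with a hypothetical geometric token budget; the function name, parameters, and the 0.5 decay schedule are illustrative assumptions, not the project's actual API.

```python
# Hypothetical sketch of the frame-packing idea behind the constant-memory
# claim: older frames receive geometrically fewer context tokens, so the
# total context length is bounded regardless of video length. All names
# and the 0.5 decay schedule are illustrative assumptions.

def packed_context_lengths(num_past_frames: int,
                           full_tokens: int = 1536,
                           decay: float = 0.5) -> list[int]:
    """Token budget per past frame, most recent first."""
    budgets = []
    for i in range(num_past_frames):
        # Each step back in time shrinks the token budget; once the
        # budget truncates to zero, that frame contributes no context.
        budgets.append(int(full_tokens * decay ** i))
    return budgets

if __name__ == "__main__":
    for n in (10, 100, 1000):
        total = sum(packed_context_lengths(n))
        print(f"{n:5d} past frames -> {total} context tokens")
    # Prints roughly the same total in every case: the geometric series
    # is bounded by full_tokens / (1 - decay), which is why per-step
    # memory stays flat as the video gets longer.
```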

Image-to-5-Seconds (30 fps, 150 frames)