FlashVSR is a streaming, one-step diffusion-based video super-resolution framework with block-sparse attention and a Tiny Conditional Decoder. It reaches ~17 FPS at 768×1408 on a single A100 GPU. A Locality-Constrained Attention design further improves generalization and perceptual quality on ultra-high-resolution videos.
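FlashVSR's actual kernels are not reproduced here, but the sparsity pattern is easy to picture. Below is a minimal PyTorch sketch of a locality-constrained, block-sparse attention mask; the block size and window are arbitrary illustrative choices, and a real block-sparse kernel would skip the masked blocks entirely rather than computing dense scores and masking them.

```python
import torch

def local_block_attention(q, k, v, block=64, window=1):
    """Dense reference for a locality-constrained, block-sparse mask.

    Tokens are grouped into blocks of `block` tokens; each query block
    attends only to key blocks within `window` blocks of its own.
    q, k, v: (batch, seq, dim).
    """
    _, n, d = q.shape
    scores = q @ k.transpose(-2, -1) / d ** 0.5              # (b, n, n)
    blk = torch.arange(n, device=q.device) // block          # block id per token
    allowed = (blk[:, None] - blk[None, :]).abs() <= window  # (n, n) locality mask
    scores = scores.masked_fill(~allowed, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

# Toy usage: 256 tokens in blocks of 64, attending to adjacent blocks only.
q = k = v = torch.randn(1, 256, 32)
print(local_block_attention(q, k, v).shape)  # torch.Size([1, 256, 32])
```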
Stable Video Infinity (SVI) can generate videos of ANY length with high temporal consistency, plausible scene transitions, and controllable streaming storylines in ANY domain.
OpenSVI: Everything is open-sourced: training & evaluation scripts, datasets, and more.
Infinite Length: No inherent limit on video duration; generate arbitrarily long stories (see the 10‑minute “Tom and Jerry” demo).
Versatile: Supports diverse in-the-wild generation tasks: multi-scene short films, single‑scene animations, skeleton-/audio-conditioned generation, cartoons, and more.
Efficient: Only LoRA adapters are tuned, requiring very little training data, so anyone can make their own SVI easily (see the sketch below).
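To make the "only LoRA adapters are tuned" point concrete, here is a generic PyTorch sketch of a LoRA-wrapped linear layer. This is not SVI's actual adapter code; the rank and scaling are illustrative defaults.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """A frozen linear layer plus a trainable low-rank update.

    Only A and B are trained, so the tunable parameter count is tiny
    compared to the frozen base weight.
    """
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # freeze the backbone weight
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.t() @ self.B.t())

layer = LoRALinear(nn.Linear(512, 512))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # 8192 trainable vs. 262,656 frozen parameters
```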
VISTA is a modular, configurable framework for optimizing text-to-video generation. Given a user video prompt P, it produces an optimized video V* and its refined prompt P* through two phases, (i) Initialization and (ii) Self-Improvement, inspired by how humans iteratively optimize videos through prompting. During (i), the prompt is parsed and planned into variants that generate candidate videos (Step 1), after which the best video-prompt pair is selected (Step 2). In (ii), the system generates multi-dimensional, multi-agent critiques (Step 3), refines the prompt (Step 4), produces new videos, and reselects the champion pair (Step 2). This phase repeats until a stopping criterion is met or the maximum number of iterations is reached.
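Read as an algorithm, the loop above is compact. The following Python pseudocode restates it; every helper name (plan_prompt_variants, generate_video, select_champion, and so on) is a hypothetical placeholder, not VISTA's real API.

```python
def vista(prompt, max_iters=5):
    # Phase (i): Initialization.
    variants = plan_prompt_variants(prompt)                    # parse & plan P
    candidates = [(p, generate_video(p)) for p in variants]    # Step 1
    best_prompt, best_video = select_champion(candidates)      # Step 2

    # Phase (ii): Self-Improvement.
    for _ in range(max_iters):
        critiques = multi_agent_critique(best_prompt, best_video)  # Step 3
        refined = refine_prompt(best_prompt, critiques)            # Step 4
        candidates.append((refined, generate_video(refined)))
        new_prompt, new_video = select_champion(candidates)        # Step 2
        if stopping_criterion(new_video, best_video):
            break
        best_prompt, best_video = new_prompt, new_video
    return new_video, new_prompt   # V*, P*
```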
We introduce a principle, Oz, for displaying color imagery: directly controlling the human eye’s photoreceptor activity via cell-by-cell light delivery. Theoretically, novel colors are possible by bypassing the constraints set by the cone spectral sensitivities and activating M cone cells exclusively. In practice, we confirm a partial expansion of colorspace toward that theoretical ideal. Attempting to activate M cones exclusively is shown to elicit a color beyond the natural human gamut, formally measured with color matching by human subjects, who describe the color as a blue-green of unprecedented saturation. Further experiments show that subjects perceive Oz colors in image and video form. The prototype targets laser microdoses to thousands of spectrally classified cones under fixational eye motion. These results are proof-of-principle for programmable control over individual photoreceptors at population scale.
SeC (Segment Concept) is a breakthrough in video object segmentation that shifts from simple feature matching to high-level conceptual understanding. Unlike SAM 2.1, which relies primarily on visual similarity, SeC uses a Large Vision-Language Model (LVLM) to understand what an object is conceptually, enabling robust tracking through:
Semantic Understanding: Recognizes objects by concept, not just appearance
Scene Complexity Adaptation: Automatically balances semantic reasoning vs feature matching
Superior Robustness: Handles occlusions, appearance changes, and complex scenes better than SAM 2.1
SOTA Performance: +11.8 points over SAM 2.1 on the SeCVOS benchmark
How SeC Works
Visual Grounding: You provide initial prompts (points/bbox/mask) on one frame
Concept Extraction: SeC’s LVLM analyzes the object to build a semantic understanding
Smart Tracking: Dynamically uses both semantic reasoning and visual features
Keyframe Bank: Maintains diverse views of the object for robust concept understanding
The result? SeC tracks objects more reliably through challenging scenarios like rapid appearance changes, occlusions, and complex multi-object scenes.
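In practice, the workflow above reduces to a prompt-then-track loop. The sketch below is hypothetical pseudocode for that loop; the actual class and method names in the SeC repository may differ, so check the project code before using it.

```python
import numpy as np

from sec import SeCPredictor     # hypothetical import path

predictor = SeCPredictor()       # loads the LVLM and tracking components

frames = load_video("clip.mp4")  # hypothetical helper returning frames
# Step 1: visual grounding -- a point prompt on the first frame.
predictor.set_prompt(frames[0], points=np.array([[320, 240]]), labels=[1])
# Steps 2-4 happen internally: the LVLM extracts the object concept,
# tracking blends semantic reasoning with visual features, and the
# keyframe bank keeps diverse views of the object.
masks = [predictor.track(frame) for frame in frames[1:]]
```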
An incredible story. Vivian Maier was a nanny who lived in Chicago for most of her life and passed away in 2009 at the age of 83. Little more is known about her, except that she was an avid street photographer. Her work, more than 100,000 negatives and undeveloped rolls of film, was discovered at an auction in 2007, sold by a storage facility that was cleaning out her locker over delinquent rent.
The shutter is the device that controls how much light passes through a lens: in essence, it controls how long the film is exposed. Shutter speed is how long the shutter stays open, which also determines motion blur; the longer it stays open, the blurrier the captured motion. The number refers to the duration of the exposure (e.g., 1/48 means the shutter is open for 1/48 of a second).

As a reference, shooting at 24 fps with a 180° shutter angle, i.e., a 1/48 s shutter speed (about 0.0208 s of exposure), produces motion blur similar to what we perceive with the naked eye.

Shutter timing is spoken of in (shutter) angles for historical reasons: the original exposure mechanism was a rotating, pie-shaped mirror in front of the lens.

A 180° shutter blocks light for half of each rotation (half open, half blocked). A 270° shutter leaves only a quarter-pie blocked (three quarters open, one quarter closed), allowing a longer exposure. A 90° shutter leaves a three-quarter pie blocked (one quarter open, three quarters closed), giving a shorter exposure.
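The arithmetic behind these examples is just a ratio: the shutter is open for (angle / 360) of each frame's duration. A small Python helper (my own, not from any camera API) makes the numbers explicit:

```python
def exposure_time(fps: float, shutter_angle: float) -> float:
    """Exposure time in seconds for a rotary shutter.

    The shutter is open for (shutter_angle / 360) of each frame's
    duration, which is 1 / fps seconds.
    """
    return (shutter_angle / 360.0) / fps

# 24 fps at a 180-degree shutter -> 1/48 s, about 0.0208 s.
print(exposure_time(24, 180))  # 0.020833...
print(exposure_time(24, 270))  # longer exposure: 0.03125 s (1/32 s)
print(exposure_time(24, 90))   # shorter exposure: 0.0104 s (1/96 s)
```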