pIXELsHAM – Page 49 – blog of links related to computer animation and production technology Sponsored by ReelMatters.com

BREAKING NEWS

LATEST POSTS

CAST – Component-Aligned 3D Scene Reconstruction from an RGB Image

pIXELsHAM.com

Feb 27, 2025

A.I., photogrammetry

https://sites.google.com/view/cast4

Views : 151

CUDA Programming for Python Developers

pIXELsHAM.com

Feb 27, 2025

software

https://www.pyspur.dev/blog/introduction_cuda_programming

Check your Cuda version, it will be the release version here:

>>> nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2024 NVIDIA Corporation
Built on Wed_Apr_17_19:36:51_Pacific_Daylight_Time_2024
Cuda compilation tools, release 12.5, V12.5.40
Build cuda_12.5.r12.5/compiler.34177558_0

or from here:

>>> nvidia-smi
Mon Jun 16 12:35:20 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 555.85                 Driver Version: 555.85         CUDA Version: 12.5     |
|-----------------------------------------+------------------------+----------------------+

PixVerse – Prompt, lypsync and extended video generation

pIXELsHAM.com

Feb 27, 2025

A.I.
https://app.pixverse.ai/onboard

PixVerse now has 3 main features:
- text to video ➡️ How To Generate Videos With Text Prompts
- image to video ➡️ How To Animate Your Images And Bring Them To Life
- upscale ➡️ How to Upscale Your Video
Enhanced Capabilities
– Improved Prompt Understanding: Achieve more accurate prompt interpretation and stunning video dynamics.
– Supports Various Video Ratios: Choose from 16:9, 9:16, 3:4, 4:3, and 1:1 ratios.
– Upgraded Styles: Style functionality returns with options like Anime, Realistic, Clay, and 3D. It supports both text-to-video and image-to-video stylization.

New Features
– Lipsync: The new Lipsync feature enables users to add text or upload audio, and PixVerse will automatically sync the characters’ lip movements in the generated video based on the text or audio.
– Effect: Offers 8 creative effects, including Zombie Transformation, Wizard Hat, Monster Invasion, and other Halloween-themed effects, enabling one-click creativity.
– Extend: Extend the generated video by an additional 5-8 seconds, with control over the content of the extended segment.

Views : 497
High-resolution 3D digital twin of Tokyo – Free to download

pIXELsHAM.com

Feb 27, 2025

modeling, reference

https://info.tokyo-digitaltwin.metro.tokyo.lg.jp/3dmodel

https://3dview.tokyo-digitaltwin.metro.tokyo.lg.jp

https://github.com/tokyo-digitaltwin

Views : 134
Alibaba Group Tongyi Lab WanxAI Wan2.1 – open source model

pIXELsHAM.com

Feb 26, 2025

A.I.

https://wanxai.com

👍 SOTA Performance: Wan2.1 consistently outperforms existing open-source models and state-of-the-art commercial solutions across multiple benchmarks.

🚀 Supports Consumer-grade GPUs: The T2V-1.3B model requires only 8.19 GB VRAM, making it compatible with almost all consumer-grade GPUs. It can generate a 5-second 480P video on an RTX 4090 in about 4 minutes (without optimization techniques like quantization). Its performance is even comparable to some closed-source models.

🎉 Multiple tasks: Wan2.1 excels in Text-to-Video, Image-to-Video, Video Editing, Text-to-Image, and Video-to-Audio, advancing the field of video generation.

🔮 Visual Text Generation: Wan2.1 is the first video model capable of generating both Chinese and English text, featuring robust text generation that enhances its practical applications.

💪 Powerful Video VAE: Wan-VAE delivers exceptional efficiency and performance, encoding and decoding 1080P videos of any length while preserving temporal information, making it an ideal foundation for video and image generation.

https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/tree/main/split_files

https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/tree/main/example%20workflows_Wan2.1

https://huggingface.co/Wan-AI/Wan2.1-T2V-14B

https://huggingface.co/Kijai/WanVideo_comfy/tree/main

Views : 806
VES Cinematic Color – Motion-Picture Color Management

pIXELsHAM.com

Feb 25, 2025

colour, production, ves

https://cinematiccolor.org

This paper presents an introduction to the color pipelines behind modern feature-film visual-effects and animation.

Authored by Jeremy Selan, and reviewed by the members of the VES Technology Committee including Rob Bredow, Dan Candela, Nick Cannon, Paul Debevec, Ray Feeney, Andy Hendrickson, Gautham Krishnamurti, Sam Richards, Jordan Soles, and Sebastian Sylwan.

Cinematic_Color_VES_compressed Download

Views : 636
The VFX Reference Platform

pIXELsHAM.com

Feb 25, 2025

production, software, ves

The VFX Reference Platform is a set of tool and library versions to be used as a common target platform for building software for the VFX industry. Its purpose is to minimise incompatibilities between different software packages, ease the support burden for integrated pipelines and encourage further adoption of Linux by both studios and software vendors. The Reference Platform is updated annually by a group of software vendors in collaboration with the Visual Effects Society Technology Committee.

https://vfxplatform.com

Each annual reference platform is designated by the calendar year in which major product releases should be targeting that particular reference.

Views : 51
Deep Compositing in Nuke – a walkthrough

pIXELsHAM.com

Feb 25, 2025

production

https://dsp.stackexchange.com/questions/81723/understanding-depth-map-depth-image-disparity-map-disparity-image

Depth Map: A depth map is a representation of the distance or depth information for each pixel in a scene. It is typically a two-dimensional array where each pixel contains a value that represents the distance from the camera to the corresponding point in the scene. The depth values are usually represented in metric units, such as meters. A depth map provides a continuous representation of the scene’s depth information.

For example, in Arnold this is achieved through a Z AOV, this collects depth of the shading points as seen from the camera.
https://help.autodesk.com/view/ARNOL/ENU/?guid=arnold_user_guide_ac_output_aovs_ac_aovs_html
https://help.autodesk.com/view/ARNOL/ENU/?guid=arnold_for_3ds_max_ax_aov_tutorials_ax_zdepth_aov_html
(more…)
Views : 55
VFX Giant MPC and Parent Company Technicolor Shut Down Amid ‘Severe Financial Challenges

pIXELsHAM.com

Feb 23, 2025

ves

https://variety.com/2025/film/global/technicolor-vfx-mpc-shutter-severe-challenges-1236316354

Shaun Severi, Head of Creative Production at the Mill, claimed in a LinkedIn post that 4,500 had lost their jobs in 24 hours: “The problem wasn’t talent or execution — it was mismanagement at the highest levels…the incompetence at the top was nothing short of disastrous.”

According to Severi, successive company presidents “buried the company under massive debt by acquiring VFX Studios…the second president, after a disastrous merger of the post houses, took us public, artificially inflating the company’s value — only for it to come crashing down when the real numbers were revealed….and the third and final president, who came from a car rental company, had no vision of what she was building, selling or managing.”

https://deadline.com/2025/03/technicolor-ceo-company-back-lies-in-ruins-vfx-post-production-1236305304

Views : 44
Free Blender God Ray Generator

pIXELsHAM.com

Feb 22, 2025

blender

https://80.lv/articles/get-this-magical-god-ray-generator-in-blender-for-free

https://ko-fi.com/s/a6ebd3a46a

Views : 40
Rubberband – AI Media Platform for Pros

pIXELsHAM.com

Feb 22, 2025

A.I.

https://www.rubbrband.com/home

Views : 44
Moondream Gaze Detection – Open source code

pIXELsHAM.com

Feb 22, 2025

A.I.

This is convenient for captioning videos, understanding social dynamics, and for specific cases such as sports analytics, or detecting when drivers or operators are distracted.

https://huggingface.co/spaces/moondream/gaze-demo

https://moondream.ai/blog/announcing-gaze-detection

Views : 33
X-Dyna – Expressive Dynamic Human Image Animation

pIXELsHAM.com

Feb 22, 2025

A.I.

https://x-dyna.github.io/xdyna.github.io

A novel zero-shot, diffusion-based pipeline for animating a single human image using facial expressions and body movements derived from a driving video, that generates realistic, context-aware dynamics for both the subject and the surrounding environment.

Views : 27
Agent Leaderboard on Hugging Face

pIXELsHAM.com

Feb 22, 2025

A.I.

https://www.galileo.ai/blog/agent-leaderboard

https://huggingface.co/spaces/galileo-ai/agent-leaderboard

Views : 43
Paper.Design – AI driven design

pIXELsHAM.com

Feb 22, 2025

design

https://paper.design

https://liquid.paper.design/?refraction=0.015&edge=0.4&patternBlur=0.005&liquid=0.07&speed=0.3&patternScale=2&background=metal

Views : 30

FEATURED POSTS

Composition – Basic cinematography

pIXELsHAM.com

May 18, 2018

composition, lighting, photography

https://www.cs.williams.edu/~morgan/cs371-f14/reading/cinematography.pdf

McGuire_cinematography_compressed Download
(more…)
Views : 1,692

Lumalabs unbounded NeRF running in a browser

pIXELsHAM.com

Aug 17, 2022

A.I., software

https://captures.lumalabs.ai/unbounded

Views : 1,012

Turn Your iPhone into a DSLR with These Keychain Lenses From Olloclip

pIXELsHAM.com

Apr 28, 2016

photography

http://www.gearinstitute.com/gear-news/gear-spotter/item/turn-your-iphone-into-a-dslr-with-these-keychain-lenses-from-olloclip

Views : 1,038

3D Gaussian Splatting step by step beginner course

pIXELsHAM.com

Dec 23, 2023

A.I., Featured, photogrammetry

Arkadiusz Szadkowski : Splats vs Points vs Mesh

🔸 Gaussian Splats: imagine throwing thousands of tiny ellipsoidal paint drops. They overlap, blend, and create a smooth, photorealistic look. Fast, great for visualization, but less structured for measurements.

🔸 Point Clouds: every dot is a measured hit. LiDAR or photogrammetry gives us millions of them forming a constellation of reality. Amazing for accuracy, but they don’t connect the dots out of the box.

🔸 Meshes: take those points, connect them into triangles, and you get very realistic surfaces. Strong for 3D analysis, simulation as continues watertight models.

(more…)
Views : 173

AI Search – Find The Best AI Tools & Apps

pIXELsHAM.com

Mar 14, 2025

A.I., Featured

https://ai-search.io

Views : 65

Gamma correction

pIXELsHAM.com

Apr 1, 2018

colour, Featured, lighting, photography, production

http://www.normankoren.com/makingfineprints1A.html#Gammabox

https://en.wikipedia.org/wiki/Gamma_correction

http://www.photoscientia.co.uk/Gamma.htm

https://www.w3.org/Graphics/Color/sRGB.html

http://www.eizoglobal.com/library/basics/lcd_display_gamma/index.html

https://forum.reallusion.com/PrintTopic308094.aspx

Basically, gamma is the relationship between the brightness of a pixel as it appears on the screen, and the numerical value of that pixel. Generally Gamma is just about defining relationships.

Three main types:
– Image Gamma encoded in images
– Display Gammas encoded in hardware and/or viewing time
– System or Viewing Gamma which is the net effect of all gammas when you look back at a final image. In theory this should flatten back to 1.0 gamma.

(more…)
Views : 3,244

handmade animal design 1

pIXELsHAM.com

Aug 13, 2017

design

Views : 992

Christopher Butler – Understanding the Eye-Mind Connection – Vision is a mental process

pIXELsHAM.com

Jul 23, 2024

colour, composition, lighting, photography, quotes
https://www.chrbutler.com/understanding-the-eye-mind-connection

The intricate relationship between the eyes and the brain, often termed the eye-mind connection, reveals that vision is predominantly a cognitive process. This understanding has profound implications for fields such as design, where capturing and maintaining attention is paramount. This essay delves into the nuances of visual perception, the brain’s role in interpreting visual data, and how this knowledge can be applied to effective design strategies.

This cognitive aspect of vision is evident in phenomena such as optical illusions, where the brain interprets visual information in a way that contradicts physical reality. These illusions underscore that what we “see” is not merely a direct recording of the external world but a constructed experience shaped by cognitive processes.

Understanding the cognitive nature of vision is crucial for effective design. Designers must consider how the brain processes visual information to create compelling and engaging visuals. This involves several key principles:
1. Attention and Engagement
2. Visual Hierarchy
3. Cognitive Load Management
4. Context and Meaning
Views : 63