Transformer Explainer is an interactive visualization tool designed to help anyone learn how Transformer-based models like GPT work. It runs a live GPT-2 model right in your browser, letting you experiment with your own text and observe in real time how the Transformer's internal components and operations work together to predict the next token. Try Transformer Explainer at http://poloclub.github.io/transformer-explainer
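As a companion to the visualization, here is a minimal Python sketch (using the Hugging Face transformers library, not the tool's in-browser runtime) of the core step the tool animates: GPT-2 turning a prompt into a probability distribution over possible next tokens.

```python
# Minimal sketch of GPT-2 next-token prediction, the step Transformer
# Explainer visualizes (the tool itself runs GPT-2 in the browser).
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

inputs = tokenizer("Data visualization empowers users to", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits           # (1, seq_len, vocab_size)

probs = torch.softmax(logits[0, -1], dim=-1)  # distribution over the next token
for p, idx in zip(*torch.topk(probs, k=5)):
    print(f"{tokenizer.decode(idx)!r}: {p.item():.3f}")
```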
If you prompt for a 360° video in VEO (literally write "360°" in the prompt), it can generate a monoscopic 360 video. The next step is to inject the right metadata into the file so it can play as an actual 360 video. Once it's saved with the right metadata, it will be recognized as a real 360/VR video, meaning you can just play it in VLC and drag your mouse to look around.
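For the injection step, Google's open-source Spatial Media Metadata Injector (https://github.com/google/spatial-media) is a common choice. A minimal sketch, assuming you've cloned that repo and its documented `python spatialmedia -i input output` CLI form still holds; the file names are placeholders:

```python
# Sketch: tag a monoscopic equirectangular MP4 as 360 video by shelling out
# to Google's spatial-media injector (assumes the repo is on disk).
import subprocess

def make_360(input_mp4: str, output_mp4: str) -> None:
    """Write a copy of input_mp4 with spherical (360) metadata injected."""
    subprocess.run(
        ["python", "spatialmedia", "-i", input_mp4, output_mp4],
        check=True,  # raise if the injector reports an error
    )

make_360("veo_clip.mp4", "veo_clip_360.mp4")  # placeholder file names
```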
There are three models: two are available now, and a third, open-weight version is coming soon:
FLUX.1 Kontext [pro]: State-of-the-art performance for image editing. High-quality outputs, great prompt following, and consistent results.
FLUX.1 Kontext [max]: A premium model that brings maximum performance, improved prompt adherence, and high-quality typography generation without compromise on speed.
Coming soon: FLUX.1 Kontext [dev]: An open-weight, guidance-distilled version of Kontext.
We're so excited about what Kontext can do that we've created a collection of models on Replicate to give you ideas:
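As a sketch of what calling Kontext from code could look like, here's a hypothetical example with Replicate's Python client; the model slug and input keys are assumptions based on Replicate's usual conventions, so check the model page for the real schema:

```python
# Hypothetical: edit an image with FLUX.1 Kontext [pro] via Replicate.
import replicate

output = replicate.run(
    "black-forest-labs/flux-kontext-pro",  # assumed model slug
    input={
        "prompt": "make the car red",                  # the edit instruction
        "input_image": "https://example.com/car.png",  # assumed input key
    },
)
print(output)  # typically a URL (or file) for the generated image
```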
The 8 most important model types and what they're actually built to do:
1. LLM – Large Language Model: Your ChatGPT-style model. Handles text, predicts the next token, and powers 90% of GenAI hype. Use case: content, code, convos.
2. LCM – Latent Consistency Model: Lightweight, diffusion-style models. Fast, quantized, and efficient – perfect for real-time or edge deployment. Use case: image generation, optimized inference.
3. LAM – Language Action Model: Where LLM meets planning. Adds memory, task breakdown, and intent recognition. Use case: AI agents, tool use, step-by-step execution.
4. MoE – Mixture of Experts: One model, many minds. Routes input to the right "expert" model slice – dynamic, scalable, efficient. Use case: high-performance model serving at low compute cost.
5. VLM – Vision Language Model: Multimodal beast. Combines image + text understanding via shared embeddings. Use case: Gemini, GPT-4o, search, robotics, assistive tech.
6. SLM – Small Language Model: Tiny but mighty. Designed for edge use, fast inference, low latency, efficient memory. Use case: on-device AI, chatbots, privacy-first GenAI.
7. MLM – Masked Language Model: The OG foundation model. Predicts masked tokens using bidirectional context (see the sketch after this list). Use case: search, classification, embeddings, pretraining.
8. SAM – Segment Anything Model: Vision model for pixel-level understanding. Highlights, segments, and understands *everything* in an image. Use case: medical imaging, AR, robotics, visual agents.
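To make the LLM/MLM distinction above concrete, here is a minimal Python sketch using Hugging Face pipelines; bert-base-uncased and gpt2 are just convenient stand-ins, not the only options:

```python
# Contrast item 1 (LLM) with item 7 (MLM) using Hugging Face pipelines.
from transformers import pipeline

# MLM (item 7): BERT predicts the [MASK] token from context on BOTH sides.
fill = pipeline("fill-mask", model="bert-base-uncased")
print(fill("The capital of France is [MASK].")[0]["token_str"])  # likely "paris"

# LLM (item 1): GPT-2 continues the text left-to-right, one token at a time.
gen = pipeline("text-generation", model="gpt2")
print(gen("The capital of France is", max_new_tokens=10)[0]["generated_text"])
```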
Tencent just made Hunyuan3D 2.1 open-source. This is the first fully open-source, production-ready PBR 3D generative model with cinema-grade quality. https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1
What makes it special?
• Advanced PBR material synthesis brings realistic materials like leather, bronze, and more to life with stunning light interactions.
• Complete access to model weights, training/inference code, and data pipelines.
• Optimized to run on accessible hardware.
• Built for real-world applications with professional-grade output quality.
They're making it accessible to everyone:
• Complete open-source ecosystem with full documentation.
• Ready-to-use model weights and training infrastructure.
• Live demo available for instant testing.
• Comprehensive GitHub repository with implementation details.
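To give a flavor of usage, here is a hypothetical sketch of image-to-3D shape generation. The module, class, and checkpoint names follow the pattern documented for the earlier Hunyuan3D-2 release and may differ in the 2.1 repo, so treat every identifier as an assumption and consult the linked GitHub for the exact API:

```python
# HYPOTHETICAL sketch: names below follow the Hunyuan3D-2 README and are
# assumptions for 2.1; check the GitHub repo for the exact 2.1 interface.
from hy3dgen.shapegen import Hunyuan3DDiTFlowMatchingPipeline

# Load the shape-generation pipeline from pretrained weights.
pipeline = Hunyuan3DDiTFlowMatchingPipeline.from_pretrained("tencent/Hunyuan3D-2")

# Generate a mesh from a single input image (placeholder path).
mesh = pipeline(image="assets/demo.png")[0]
mesh.export("demo.glb")  # trimesh-style export to glTF binary
```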