The trigger phrase is “equirectangular 360 degree panorama”. I would avoid saying “spherical projection” since that tends to result in non-equirectangular spherical images.
Image resolution should always use a 2:1 aspect ratio. 1024 x 512 and 1408 x 704 work quite well and were used in the training data; 2048 x 1024 also works.
I suggest using a LoRA weight of 0.5 – 1.5. If the image comes out too flat, without the necessary spherical distortion, try increasing the weight above 1, though this can degrade small details in the image. For Flux guidance, I recommend a value of about 2.5 for realistic scenes.
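For reference, here is a minimal sketch of applying these settings with the diffusers library; it assumes a FLUX.1-dev base model and a hypothetical local LoRA file name, so adjust the paths and adapter scale to your setup.

import torch
from diffusers import FluxPipeline

# Load a Flux base model (assumption: FLUX.1-dev; use whichever Flux checkpoint
# the LoRA was trained against).
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")

# Hypothetical LoRA file name; the scale mirrors the suggested 0.5 - 1.5 range.
pipe.load_lora_weights("equirect_360_panorama_lora.safetensors")
pipe.fuse_lora(lora_scale=1.0)

image = pipe(
    prompt="equirectangular 360 degree panorama, a mountain lake at sunrise",
    width=1408, height=704,      # 2:1 aspect ratio, as recommended above
    guidance_scale=2.5,          # Flux guidance of about 2.5 for realistic scenes
    num_inference_steps=28,
).images[0]
image.save("panorama.png")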
GenUE brings prompt-driven 3D asset creation directly into Unreal Engine, using ComfyUI as a flexible backend.
• Generate high-quality images from text prompts.
• Choose from a catalog of batch-generated images – no style limitations.
• Convert the selected image to a fully textured 3D mesh.
• Automatically import and place the model into your Unreal Engine scene.
This modular pipeline gives you full control over the image and 3D generation stages, with support for any ComfyUI workflow or model. Full generation (image + mesh + import) completes in under 2 minutes on a high-end consumer GPU.
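Since the backend is ordinary ComfyUI, the image stage can also be driven programmatically through ComfyUI's HTTP API. A minimal sketch, assuming a local server on the default port and a workflow exported via "Save (API Format)"; the file name and the prompt node ID "6" are assumptions about your particular workflow.

import json
import uuid
import urllib.request

# Load a workflow exported from ComfyUI in API format (file name is an example).
with open("workflow_api.json", "r", encoding="utf-8") as f:
    workflow = json.load(f)

# Patch the text of a hypothetical CLIPTextEncode node with ID "6".
workflow["6"]["inputs"]["text"] = "weathered stone golem, game asset, neutral lighting"

# Queue the workflow on a locally running ComfyUI server.
payload = json.dumps({"prompt": workflow, "client_id": str(uuid.uuid4())}).encode("utf-8")
req = urllib.request.Request(
    "http://127.0.0.1:8188/prompt",
    data=payload,
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.load(resp))  # returns a prompt_id that can be polled via /history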
• Prompt GPT-Image-1 directly in ComfyUI using text or image inputs
• Set resolution and quality
• Supports image editing + transparent backgrounds
• Seamlessly mix with local workflows like WAN 2.1, FLUX Tools, and more
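The node wraps OpenAI's Images API; a minimal sketch of the equivalent direct call is below. Parameter values are illustrative, and an OPENAI_API_KEY must be set in the environment.

import base64
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
result = client.images.generate(
    model="gpt-image-1",
    prompt="isometric sci-fi supply crate, clean studio lighting",
    size="1024x1024",          # resolution
    quality="high",            # quality tier
    background="transparent",  # transparent-background support
)

# gpt-image-1 returns base64-encoded image data.
with open("crate.png", "wb") as f:
    f.write(base64.b64decode(result.data[0].b64_json))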
What makes it special?
• Massive 10B parameter geometric model with 10x more mesh faces.
• High-quality textures with industry-first multi-view PBR generation.
• Optimized skeletal rigging for streamlined animation workflows.
• Flexible pipeline for text-to-3D and image-to-3D generation.
They’re making it accessible to everyone:
• Open-source code and pre-trained models.
• Easy-to-use API and intuitive web interface.
• Free daily quota doubled to 20 generations!
Video try-on replaces clothing in videos with target garments. Existing methods struggle to generate high-quality and temporally consistent results when handling complex clothing patterns and diverse body poses. We present 3DV-TON, a novel diffusion-based framework for generating high-fidelity and temporally consistent video try-on results. Our approach employs generated animatable textured 3D meshes as explicit frame-level guidance, alleviating the issue of models over-focusing on appearance fidelity at the expense of motion coherence. This is achieved by enabling direct reference to consistent garment texture movements throughout video sequences. The proposed method features an adaptive pipeline for generating dynamic 3D guidance: (1) selecting a keyframe for initial 2D image try-on, followed by (2) reconstructing and animating a textured 3D mesh synchronized with the original video poses. We further introduce a robust rectangular masking strategy that successfully mitigates artifact propagation caused by leaking clothing information during dynamic human and garment movements. To advance video try-on research, we introduce HR-VVT, a high-resolution benchmark dataset containing 130 videos with diverse clothing types and scenarios. Quantitative and qualitative results demonstrate our superior performance over existing methods.
Ever wondered how large language models like ChatGPT are actually built? Behind these impressive AI tools lies a complex but fascinating process of data preparation, model training, and fine-tuning. While it might seem like something only experts with massive resources can do, it’s actually possible to learn how to build your own language model from scratch. And with the right guidance, you can go from loading raw text data to chatting with your very own AI assistant.
FLORA aims to make generative creation accessible, removing the need for advanced technical skills or hardware. Drag, drop, and connect hand-curated AI models to build your own creative workflows with a high degree of creative control.
With Gen-4, you are now able to precisely generate consistent characters, locations and objects across scenes. Simply set your look and feel and the model will maintain coherent world environments while preserving the distinctive style, mood and cinematographic elements of each frame. Then, regenerate those elements from multiple perspectives and positions within your scenes.
Here’s why Gen-4 changes everything:
✨ Unwavering Character Consistency
• Characters and environments now stay flawlessly consistent across shots—even as lighting shifts or angles pivot—all from one reference image. No more jarring transitions or mismatched details.
✨ Dynamic Multi-Angle Mastery
• Generate cohesive scenes from any perspective without manual tweaks. Gen-4 intuitively crafts multi-angle coverage, a leap past earlier models that struggled with spatial continuity.
✨ Physics That Feel Alive
• Capes ripple, objects collide, and fabrics drape with startling realism. Gen-4 simulates real-world physics, breathing life into scenes that once demanded painstaking manual animation.
✨ Seamless Studio Integration
• Outputs now blend effortlessly with live-action footage or VFX pipelines. Major studios are already adopting Gen-4 to prototype scenes faster and slash production timelines.
• Why this matters: Gen-4 erases the line between AI experiments and professional filmmaking. Directors can iterate on cinematic sequences in days, not months—democratizing access to tools that once required million-dollar budgets.
comfy-cli is a command line tool that helps users easily install and manage ComfyUI, a powerful open-source machine learning framework. With comfy-cli, you can quickly set up ComfyUI, install packages, and manage custom nodes, all from the convenience of your terminal.
# create and activate a virtual environment for comfy-cli (the path is an example)
C:\<PATH_TO>\python.exe -m venv C:\comfyUI_env
cd C:\comfyUI_env
C:\comfyUI_env\Scripts\activate.bat
# with the venv active, install comfy-cli into it
python -m pip install comfy-cli
# install ComfyUI into the chosen workspace
comfy --workspace=C:\comfyUI_env\ComfyUI install
# then
comfy launch
# or
comfy launch -- --cpu --listen 0.0.0.0
If you are trying to clone an existing install, run pip freeze in it first, then install those requirements in the new venv.
# from the original env
python.exe -m pip freeze > M:\requirements.txt
# under the new venv env
pip install -r M:\requirements.txt
1 – Import your workflow
2 – Build a machine configuration to run your workflows on
3 – Download models into your private storage, to be used in your workflows and shared with your team
4 – Run ComfyUI in the cloud to modify and test your workflows on cloud GPUs
5 – Expose workflow inputs with our custom nodes, for API and playground use
6 – Deploy APIs
7 – Let your team use your workflows in the playground without using ComfyUI
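Once inputs are exposed and an API is deployed (steps 5 and 6), invoking the workflow reduces to a single HTTP request. The sketch below uses the requests library; the endpoint URL, deployment ID, header format, and input names are hypothetical placeholders, so consult your deployment's API documentation for the real ones.

import os
import requests

API_URL = "https://example-deployment-host/api/run"   # hypothetical endpoint
API_KEY = os.environ["WORKFLOW_API_KEY"]              # hypothetical credential

resp = requests.post(
    API_URL,
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "deployment_id": "your-deployment-id",                    # deployed workflow
        "inputs": {"prompt": "a cozy reading nook, warm light"},  # exposed input
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json())  # typically a run/job id that you poll for the outputs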
As models continue to advance, so too must our measurement of their economic impacts. In our second report, covering data since the launch of Claude 3.7 Sonnet, we find relatively modest increases in coding, education, and scientific use cases, and no change in the balance of augmentation and automation. We find that Claude’s new extended thinking mode is used most frequently in technical domains and tasks, and we identify patterns in automation / augmentation across tasks and occupations. We release datasets for both of these analyses.
Overview of Our Pipeline. We take 2D tracks and depth maps generated by off-the-shelf models as input, which are then processed by a motion encoder to capture motion patterns, producing featured tracks. Next, we use a track decoder that integrates DINO features to decode the featured tracks, decoupling motion and semantic information to ultimately obtain the dynamic trajectories (a). Finally, using SAM2, we group dynamic tracks belonging to the same object and generate fine-grained moving-object masks (b).
We predict that the impact of superhuman AI over the next decade will be enormous, exceeding that of the Industrial Revolution.
We wrote a scenario that represents our best guess about what that might look like. It’s informed by trend extrapolations, wargames, expert feedback, experience at OpenAI, and previous forecasting successes.
Create an action figure from the photo. It must be visualised in a realistic way. There should be accessories next to the figure like a UX designer would have: a MacBook Pro, a camera, a drawing tablet, a headset, etc. Add a hole to the top of the action figure’s box. Also write the text “UX Mate” and below it “Keep Learning! Keep Designing!”
Use this image to create a picture of an action figure toy of a construction worker in a blister package from head to toe, with accessories including a hammer, a staple gun and a ladder. The package should read “Kirk The Handy Man”.
Create a realistic image of a toy action figure box. The box should be designed in a toy-equipment/action-figure style, with a cut-out window at the top like classic action figure packaging. The main color of the box and moleskine notebook should match the color of my jacket (referenced visually). Add colorful Mexican skull decorations across the box for a vibrant and artistic flair. Inside the box, include a “Your name” action figure, posed heroically. Next to the figure, arrange the following “equipment” in a stylized layout: • item 1 • item 2 … On the box, write: “Your name” (bold title font) Underneath: “Your role or description” The entire scene should look like a real product mockup, highly realistic, lit like a studio product photo.
Prompt on Kling AI: The figure steps out of its toy packaging and begins walking forward. As he continues to walk, the camera gradually zooms out in sync with his movement.
“Create image. Create a toy of the person in the photo. Let it be an action figure. Next to the figure, there should be the toy’s equipment, each in its individual blister. 1) a book called “Tecnoforma”. 2) A 3-headed dog with a tag that says “Troika” and a bone at its feet with the word “austerity” written on it. 3) a three-headed Hydra with a tag that says “Geringonça”. 4) a book titled “D. Sebastião”. Don’t repeat the equipment under any circumstance. The card holding the blister should be strong orange. Also, on top of the box, write ‘Pedro Passos Coelho’ and underneath it, ‘PSD action figure’. The figure and equipment must all be inside blisters. Visualize this in a realistic way.”
A Modular AI Image Generation Web-User-Interface, with an emphasis on making power tools easily accessible, high performance, and extensibility. Supports AI image models (Stable Diffusion, Flux, etc.) and AI video models (LTX-V, Hunyuan Video, Cosmos, Wan, etc.), with plans to support e.g. audio and more in the future.
SwarmUI by default runs entirely locally on your own computer. It does not collect any data from you.
SwarmUI is 100% Free-and-Open-Source software, under the MIT License. You can do whatever you want with it.
Advances in computer vision and machine learning techniques have led to significant development in 2D and 3D human pose estimation using RGB cameras, LiDAR, and radars. However, human pose estimation from images is adversely affected by common issues such as occlusion and lighting, which can significantly hinder performance in various scenarios.
Radar and LiDAR technologies, while useful, require specialized hardware that is both expensive and power-intensive. Moreover, deploying these sensors in non-public areas raises important privacy concerns, further limiting their practical applications.
To overcome these limitations, recent research has explored the use of WiFi antennas, which are one-dimensional sensors, for tasks like body segmentation and key-point body detection. Building on this idea, the current study expands the use of WiFi signals in combination with deep learning architectures—techniques typically used in computer vision—to estimate dense human pose correspondence.
In this work, a deep neural network was developed to map the phase and amplitude of WiFi signals to UV coordinates across 24 human regions. The results demonstrate that the model is capable of estimating the dense pose of multiple subjects with performance comparable to traditional image-based approaches, despite relying solely on WiFi signals. This breakthrough paves the way for developing low-cost, widely accessible, and privacy-preserving algorithms for human sensing.
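As a purely illustrative toy (not the authors’ architecture), the mapping described above amounts to regressing per-region UV coordinates from CSI amplitude and phase tensors; all shapes and layer sizes below are assumptions.

import torch
import torch.nn as nn

class WifiToUV(nn.Module):
    """Toy sketch only; not the paper's model. Maps a CSI 'image' of amplitude
    and phase channels to UV coordinates for 24 body regions."""
    def __init__(self, in_ch: int = 6, regions: int = 24):
        super().__init__()
        self.regions = regions
        self.encoder = nn.Sequential(               # encode the CSI tensor
            nn.Conv2d(in_ch, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 3, padding=1), nn.ReLU(),
        )
        self.head = nn.Conv2d(128, regions * 2, 1)  # 2 channels (U, V) per region

    def forward(self, csi: torch.Tensor) -> torch.Tensor:
        uv = torch.sigmoid(self.head(self.encoder(csi)))  # UV values in [0, 1]
        b, _, h, w = csi.shape
        return uv.view(b, self.regions, 2, h, w)

model = WifiToUV()
dummy_csi = torch.randn(1, 6, 32, 32)   # assumed: 6 amp/phase channels over a 32x32 subcarrier-by-time grid
print(model(dummy_csi).shape)           # torch.Size([1, 24, 2, 32, 32])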
A stand-alone, decoder-only autoregressive model, trained from scratch, that unifies a broad spectrum of image generation tasks, including text-to-image generation, image pair generation, subject-driven generation, multi-turn image editing, controllable generation, and dense prediction.