Featured AI – pIXELsHAM

Agent Leaderboard on Hugging Face

pIXELsHAM.com

Feb 22, 2025

https://www.galileo.ai/blog/agent-leaderboard

https://huggingface.co/spaces/galileo-ai/agent-leaderboard

A.I.

Flex 1 Alpha – a pre-trained base 8 billion parameter rectified flow transformer

pIXELsHAM.com

Feb 22, 2025

https://huggingface.co/ostris/Flex.1-alpha

Flex.1 started as the FLUX.1-schnell-training-adapter to make training LoRAs on FLUX.1-schnell possible.

A.I.

MatAnyone – Stable Video Matting with Consistent Memory Propagation

pIXELsHAM.com

Feb 19, 2025

https://huggingface.co/spaces/PeiqingYang/MatAnyone

https://pq-yang.github.io/projects/MatAnyone

https://github.com/richservo/MatAnyone

A.I.

ByteDance Goku – Flow-Based Video Generative Foundation Models

pIXELsHAM.com

Feb 11, 2025

https://saiyan-world.github.io/goku

A.I.

DynVFX – Augmenting Real Videos with Dynamic Content

pIXELsHAM.com

Feb 11, 2025

https://dynvfx.github.io

Given an input video and a simple user-provided text instruction describing the desired content, our method synthesizes dynamic objects or complex scene effects that naturally interact with the existing scene over time. The position, appearance, and motion of the new content are seamlessly integrated into the original footage while accounting for camera motion, occlusions, and interactions with other dynamic objects in the scene, resulting in a cohesive and realistic output video.

https://dynvfx.github.io/sm/index.html

A.I.

ByteDance OmniHuman-1

pIXELsHAM.com

Feb 7, 2025

https://omnihuman-lab.github.io

The authors propose OmniHuman, an end-to-end multimodality-conditioned human video generation framework that generates human videos from a single human image and motion signals (e.g., audio only, video only, or a combination of audio and video). OmniHuman introduces a mixed training strategy for multimodality motion conditioning, which lets the model benefit from scaling up data across mixed conditions. This overcomes the scarcity of high-quality data that limited previous end-to-end approaches. According to the authors, OmniHuman significantly outperforms existing methods, generating highly realistic human videos from weak signal inputs, especially audio. It supports image inputs of any aspect ratio, whether portrait, half-body, or full-body, and delivers more lifelike, higher-quality results across various scenarios.

A.I.

Hunyuan3D-2 – Add-on for Blender and ComfyUI

pIXELsHAM.com

Feb 7, 2025

https://github.com/kijai/ComfyUI-Hunyuan3DWrapper

https://github.com/Tencent/Hunyuan3D-2/blob/main/blender_addon.py

https://github.com/tencent/Hunyuan3D-2

https://huggingface.co/tencent/Hunyuan3D-2

A.I., modeling

ComfyUI Tutorial – How To Create Consistent Images Using Flux Model in ComfyUI

pIXELsHAM.com

Feb 1, 2025

A.I.

Netflix Eyeline-Research Go-with-the-Flow – An easy and efficient way to control the motion patterns of video diffusion models

pIXELsHAM.com

Feb 1, 2025

https://github.com/Eyeline-Research/Go-with-the-Flow

https://huggingface.co/Eyeline-Research/Go-with-the-Flow/tree/main

https://eyeline-research.github.io/Go-with-the-Flow

https://github.com/Pablerdo/hexaframe-dark

A.I.

Heather Cooper – 9 Video Models Comparison: Text to video

pIXELsHAM.com

Feb 1, 2025

https://www.linkedin.com/posts/heatherbcooper_video-model-comparison-text-to-video-activity-7290822319407550464-QzUY

🔹 Google DeepMind Veo 2
🔹 OpenAI Sora
🔹 Hunyuan Video
🔹 Pika 2.1
🔹 Alibaba Cloud Wanx 2.1
🔹 Runway Gen-3
🔹 Kling AI 1.6
🔹 Luma AI Ray2
🔹 Hailuo T2V-01

Uncompressed video under the post

A.I.

DimensionX – Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

pIXELsHAM.com

Jan 28, 2025

https://chenshuo20.github.io/DimensionX

https://github.com/wenqsun/DimensionX

https://huggingface.co/spaces/fffiloni/DimensionX

https://huggingface.co/wenqsun/DimensionX/tree/main

A.I.

ComfyUI-CogVideoXWrapper – Control motion paths in ComfyUI

pIXELsHAM.com

Jan 27, 2025

https://github.com/kijai/ComfyUI-CogVideoXWrapper

A.I.

LumaLabs Ray2 – A large-scale video generative model

pIXELsHAM.com

Jan 26, 2025

https://lumalabs.ai/ray

A.I.

The Best AI Animation Tool in 2025? (Prompt Battle)

pIXELsHAM.com

Jan 26, 2025

A.I.

Tencent Hunyuan3D – an advanced large-scale 3D synthesis system for generating high-resolution textured 3D assets

pIXELsHAM.com

Jan 25, 2025

https://github.com/tencent/Hunyuan3D-2

Hunyuan3D 2.0 is an advanced large-scale 3D synthesis system for generating high-resolution textured 3D assets. The system includes two foundation components: a large-scale shape generation model, Hunyuan3D-DiT, and a large-scale texture synthesis model, Hunyuan3D-Paint.

The shape generative model, built on a scalable flow-based diffusion transformer, aims to create geometry that properly aligns with a given condition image, laying a solid foundation for downstream applications. The texture synthesis model, benefiting from strong geometric and diffusion priors, produces high-resolution and vibrant texture maps for either generated or hand-crafted meshes. The authors also built Hunyuan3D-Studio, a versatile, user-friendly production platform that simplifies the re-creation process of 3D assets.

It allows both professional and amateur users to manipulate or even animate their meshes efficiently. The authors' systematic evaluation reports that Hunyuan3D 2.0 outperforms previous state-of-the-art models, both open-source and closed-source, in geometry details, condition alignment, texture quality, and more.

A.I.

Invoke.com – The Gen AI Platform for Pro Studios

pIXELsHAM.com

Jan 25, 2025

https://www.invoke.com

Invoke is a powerful, secure, and easy-to-deploy generative AI platform for professional studios to create visual media. Train models on your intellectual property, control every aspect of the production process, and maintain complete ownership of your data, in perpetuity.

A.I., software

How does Stable Diffusion work?

pIXELsHAM.com

Jan 24, 2025

https://stable-diffusion-art.com/how-stable-diffusion-work/

Stable Diffusion is a latent diffusion model that generates AI images from text. Instead of operating in the high-dimensional image space, it first compresses the image into the latent space.

Stable Diffusion belongs to a class of deep learning models called diffusion models. They are generative models, meaning they are designed to generate new data similar to what they have seen in training. In the case of Stable Diffusion, the data are images.

Why is it called a diffusion model? Because its math looks very much like diffusion in physics. Let’s go through the idea.
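
As a rough illustration of the two ideas above (working in a compressed latent space, and "diffusion" as gradual noising), here is a minimal sketch. It is not Stable Diffusion's actual code. It assumes the commonly cited Stable Diffusion v1 shapes (a 512×512 RGB image encoded to a 4×64×64 latent) and a DDPM-style linear beta schedule; the function names `make_alpha_bar` and `add_noise` are hypothetical.

```python
import numpy as np

def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_t) for an assumed linear beta schedule."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def add_noise(x0, t, alpha_bar, rng):
    """Forward diffusion: x_t = sqrt(a_bar_t) * x0 + sqrt(1 - a_bar_t) * noise."""
    noise = rng.standard_normal(x0.shape)
    xt = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * noise
    return xt, noise

# Size comparison: pixel space vs. latent space (assumed SD v1 shapes).
pixel_values = 3 * 512 * 512
latent_values = 4 * 64 * 64
compression = pixel_values // latent_values  # 48x fewer values to process

rng = np.random.default_rng(0)
alpha_bar = make_alpha_bar()
latent = rng.standard_normal((4, 64, 64))  # random stand-in for an encoded image

# Early step: the latent is barely changed. Last step: almost pure noise.
slightly_noisy, _ = add_noise(latent, 10, alpha_bar, rng)
mostly_noise, _ = add_noise(latent, 999, alpha_bar, rng)
```

Generation runs this process in reverse: a trained network predicts the noise at each step so it can be removed, starting from pure noise in latent space.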

A.I., Featured

IPAdapter – Text Compatible Image Prompt Adapter for Text-to-Image Image-to-Image Diffusion Models and ComfyUI implementation

pIXELsHAM.com

Jan 17, 2025

https://github.com/tencent-ailab/IP-Adapter

https://ip-adapter.github.io/

IP-Adapter models are very powerful for image-to-image conditioning. The subject, or even just the style, of one or more reference images can be easily transferred to a generation. Think of it as a one-image LoRA. IP-Adapter is an effective and lightweight adapter that adds image prompt capability to pre-trained text-to-image diffusion models. An IP-Adapter with only 22M parameters can achieve comparable or even better performance than a fine-tuned image prompt model.

Once trained, the IP-Adapter can be reused directly on custom models fine-tuned from the same base model.

The IP-Adapter is fully compatible with existing controllable tools, e.g., ControlNet and T2I-Adapter.
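
The IP-Adapter paper describes the underlying mechanism as decoupled cross-attention: image features get their own key/value projections, and their attention output is added to the regular text cross-attention output. The NumPy sketch below illustrates that idea with toy dimensions. The function names, the `scale` parameter name, and the random weights are illustrative assumptions, not the repository's API.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    """Scaled dot-product attention for 2D (tokens x dim) arrays."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    return softmax(scores) @ v

def decoupled_cross_attention(latent_tokens, text_tokens, image_tokens, w, scale=1.0):
    """Text and image prompts share one query but use separate key/value weights."""
    q = latent_tokens @ w["q"]
    text_out = attention(q, text_tokens @ w["k_text"], text_tokens @ w["v_text"])
    # In the paper, only the image key/value projections are newly trained;
    # the base model's weights stay frozen.
    image_out = attention(q, image_tokens @ w["k_image"], image_tokens @ w["v_image"])
    return text_out + scale * image_out

rng = np.random.default_rng(0)
dim = 8  # toy feature size (assumption)
names = ("q", "k_text", "v_text", "k_image", "v_image")
w = {name: rng.standard_normal((dim, dim)) * 0.1 for name in names}

latent_tokens = rng.standard_normal((16, dim))  # stand-in for U-Net features
text_tokens = rng.standard_normal((5, dim))     # stand-in for text embeddings
image_tokens = rng.standard_normal((4, dim))    # stand-in for image embeddings

out = decoupled_cross_attention(latent_tokens, text_tokens, image_tokens, w, scale=0.6)
text_only = decoupled_cross_attention(latent_tokens, text_tokens, image_tokens, w, scale=0.0)
```

With `scale=0.0` the image branch drops out and the layer behaves like ordinary text cross-attention, which is consistent with the adapter leaving the base model untouched.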

A.I.

ComfyUI Tutorial Series Ep16 – How to Create Seamless Patterns & Tileable

pIXELsHAM.com

Jan 14, 2025

A.I.

Learn Generative AI in 23 Hours

pIXELsHAM.com

Jan 10, 2025

https://www.freecodecamp.org/news/learn-generative-ai-in-23-hours

A.I.

LatentSync – Audio Conditioned Latent Diffusion Models for Lip Sync + ComfyUI model

pIXELsHAM.com

Jan 10, 2025

https://huggingface.co/spaces/fffiloni/LatentSync

https://github.com/bytedance/LatentSync

https://github.com/ShmuelRonen/ComfyUI-LatentSyncWrapper

https://www.gyan.dev/ffmpeg/builds

https://github.com/1038lab/ComfyUI-EdgeTTS

https://github.com/cocktailpeanut/fluxgym

A.I., software

DiffSensei – Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Comic Book Generation

pIXELsHAM.com

Jan 10, 2025

https://jianzongwu.github.io/projects/diffsensei

https://github.com/jianzongwu/DiffSensei

https://huggingface.co/jianzongwu/DiffSensei

A.I., software

Adobe TransPixar – Advancing Text-to-Video Generation with Transparency (Text-to-RGBA)

pIXELsHAM.com

Jan 10, 2025

https://wileewang.github.io/TransPixar

A.I., software

Andrea Spinazzola – Multi region workflow in ComfyUI for controlled composition

pIXELsHAM.com

Jan 9, 2025

https://trustpicasso.gumroad.com/l/multi_region_wf

https://www.linkedin.com/posts/aspinazzola_comfyui-fluxredux-regionalprompting-activity-7283030334005215232-pELW

A.I.
