BREAKING NEWS
LATEST POSTS
-
ByteDance OmniHuman-1
https://omnihuman-lab.github.io
They propose an end-to-end multimodality-conditioned human video generation framework named OmniHuman, which can generate human videos based on a single human image and motion signals (e.g., audio only, video only, or a combination of audio and video). In OmniHuman, we introduce a multimodality motion conditioning mixed training strategy, allowing the model to benefit from data scaling up of mixed conditioning. This overcomes the issue that previous end-to-end approaches faced due to the scarcity of high-quality data. OmniHuman significantly outperforms existing methods, generating extremely realistic human videos based on weak signal inputs, especially audio. It supports image inputs of any aspect ratio, whether they are portraits, half-body, or full-body images, delivering more lifelike and high-quality results across various scenarios.
-
Conda – an open source management system for installing multiple versions of software packages and their dependencies into a virtual environment
https://anaconda.org/anaconda/conda
https://docs.conda.io/projects/conda/en/latest/user-guide/getting-started.html
NOTE The company recently changed their TOS and this service now incurs into costs for teams above a threshold.
Use MicroMamba instead. -
Vashi Nedomansky – Shooting ratios of feature films
In the Golden Age of Hollywood (1930-1959), a 10:1 shooting ratio was the norm—a 90-minute film meant about 15 hours of footage. Directors like Alfred Hitchcock famously kept it tight with a 3:1 ratio, giving studios little wiggle room in the edit.
Fast forward to today: the digital era has sent shooting ratios skyrocketing. Affordable cameras roll endlessly, capturing multiple takes, resets, and everything in between. Gone are the disciplined “Action to Cut” days of film.https://en.wikipedia.org/wiki/Shooting_ratio
-
General OCR Theory – Towards OCR-2.0 via a Unified End-to-end Model – HF Transformers implementation
https://huggingface.co/stepfun-ai/GOT-OCR-2.0-hf
GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.
-
QNTM – Developer Philosophy
- Avoid, at all costs, arriving at a scenario where the ground-up rewrite starts to look attractive
- Aim to be 90% done in 50% of the available time
- Automate good practice
- Think about pathological data
- There is usually a simpler way to write it
- Write code to be testable
- It is insufficient for code to be provably correct; it should be obviously, visibly, trivially correct
-
Arminas Valunas – “Coca-Cola: Wherever you are.”
Arminas created this using Juggernaut Xl model and QR Code Monster SDXL ControlNet.
His pipeline:
Static Images – Forge UI.
Upscaled with Leonardo AI universal upscaler.
Animated with Runway ML and Minimax.
Video upscale – Topaz Video AI.
Composited in Adobe Premiere.
Juggernaut Xl download here:
https://civitai.com/models/133005/juggernaut-xl
QR Code Monster SDXL:
https://civitai.com/models/197247?modelVersionId=221829
FEATURED POSTS
-
braindump.me – Building an AI game studio: what we’ve learned so far
https://braindump.me/blog-posts/building-an-ai-game-studio
Braindump is an attempt to imagine what game creation could be like in the brave new world of LLMs and generative AI to give you an entire AI game studio, complete with coders, artists, and so on, to help you create your dream game.
-
Top 3D Printing Website Resources
The Holy Grail – https://github.com/ad-si/awesome-3d-printing
- Thingiverse – https://www.thingiverse.com/
- Makerworld – https://makerworld.com/
- Printables – https://www.printables.com/
- Cults – https://cults3d.com/
- CG Trader – https://www.cgtrader.com/3d-print-models
- Sketchfab – https://sketchfab.com/store/3d-models/stl
- 3D Export – https://3dexport.com/
- MyMiniFactory – https://www.myminifactory.com/
- Thangs – https://thangs.com/
- Yeggi – https://www.yeggi.com/
- FAB365 – https://fab365.net/
- Gambody – https://www.gambody.com/
- All3DP News – https://all3dp.com/
- TCT Magazine – https://www.tctmagazine.com/topics/3D-printing-news/
- 3DPrint.com – https://3dprint.com/
- NASA 3D Models – https://nasa3d.arc.nasa.gov/models/printable
-
Emmanuel Tsekleves – Writing Research Papers
Here’s the journey of crafting a compelling paper:
1️. ABSTRACT
This is your elevator pitch.
Give a methodology overview.
Paint the problem you’re solving.
Highlight key findings and their impact.
2️. INTRODUCTION
Start with what we know.
Set the stage for our current understanding.
Hook your reader with the relevance of your work.
3️. LITERATURE REVIEW
Identify what’s unknown.
Spot the gaps in current knowledge.
Your job in the next sections is to fill this gap.
4️. METHODOLOGY
What did you do?
Outline how you’ll fill that gap.
Be transparent about your approach.
Make it reproducible so others can follow.
5️. RESULTS
Let the data speak for itself.
Present your findings clearly.
Keep it concise and focused.
6️. DISCUSSION
Now, connect the dots.
Discuss implications and significance.
How do your findings bridge the knowledge gap?
7️. CONCLUSION
Wrap it up with future directions.
What does this mean for us moving forward?
Leave the reader with a call to action or reflection.
8️. REFERENCES
Acknowledge the giants whose shoulders you stand on.
A robust reference list shows the depth of your research.
-
Tim Kang – calibrated white light values in sRGB color space
8bit sRGB encoded
2000K 255 139 22
2700K 255 172 89
3000K 255 184 109
3200K 255 190 122
4000K 255 211 165
4300K 255 219 178
D50 255 235 205
D55 255 243 224
D5600 255 244 227
D6000 255 249 240
D65 255 255 255
D10000 202 221 255
D20000 166 196 2558bit Rec709 Gamma 2.4
2000K 255 145 34
2700K 255 177 97
3000K 255 187 117
3200K 255 193 129
4000K 255 214 170
4300K 255 221 182
D50 255 236 208
D55 255 243 226
D5600 255 245 229
D6000 255 250 241
D65 255 255 255
D10000 204 222 255
D20000 170 199 2558bit Display P3 encoded
2000K 255 154 63
2700K 255 185 109
3000K 255 195 127
3200K 255 201 138
4000K 255 219 176
4300K 255 225 187
D50 255 239 212
D55 255 245 228
D5600 255 246 231
D6000 255 251 242
D65 255 255 255
D10000 208 223 255
D20000 175 199 25510bit Rec2020 PQ (100 nits)
2000K 520 435 273
2700K 520 466 358
3000K 520 475 384
3200K 520 480 399
4000K 520 495 446
4300K 520 500 458
D50 520 510 482
D55 520 514 497
D5600 520 514 500
D6000 520 517 509
D65 520 520 520
D10000 479 489 520
D20000 448 464 520