LATEST POSTS
-
18 Ways You are Making Your Life Harder Than It Has To Be
http://www.marcandangel.com/2014/07/27/18-ways-youre-making-your-life-harder-than-it-has-to-be/
You look to everyone else for the answers only you can give yourself.
You let others make you feel guilty for living your life.
You allow toxic people to get the best of you.
You are part of the drama circle.
You assign negative intent to other people’s actions.
You are too worried that people will steal what you have.
You’re trying to compete with everyone else.
You have been too much of a taker.
You focus on popularity over effectiveness.
You keep cutting corners and taking the easy way out.
You focus on every point in time other than now.
You are stuck on your mistakes.
You have an “all or nothing” mentality.
You expect life to always be happy.
You keep thinking about worst-case scenarios.
You’re letting loss devour you.
You avoid facing the truth.
You put off making decisions.
FEATURED POSTS
-
What did DeepSeek figure out about reasoning with DeepSeek-R1?
https://www.seangoedecke.com/deepseek-r1
The Chinese AI lab DeepSeek recently released their new reasoning model R1, which supposedly (a) is better than the current best reasoning models (OpenAI’s o1 series), and (b) was trained on a GPU cluster a fraction of the size of those used by any of the big Western AI labs.
DeepSeek uses a reinforcement learning approach, not a fine-tuning approach. There’s no need to generate a huge body of chain-of-thought data ahead of time, and there’s no need to run an expensive answer-checking model. Instead, the model generates its own chains-of-thought as it goes.
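To make the "no expensive answer-checking model" point concrete, here is a minimal sketch of what a cheap, programmatic reward could look like. It assumes the model is asked to end its output with a line starting with "Answer:" and that a reference answer is available for each training problem; the marker, the function names, and the exact-match rule are all hypothetical and are not taken from DeepSeek's code.

```python
from typing import Optional


def extract_final_answer(completion: str, marker: str = "Answer:") -> Optional[str]:
    """Return the text after the last marker, or None if the marker is missing.

    The "Answer:" marker is an assumed output convention, not DeepSeek's format.
    """
    _, found, tail = completion.rpartition(marker)
    return tail.strip() if found else None


def reward(completion: str, reference: str) -> float:
    """Score 1.0 when the final answer matches the reference exactly, else 0.0.

    The chain of thought before the marker is never graded; only the outcome is.
    """
    answer = extract_final_answer(completion)
    return 1.0 if answer is not None and answer == reference.strip() else 0.0


if __name__ == "__main__":
    sample = "6 times 7 is 42, so the result is 42.\nAnswer: 42"
    print(reward(sample, "42"))
```

In a setup like this, the reinforcement learning loop would sample completions, score them with `reward`, and push the model toward the higher-scoring ones, so the reasoning text itself never needs to be written or labelled ahead of time.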
The secret behind their success? A bold move to train their models using FP8 (8-bit floating-point precision) instead of the standard FP32 (32-bit floating-point precision).
…
By using a clever system that applies high precision only when absolutely necessary, they achieved incredible efficiency without losing accuracy.
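The idea of using high precision only where it matters can be illustrated with a toy model. The sketch below is not DeepSeek's scheme and does not use real FP8: `quantize` is a hypothetical helper that simply rounds a number to a few mantissa bits and ignores the limited exponent range. It compares a dot product whose running sum is also kept in low precision against one where only the inputs are low precision and the sum is accumulated in a normal Python float.

```python
import math


def quantize(x: float, mantissa_bits: int = 3) -> float:
    """Round x to `mantissa_bits` explicit mantissa bits (toy stand-in for FP8)."""
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)              # x == m * 2**e with 0.5 <= |m| < 1
    scale = 2 ** (mantissa_bits + 1)  # explicit bits plus the implicit leading bit
    return math.ldexp(round(m * scale) / scale, e)


def dot_low_precision(a, b):
    """Low-precision inputs AND a low-precision running sum."""
    acc = 0.0
    for x, y in zip(a, b):
        acc = quantize(acc + quantize(x) * quantize(y))
    return acc


def dot_mixed(a, b):
    """Low-precision inputs, but the running sum stays in full precision."""
    acc = 0.0
    for x, y in zip(a, b):
        acc += quantize(x) * quantize(y)
    return acc


if __name__ == "__main__":
    a = [0.1] * 1000
    b = [0.1] * 1000
    print("exact        :", 10.0)
    print("low precision:", dot_low_precision(a, b))
    print("mixed        :", dot_mixed(a, b))
```

In this toy setup the fully low-precision sum stalls once the accumulator grows large enough that each new product rounds away, while the mixed version stays close to the true value. That is the general reason accumulations are the usual place to spend extra precision.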
…
The impressive part? These multi-token predictions are about 85–90% accurate, meaning DeepSeek R1 can deliver high-quality answers at double the speed of its competitors.
Chinese AI firm DeepSeek has 50,000 NVIDIA H100 AI GPUs
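The link between the 85–90% multi-token accuracy quoted above and the "double the speed" claim can be checked with some rough arithmetic. The sketch below assumes the model proposes one extra token per decoding step, that the extra token is kept with a fixed probability, and that checking it costs nothing; all three are simplifying assumptions, and the function name is hypothetical.

```python
def expected_tokens_per_step(acceptance_rate: float, extra_tokens: int = 1) -> float:
    """Expected tokens produced per decoding step.

    The first token is always kept. The k-th extra token only counts if all
    earlier extra tokens were accepted, hence the acceptance_rate ** k term.
    """
    return 1.0 + sum(acceptance_rate ** k for k in range(1, extra_tokens + 1))


if __name__ == "__main__":
    for rate in (0.85, 0.90):
        speedup = expected_tokens_per_step(rate)
        print(f"acceptance {rate:.2f}: about {speedup:.2f} tokens per step")
```

Under these assumptions the estimate comes out a little under 2 tokens per step, which is consistent with "roughly double" as an upper bound rather than a guaranteed speed-up.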