ByteDance Seed proposed PMA, a model-merging technique for pre-training that lets you project a model's annealed performance without actually running the annealing phase. This can save millions of dollars on big model training runs.

Model Merging in Pre-training of Large Language Models
[Paper] https://alphaxiv.org/abs/2505.12082

Other "model merging" techniques I mentioned (but which are used in completely different scenarios):
https://alphaxiv.org/abs/2410.03617
https://alphaxiv.org/abs/2410.15661
https://alphaxiv.org/abs/2403.07816
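The core mechanic behind this kind of checkpoint merging (averaging the weights of several late pre-training checkpoints) can be sketched in a few lines. This is a toy illustration in the spirit of the idea, not ByteDance's implementation; the function name and the plain-list "state dicts" are stand-ins:

```python
# Toy sketch of checkpoint weight averaging. All names are illustrative:
# we take an element-wise mean over the weights of several checkpoints
# to approximate the smoothing effect of learning-rate annealing.

def merge_checkpoints(checkpoints):
    """Average a list of state dicts (name -> list of floats) element-wise."""
    merged = {}
    for name in checkpoints[0]:
        merged[name] = [
            sum(ckpt[name][i] for ckpt in checkpoints) / len(checkpoints)
            for i in range(len(checkpoints[0][name]))
        ]
    return merged

# Toy usage: three "checkpoints" of a two-parameter model.
ckpts = [
    {"w": [1.0, 2.0]},
    {"w": [3.0, 4.0]},
    {"w": [5.0, 6.0]},
]
print(merge_checkpoints(ckpts))  # {'w': [3.0, 4.0]}
```

In a real framework you would do the same mean over tensor-valued state dicts; the point is only that the merge itself is cheap relative to re-running an annealing phase.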
We take a look at Qwen 3 Coder, the first open-weight model that comes close to Sonnet 4.

00:00 Qwen Coder
01:49 Size and Architecture
02:57 How does it compare to Sonnet
07:36 Examples and Demonstrations
Timestamps:
0:00 Overview
4:11 How to use
4:31 Demos
5:12 Sponsor
6:20 Test
0:00 AI news intro
1:10 Pusa
5:37 Spatial Tracker V2
8:58 HopeJR
10:51 NeuralOS
14:35 ChatLLM
15:29 Kimi K2
24:47 Epona
27:15 Agility Digit demos
28:56 LimX CL-3 dance
30:25 Walker S2 auto recharge
31:46 PhysX
34:49 ChatGPT Agent
40:25 Clift
42:57 MovieS
Abstract: Large Language Models (LLMs) are typically presumed to process context uniformly; that is, the model should handle the 10,000th token just as reliably as the 100th. However, in practice, this assumption does not hold. We observe that model performance varies significantly as input length changes, even on simple tasks.

In this report, we evaluate 18 LLMs, including the state-of-the-art GPT-4.1, Claude 4, Gemini 2.5, and Qwen3 models. Our results reveal that models do not use their context uniformly; instead, their performance grows increasingly unreliable as input length grows.

Authors: Kelly Hong, Anton Troynikov, Jeff Huber
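The kind of length sensitivity the report describes can be probed with a simple harness: plant one known fact at varying depths inside progressively longer filler contexts, then check whether the model still recovers it. The sketch below is a hypothetical setup of my own; `mock_model` is a deliberately degrading stand-in for a real LLM call, and all names are illustrative:

```python
# Hypothetical length-sensitivity probe. A known fact is inserted at
# several positions in a filler context of a given size; accuracy is the
# fraction of positions where the (mock) model still recovers the fact.

FILLER = "The sky was grey and nothing of note happened. "
FACT = "The access code is 4471."

def build_prompt(n_filler, fact_position):
    """Place FACT after `fact_position` filler sentences out of `n_filler`."""
    parts = [FILLER] * n_filler
    parts.insert(fact_position, FACT + " ")
    return "".join(parts) + "\nQ: What is the access code?"

def mock_model(prompt):
    # Toy stand-in for an LLM: answers correctly only while the prompt
    # stays under an arbitrary length budget, then falls apart.
    if len(prompt) > 2000:
        return "unknown"
    return "4471" if "4471" in prompt else "unknown"

def accuracy_at_length(n_filler):
    positions = range(0, n_filler, max(1, n_filler // 4))
    hits = sum(mock_model(build_prompt(n_filler, pos)) == "4471"
               for pos in positions)
    return hits / len(positions)

print(accuracy_at_length(10), accuracy_at_length(100))  # 1.0 0.0
```

Swapping `mock_model` for a real API call and sweeping `n_filler` gives exactly the accuracy-versus-input-length curve the report measures.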
Google DeepMind wins the IMO 2025 Gold Medal using Gemini Deep Think. An advanced version of Gemini with Deep Think officially achieved gold-medal standard at the International Mathematical Olympiad.
OpenAI: we won, and we're too spooky to release the model.
Google: We won, and Logan will be pushing out the model Friday.
t0: xAI won!
1 second later: OpenAI won!
1 second later: Google won!
I know the "AGI" goalposts have moved quite a lot since the early aughts, but I think an AI system that warrants the label must feature continuous learning, either via an effectively infinite context window or continuous updating of the underlying model weights. These things are necessary for an AI agent to truly be a drop-in replacement for a white-collar human employee. The systems need long-term memory and a way to permanently integrate new information and skills. This just isn't a thing yet, and it doesn't look like it will be solved in the next few years. If these systems are AGI now, they're AGI that has suffered catastrophic strokes or traumatic brain injuries.

-------------------

What you described is fully possible right now and has been for years. Long-term memory just means building a system to save and recall data; it can be as organized or messy as you like, given enough time and tokens. And every model has been capable of updating itself for quite a while; they're just not allowed to. AGI has been here for over a year at least, just nobody wants to admit it (perhaps they're worried about funding being curtailed if they do). The models (or perhaps the underlying systems) that the companies have are much, much more powerful than anything we mere mortals can get our hands on.
from Deepmind's post: "To make the most of the reasoning capabilities of Deep Think, we additionally trained this version of Gemini on novel reinforcement learning techniques that can leverage more multi-step reasoning, problem-solving and theorem-proving data. We also provided Gemini with access to a curated corpus of high-quality solutions to mathematics problems, and added some general hints and tips on how to approach IMO problems to its instructions."
AI just crossed a major threshold: it's no longer just guessing. A new class of models called Energy-Based Transformers is letting AI reason the way humans do: slow down, test different options, rethink bad answers, and only stop when the result feels right. That means smarter decisions, longer attention on hard problems, and a built-in ability to say "this isn't good enough yet." In a world full of quick AI replies, this shift is huge. And it's not just theory: these models already outperform standard Transformers on tasks across language, images, and even video, while using far less compute.

🧠 What You'll See:
• How Energy-Based Transformers mimic human-style thinking using energy scores
• The difference between fast GPT-like responses and deeper reasoning
• How these models rethink, retry, and know when they're wrong
• Real-world tests across language, vision, and complex tasks
• Why this is a major leap toward truly intelligent systems

🚨 Why It Matters:
AI is finally moving beyond instant guesses. These models reason step by step, adapt their thinking in real time, and learn to solve hard problems like humans do. They know when to keep trying, and when to stop. This isn't about speed. It's about real intelligence.
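The "keep thinking until the energy is low" loop described above can be illustrated with a toy quadratic energy standing in for a learned Transformer energy head. This is a hand-rolled illustration of the general energy-based idea, not the paper's code; every name here is made up:

```python
# Toy illustration of energy-based refinement: instead of emitting an
# answer in one forward pass, score candidate answers with an energy
# function and refine the candidate by gradient descent until the
# energy drops below a threshold (i.e., the model "knows" it is done).

def energy(candidate, target):
    """Lower energy = better answer; here just squared distance."""
    return sum((c - t) ** 2 for c, t in zip(candidate, target))

def refine(candidate, target, lr=0.1, threshold=1e-3, max_steps=200):
    steps = 0
    while energy(candidate, target) > threshold and steps < max_steps:
        # Analytic gradient of the quadratic energy w.r.t. the candidate.
        grad = [2 * (c - t) for c, t in zip(candidate, target)]
        candidate = [c - lr * g for c, g in zip(candidate, grad)]
        steps += 1
    return candidate, steps

answer, steps = refine([0.0, 0.0], [1.0, -1.0])
print(answer, steps)  # converges near [1.0, -1.0] well before max_steps
```

The stopping rule is the point: harder inputs (higher starting energy) automatically get more refinement steps, which is the "variable thinking time" behavior the video highlights.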
Unitree has just released the R1, a full-size humanoid robot priced at only $5,900, making it one of the most affordable AI-powered robots ever sold to the public. The R1 features advanced mobility, voice recognition, real-time visual input, and an open SDK for developers, allowing it to walk, flip, balance, and interact using AI. This marks a major milestone in humanoid robotics, with China now leading the push to bring agile, intelligent robots into everyday life.

🧠 What You'll See:
• Unitree launches a full-size humanoid robot for just $5,900
• R1 walks, flips, kicks, and balances with real-time AI
• Voice recognition, visual input, and an open SDK for full customization
• How this robot compares to Tesla Optimus, Atlas, and Digit
• Why this launch changes the game for humanoid robotics

Why It Matters:
This isn't just another robot demo. Unitree's R1 makes advanced AI robotics affordable, functional, and available to the public, something no one else has done at this scale. And the rest of the world is still catching up.
Self-evolving AI: ASI-Arch autonomously designs new top AI models. #ai #ainews #agi #singularity

0:00 Background of AI innovation
2:26 Previous AI methods
3:35 ASI-Arch autonomous research
10:00 Extra details
11:13 Hailuo 02
12:41 Extra details
13:30 Results
16:05 AlphaGo moment
18:18 Top findings
24:06 Open sourced
Human beings can do all that, and they can refuel themselves. More importantly, they can be held liable for their actions. And we already have far more of them than we need, on the shelf. The intelligent world of engineering builds machines that do things humans can't.
Can a neural network write its own training data and skyrocket past GPT-4? In today's video, we dissect the brand-new "Self-Adapting Language Models" (SEAL) paper, in which an LLM fabricates synthetic data, tunes LoRA adapters, and, after just two rounds, outperforms much larger models on SQuAD and ARC.
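The SEAL-style loop (generate your own training data, apply the update, keep it only if it helps) can be caricatured in plain Python. Everything below is a hypothetical toy, not the paper's code: the "model" is a lookup table, "fine-tuning" is a dict merge, and the function names are invented for illustration:

```python
# Toy caricature of a self-adapting loop: the model writes synthetic
# training examples from a passage, "fine-tunes" on them, and retains
# the update only if held-out accuracy does not degrade.

def generate_self_edit(passage):
    """Stand-in for the LLM writing its own training data from a passage."""
    subject, _, rest = passage.partition(" is ")
    return {subject: rest.rstrip(".")}

def evaluate(model, qa_pairs):
    """Fraction of held-out questions the lookup-table 'model' answers."""
    return sum(model.get(q) == a for q, a in qa_pairs) / len(qa_pairs)

def self_adapt(model, passages, heldout):
    for passage in passages:
        candidate = {**model, **generate_self_edit(passage)}
        # Keep the self-edit only if it helps (or is neutral) on held-out data.
        if evaluate(candidate, heldout) >= evaluate(model, heldout):
            model = candidate
    return model

heldout = [("The capital of France", "Paris"), ("Water", "H2O")]
model = self_adapt({}, ["The capital of France is Paris.", "Water is H2O."], heldout)
print(evaluate(model, heldout))  # 1.0
```

In the real paper the "edit" is synthetic fine-tuning data applied through LoRA adapters and the accept/reject signal comes from reinforcement learning, but the outer generate-update-evaluate loop has this same shape.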
This isn't your typical tech news roundup. In this video, we break down four massive stories from July that signal a major shift in how AI will shape our lives... from what we wear, to how we work, to the policies that govern it.

Here's what happened:

🕶️ Meta's Superintelligence Lab
Mark Zuckerberg just announced a $110 billion push to bring AI into your everyday life, starting with smart glasses that act like a personal assistant. With talent from OpenAI and Scale AI leading the charge, Meta is going all-in on AI.

The U.S. Government's "America's AI Action Plan"
The U.S. government released its most aggressive national AI strategy yet... focusing on speed, infrastructure, and ideology. From banning "woke AI" in federal use to prioritizing open-source models and deregulated development, this could shape AI's trajectory for years.

🤖 ChatGPT Gets Agency
OpenAI gave ChatGPT the ability to take real-world actions: browse the web, send emails, even run code. This moves us into the era of agentic AI, where AI doesn't just answer your questions; it takes initiative.

🚗 Tesla's $16.5B Chip Deal with Samsung
Tesla signed a multi-billion-dollar deal to manufacture custom AI chips with Samsung in Texas. These chips will power everything from Full Self-Driving to the Optimus robot, making Tesla not just a car company but a full-stack AI player.
Something big is happening at Google. In just a few days, they dropped three breakthrough AI systems: one that outperforms OpenAI's Deep Research, another that builds real ML pipelines better than Kaggle pros, and a third that maps the Earth without satellites. These aren't mere upgrades; they're agents designed to replace researchers, coders, and analysts, and they're already winning.

🧠 What You'll See:
• Google's TTD-DR beats OpenAI on complex research benchmarks using a self-evolving AI agent
• MLE-STAR dominates Kaggle challenges by building and refining real machine learning pipelines
• DeepMind's AEF model creates satellite-free Earth maps using fused global data and AI precision
• All three systems show how Google is quietly pulling ahead in multi-domain AI autonomy

🚨 Why It Matters:
Google isn't just improving AI; they're turning it into a replacement for entire expert workflows. From writing reports and generating clean code to monitoring the planet in real time, these agents are already outperforming the best and learning as they go.
0:00 Qwen Image intro
0:42 Qwen Image demos
4:19 Image editing
6:10 Qwen Image vs Flux Krea dev vs GPT-4o
11:42 Slides & UI designs
15:18 ChatLLM
16:10 Other design tests
17:10 Photos and anatomy
19:33 Anime, logos, existing characters
21:26 Other art styles
22:40 Wildlife
23:44 How to use Qwen Image online
25:04 How to use Qwen Image offline with ComfyUI
30:50 How to use Qwen Image with low VRAM
34:29 How to edit images with Qwen Image
Demis Hassabis, CEO of Google DeepMind, sits down with host Logan Kilpatrick. In this episode, learn about the evolution from game-playing AI to today's thinking models, how projects like Genie 3 are building world models to help AI understand reality, and why new testing grounds like Kaggle's Game Arena are needed to evaluate progress on the path to AGI.

Chapters:
00:00 - Intro
01:16 - Recent GDM momentum
02:07 - Deep Think and agent systems
04:11 - Jagged intelligence
07:02 - Genie 3 and world models
10:21 - Future applications of Genie 3
13:01 - The need for better benchmarks and Kaggle Game Arena
19:03 - Evals beyond games
21:47 - Tool use for expanding AI capabilities
24:52 - Shift from models to systems
27:38 - Roadmap for Genie 3 and the omni model
29:25 - The quadrillion token club
In this video, I look at the launch of GPT-5 and what we can work out about the system OpenAI has released.

⏱️ Time Stamps:
00:00 Intro / OpenAI GPT-5 Blog
02:07 Unified System & Router
05:58 Creative Expression and Writing
07:47 Evaluations
12:12 Coding
13:02 Pricing