Charts
DataOn-chain
VIP
Market Cap
API
Rankings
CoinOSNew
CoinClaw🦞
Language
  • 简体中文
  • 繁体中文
  • English
Leader in global market data applications, committed to providing valuable information more efficiently.

Features

  • Real-time Data
  • Special Features
  • AI Grid

Services

  • News
  • Open Data(API)
  • Institutional Services

Downloads

  • Desktop
  • Android
  • iOS

Contact Us

  • Chat Room
  • Business Email
  • Official Email
  • Official Verification

Join Community

  • Telegram
  • Twitter
  • Discord

© Copyright 2013-2026. All rights reserved.

简体繁體English
|Legacy

AI’s Progress Now Depends on ‘World Models’ That Grasp Physical Reality

CN
Decrypt
Follow
4 months ago
AI summarizes in 5 seconds.

Robots and multimodal artificial intelligence still can’t grasp the physical world, a shortcoming one prominent researcher says is now the field’s biggest obstacle.


Fei-Fei Li, the Stanford computer scientist widely regarded as a pioneer of modern computer vision, said the gap between AI and physical reality has become the tech's most urgent problem and argues that closing it would require systems built around spatial reasoning rather than language alone.


AI is fast approaching the limits of text-based learning, and progress will ultimately depend on “world models,” Li said in a report published Monday.


“At the core of unlocking spatial intelligence is the development of world models—a new type of generative AI that must meet a fundamentally different set of challenges than LLMs,” Li wrote on X. “These models must generate spatially consistent worlds that obey physical laws, process multimodal inputs from images to actions, and predict how those worlds evolve or be interacted with over time.”





What in the world are these models?


The concept of “world models” dates back to the early 1940s, when Scottish philosopher and psychologist Kenneth Craik conducted cognitive science research.


The idea resurfaced in modern AI after David Ha and Jürgen Schmidhuber’s 2018 paper showed that a neural network could learn a compact internal model of an environment and use it as a simulator for planning and control.


Li argued that world models matter because robots and multimodal systems still struggle with grounded spatial reasoning, leaving them unable to judge distances and scene changes, or to predict basic physical outcomes.


“Robots as human collaborators, whether aiding scientists at the lab bench or assisting seniors living alone, can expand part of the workforce in dire need of more labour and productivity,” Li wrote. Real environments follow rules that current machines can’t capture, Li argues.


From gravity shaping motion to materials influencing light, solving this requires systems capable of storing spatial memory and modeling scenes in more than two dimensions.


In September, Li’s company, World Labs, released the beta for Marble, an early world model that produced explorable three-dimensional environments from text or image prompts.


Users could walk through these worlds without time limits or scene drift, and the environments remained consistent rather than morphing or breaking apart, the company claims.


“Marble is only our first step in creating a truly spatially intelligent world model,” Li wrote. “As the progress accelerates, researchers, engineers, users, and business leaders alike are beginning to recognize its extraordinary potential. The next generation of world models will enable machines to achieve spatial intelligence on an entirely new level—an achievement that will unlock essential capabilities still largely absent from today’s AI systems.”


Li said world model use cases include supporting a range of applications because they give AI an internal understanding of how environments behave.


Creators could use them to explore scenes in real time, robots could rely on them to navigate and handle objects more safely, and researchers in science and healthcare could run spatial simulations or improve imaging and lab automation.


Li linked spatial intelligence research back to early biological studies, noting that humans learned to perceive and act long before they developed language.


“Long before written language, humans told stories—painted them on cave walls, passed them through generations, built entire cultures on shared narratives,” she wrote. “Stories are how we make sense of the world, connect across distance and time, explore what it means to be human, and most importantly, find meaning in life and love within ourselves.”


Li said AI needed the same grounding to function in the physical world and argued that its role should be to support people, not replace them. Progress, however, would depend on models that understood how the world worked rather than only describing it.


“AI’s next frontier is Spatial Intelligence, a technology that will turn seeing into reasoning, perception into action, and imagination into creation,” Li said.


免责声明:本文章仅代表作者个人观点,不代表本平台的立场和观点。本文章仅供信息分享,不构成对任何人的任何投资建议。用户与作者之间的任何争议,与本平台无关。如网页中刊载的文章或图片涉及侵权,请提供相关的权利证明和身份证明发送邮件到support@aicoin.com,本平台相关工作人员将会进行核查。

送 666 USDT,我们是认真的!
广告
|
|
APP
Windows
Mac
Share To

X

Telegram

Facebook

Reddit

CopyLink

|
|
APP
Windows
Mac
Share To

X

Telegram

Facebook

Reddit

CopyLink

Selected Articles by Decrypt

26 minutes ago
These Three Altcoins Just Got Leveraged Crypto ETFs
1 hour ago
Solana DeFi Exchange Drift Protocol Exploited, Upwards of $285 Million Stolen
1 hour ago
Google\\\'s Veo 3.1 Lite Cuts API Costs in Half as OpenAI\\\'s Sora Exits the Market
View More

Table of Contents

|
|
APP
Windows
Mac
Share To

X

Telegram

Facebook

Reddit

CopyLink

Related Articles

avatar
avatarDecrypt
26 minutes ago
These Three Altcoins Just Got Leveraged Crypto ETFs
avatar
avatarbitcoin.com
1 hour ago
Cango Secures $75M in Fresh Capital to Expand Ecohash AI Computing Platform
avatar
avatarDecrypt
1 hour ago
Solana DeFi Exchange Drift Protocol Exploited, Upwards of $285 Million Stolen
avatar
avatarDecrypt
1 hour ago
Google\\\'s Veo 3.1 Lite Cuts API Costs in Half as OpenAI\\\'s Sora Exits the Market
avatar
avatarcoindesk
2 hours ago
Citadel-backed EDX Markets applies for U.S. trust charter to expand institutional crypto services
APP
Windows
Mac

X

Telegram

Facebook

Reddit

CopyLink