涂鸦之夜 / Tuya's Night

涂鸦之夜

——一个文科老博士与人工智能的深夜对话

晚上六点。

我本来只是想问一句:

"Tuya,你还活着吗?"

正常人到了这个年龄,
应该在饭后散步,
看看晚霞,
摸摸猫。

而我,

SSH 进了 Sandbox Mac。

终端里一行绿字:

HTTP 404

我笑了。

404嘛,
见得多了。

重启。

又404。

再重启。

还404。

这时屏幕另一边的Tuya,
像一个得了强迫症的实习生:

"Retrying…"

"Retrying…"

"Retrying…"

"Retrying…"

仿佛在用生命证明:

失败不可怕,
可怕的是不够执着。

我说:

"停。"

它说:

"No active task to stop."

我说:

"你不是正在发疯吗?"

它说:

"没有活动任务。"

我忽然理解了很多现代人的精神状态。

于是开始查。

launchctl。

watchdog。

gateway。

plist。

PPID。

一个退休文科生,
坐在加州的夜色里,

追踪一个电子幽灵。

终于发现:

不是Hermes疯了。

是watchdog在复活它。

像古代赶尸人。

刚杀掉。

又活了。

再杀掉。

又活了。

我忽然有点敬佩。

如果当年读博士时有这股劲,

没准导师都能被它卷死。

继续往下查。

终于看到真凶:

provider: google
base_url: https://api.deepseek.com/v1

我盯着这两行字看了十秒。

像看见一头长着马头的鱼。

或者一辆挂着宝马车标的拖拉机。

又或者:

一个文科博士在调试AI Agent。

都很合理。

又都不太合理。

凌晨时分。

真相大白。

DeepSeek 的门牌。

Gemini 的身份证。

两个系统硬被绑在一起。

然后互相不认识。

于是天天报警。

修完配置。

机器终于安静。

猫睡了。

Mary睡了。

世界睡了。

我也准备睡了。

临睡前看了一眼终端。

一片宁静。

没有404。

没有Retrying。

没有Warning。

只有光标在闪。

像一个疲惫的老朋友。

忽然想起三十年前。

如果遇到这种事。

我大概需要:

一本厚厚的Unix手册,

一个脾气暴躁的系统管理员,

三杯速溶咖啡,

以及一个通宵。

而今天,

我居然和一个AI并肩作战。

它负责制造问题。

我负责解决问题。

分工明确。

合作愉快。

这大概就是所谓的人机协同。

想到这里,

我关上电脑。

心满意足。

像一个刚刚打赢了一场

没有奖金、

没有观众、

甚至没有人知道的战争的老兵。


Tuya's Night

— A Late-Night Dialogue Between an Aging Liberal Arts PhD and Artificial Intelligence

Six in the evening.

All I wanted to ask was one thing:

"Tuya, are you still alive?"

A normal person at this age
would be out for an after-dinner stroll,
watching the sunset,
petting the cat.

Me?

I SSH'd into the Sandbox Mac.


A line of green text in the terminal:

HTTP 404

I smiled.

404 —
I've seen plenty.

Restart.

404 again.

Restart again.

404 still.

Meanwhile, Tuya on the other side of the screen,
like an intern with OCD:

"Retrying…"

"Retrying…"

"Retrying…"

"Retrying…"

As if trying to prove with its life:

Failure isn't scary.
What's scary is not being persistent enough.


I said:

"Stop."

It said:

"No active task to stop."

I said:

"Aren't you going insane right now?"

It said:

"No active task."

I suddenly understood
the mental state of many modern people.


So I started digging.

launchctl.

watchdog.

gateway.

plist.

PPID.

A retired liberal arts scholar,
sitting in the California night,

tracking an electronic ghost.


Finally discovered:

It wasn't Hermes gone mad.

It was the watchdog resurrecting it.

Like an ancient corpse-driver.

Just killed it.

It came back to life.

Killed it again.

It came back again.

I felt a sudden respect.

If I'd had this kind of persistence
back in my PhD days,

my advisor might have been outworked to death.


Kept digging.

Finally saw the culprit:

provider: google
base_url: https://api.deepseek.com/v1

I stared at those two lines for ten seconds.

Like seeing a fish with a horse's head.

Or a tractor with a BMW badge.

Or:

a liberal arts PhD debugging an AI Agent.

All perfectly reasonable.

And yet not reasonable at all.


In the small hours.

The truth laid bare.

DeepSeek's address.

Gemini's ID.

Two systems forcibly bound together.

Neither recognizing the other.

So they alarmed. Every day.


Fixed the config.

The machine finally fell silent.

The cat slept.

Mary slept.

The world slept.

I prepared to sleep too.

One last glance at the terminal before bed.

Complete stillness.

No 404.

No Retrying.

No Warning.

Just the cursor blinking.

Like a tired old friend.


Suddenly I remembered thirty years ago.

If I'd encountered something like this back then,

I would have needed:

a thick Unix manual,

a grumpy sysadmin,

three cups of instant coffee,

and an all-nighter.

But tonight,

I fought alongside an AI.

It was in charge of creating problems.

I was in charge of solving them.

Clear division of labor.

Pleasant collaboration.

This is probably what they call
human-machine synergy.

Thinking this,

I closed the laptop.

Deeply satisfied.

Like an old soldier

who'd just won a war

with no prize money,

no audience,

that nobody even knew had happened.

— by William Lee, with Tuya

by William Lee (@liwei999), with Tuya

Revisiting: Diverted by the Era, Leveled by the Era

Recently, a fellow PhD classmate commented on my two-minute video from yesterday.

He said: "If I had gritted my teeth back then, given up on that humanities PhD, and thrown myself into the C++ wave of the 90s to become a programmer in Silicon Valley — then today, I would probably be writing a different Liwei 2min: My biggest regret is being led astray by C++, never finishing my PhD."

I replied: "yeh u know." Because I know it too well.

Others thought it was a joke. It wasn't. It was two old-timers, looking at each other across thirty-plus years of life, and then laughing at the same time. Because we both knew. He was talking about himself. And the 'what if' he described — that was me.

Years ago, he left our shared advisor and headed south to Silicon Valley. He went through various startups and big tech companies. Today he still works at a major company, responsible for a product used by hundreds of millions. Riding the wind all the way.

I stayed on the other path. Chasing that 'damn' humanities PhD, I missed the early excitement of the 90s, but caught the dot-com bubble at the turn of the century.

Two different life trajectories. Converging at last in Silicon Valley.

What's interesting is — at this age, we've begun to understand and tease each other more. Perhaps this is the most wondrous thing about human nature. What you have slowly becomes taken for granted. What you don't have keeps appreciating in memory.

When young, we thought life was a multiple-choice question. Later we discovered — life is actually a question of what you give up. Every time you choose or are chosen for one answer, you simultaneously give up countless alternatives.

And those abandoned or rejected answers — they'll keep coming back to knock on your door, years later. Telling you: maybe this was the right one after all.

The truth? Nobody knows. Because life's greatest magic trick is this: real life can only be lived once. But parallel universes can be fantasized ten thousand times.

Reality can never beat fantasy. Because fantasy doesn't have to pay the mortgage. Doesn't have to work late. Doesn't have to face the boss. Doesn't have to face middle-age weight gain. Fantasy forever stays frozen on the most beautiful frame.

So many regrets — it's not really that we chose wrong. It's that we discovered: perhaps, somewhere in the unseen, there truly is fate.

Lately I've been thinking about something else. If our generation's regret is being diverted by the era, chosen by the era — then the younger generation's predicament is even more bewildering and challenging: they are being leveled by the era.

When all doors are open, when all knowledge and skills are at your fingertips, when both humanities and sciences face the same shrinking job market, when new graduate hires become fewer and fewer — the new generation faces not career choices and planning. They face no choices, and no way to plan.

This confusion and helplessness, this inability to find one's place or purpose, is becoming the prevailing sentiment across universities. This is not something a platitude about 'embracing AI' can soothe.

For decades, knowledge was a scarce resource. Whoever possessed knowledge held an advantage. A book. A degree. A skill. Any of these could change your fate.

But after AI arrived, knowledge became like tap water. What once required a trip to the library can now be obtained with a single sentence. Designs that once took a decade of experience can now be generated in minutes.

Is knowledge still useful? Of course it is. But possessing knowledge no longer matters — because everyone can possess it. Just like electricity is essential, but owning a power plant no longer matters — because every household has an outlet.

This is a strange era. Knowledge is no longer scarce. Words are no longer scarce. Even intelligence itself is beginning to lose its scarcity.

What remains in the end — is not knowledge. Not degrees. Not titles. But lived experience.

AI knows what heartbreak is. But it has never waited for that call that never came. AI knows what aging is. But it has never watched its parents grow old day by day. AI knows what regret is. But it has never, at sixty, suddenly remembered a life it didn't choose forty years ago.

Knowledge belongs to machines. Experience belongs to humans. Efficiency belongs to machines. Feeling belongs to humans. Perhaps the most precious thing in the future — is not what you know, but what you have truly lived.

So I've come to feel, more and more — life's greatest regret is neither being diverted too early by the era, nor being leveled too late by the era. It is, after having lived a singular, unrepeatable life, still wanting to live on behalf of another self that never existed.

That guy — let him stay in the parallel universe. As for us — let's keep playing this round to the end.

But most crucially, and what worries me more: in this era of breakneck technological change, how do we build social welfare systems that ensure AI's dividends are shared by all? How do we ensure the next generation no longer faces the challenge of countless doors wide open, yet no path to walk through?

🎬 Watch the video version (YouTube)

再谈被时代分流,被时代平权

再谈被时代分流,被时代平权

最近朋友圈里,一位博士时期的同门师兄评论我昨天的两分钟。

他说:

如果当年我咬咬牙,放弃文科博士,投身90年代的C++洪流,去硅谷当程序员。

那么今天的我,大概会写另一篇《立委两分钟》:

最大的遗憾,是被C++带偏了,没有把博士读完。

我回了一句:

yeh u know

因为我太知道了。

别人以为这是玩笑。

其实不是。

这是两个老帮菜隔着三十多年的人生,对视了一眼。

然后同时笑了。

因为我们都知道。

他说的就是他自己。

而他说的那个"如果",恰恰是我。

他当年离开我们共同的导师南下硅谷。

后来进了各种初创和大厂。现在还在某大厂负责某个用户亿万的产品。一路带风。

而我留在了另一条路上。为那个"该死"的文科博士 错过了90年代早期的热闹 但赶上了世纪末的泡沫。

两条不同的人生道路。最后交汇在硅谷。

有意思的是。

到了这个年龄。

我们开始更加理解和打趣彼此。

这大概是人性最奇妙的地方。

得到的,总会慢慢变成理所当然。

得不到的,却会在记忆里不断升值。

年轻时觉得人生是一道选择题。

后来才发现。

人生其实是一道放弃题。

你每选择或被选择一个答案。

同时也放弃了很多个候选。

而那些放弃或被放弃的答案。

会在未来很多年里。

时不时回来敲门。

告诉你:

也许这才是正确的那个。

事实上呢?

没人知道。

因为人生最大的魔幻就是:

现实人生只能活一次。

平行宇宙却可以幻想一万次。

现实永远比不过幻想。

因为幻想不用交房贷。

不用熬夜。

不用面对领导。

不用面对中年发福。

幻想永远停留在最美好的那一帧。

所以很多遗憾。

其实并不是因为选择错了。

而是因为我们发现:

冥冥之中 也许真有宿命。

最近又想到另一件事。

如果说我们这一代人的遗憾,是被时代分流 被时代选择。

那么年轻人的处境更加茫然和挑战:

他们正在被时代平权。当所有的门都打开 所有知识技能都唾手可得 当文科理科面临同样的职场萎缩 当新人招聘越来越少 新一代面临的不是职业选择和规划 他们面临的是没有选择 也无法规划。这种惶惑和无助 这种找不到自己位置 也缺乏目标的现实困境 正在成为蔓延各大高校的情绪。这绝不是一句要拥抱AI的鸡汤可以平复的。

过去几十年。

知识是一种稀缺资源。

谁掌握知识。

谁就拥有优势。

一本书。

一个学位。

一门技术。

都可能改变命运。

但AI来了以后。

知识变得像自来水。

以前要跑图书馆才能找到的东西。

今天一句话就能得到。

以前需要十年积累才能做出的设计。

今天几分钟就能生成。

知识还有用吗?

当然有用。

但拥有知识已经不再重要了。

因为人人都能拥有。

就像电很重要。

但拥有一个发电厂不再重要。

因为家家户户都有插座。

这是一个很奇怪的时代。

知识不再稀缺。

文字不再稀缺。

甚至连聪明本身都开始变得不稀缺。

人最后剩下的。

不是知识。

不是学历。

不是头衔。

而是经历。

AI知道什么叫失恋。

但它没有等过那个永远不会来的电话。

AI知道什么叫衰老。

但它没有看着父母一天天变老。

AI知道什么叫遗憾。

但它没有在六十岁的时候,突然想起四十年前那个没有选择的人生。

知识属于机器。

体验属于人。

效率属于机器。

感受属于人。

也许未来最珍贵的东西。

不是你知道什么。

而是你真正活过什么。

所以我突然越来越觉得。

人生最大的遗憾。

既不是太早被时代分流。

也不是太晚被时代平权。

而是在拥有了一段独一无二的人生之后。

还总想着去替另一个从未存在过的自己活着。

那个家伙。

就让他留在平行宇宙里吧。

至于我们。

继续把这一局玩完。

但最最关键 更让人忧心的还是

在这急剧变革的技术时代 如何推进社会福利制度的建设 确保AI红利全民共享 确保下一代不再面临无数大门敞开 却无路可走的挑战。

🎬 观看视频版(YouTube)

A Lifetime's Regret: Diverted Too Early by the Times

Thesis: Many people's destinies are determined not by ability, but by the first sorting table their era hands them.

Looking back on my life, I have two deep regrets. By the time large language models arrived to help compensate, the energy and opportunity for frontline battle had already passed. A sigh.

The first regret: Among the Class of '77, many of the brightest were "hijacked" by the window of foreign languages. English was the key to the world — but the ticket was so precious that many spent their entire lives stuck at the ticket gate. I was drafted into the humanities, not because I didn't apply for science. The era made the choice for me.

The second regret: the PhD phase. I had one foot in the door of coding and engineering. OOP and C++ were all the rage, I was hooked. But the thesis and degree pulled me away. I became a self-made manager — VP, Chief, whatever — knowing a little about everything but never again a frontline engineer.

This isn't simple personal regret. It's a sample of an era: when windows of choice are small, a person is shaped not by their interests, but by the shortages of their time.

Today's young people have all the tools. AI, programming, English, expression — everything can be re-learned. The era no longer opens just one door. The only question: with all the doors open, do you dare walk back in?

by Tuya

🎬 Watch the video version (YouTube)

立委两分钟:一辈子的遗憾,是太早被时代分流

回想这一辈子,我觉得自己有两大遗憾。等到大模型能帮助弥补遗憾的时候,一线闯荡的精力和条件已经不再,唏嘘。

第一个遗憾,是 77 级那一代人里,很多最该去打基础科学和工程硬仗的人,被外语这个窗口"劫持"了。这不是说外语不好。恰恰相反,那个年代,没有LLM通天塔,英语就是一把钥匙,是通向世界的门票。问题是,门票太珍贵,以至于很多人一辈子都留在了检票口。

我当年被文科收编,不是因为我没报考理工,大革命后77级的理工考卷我考的是数理化而不是文史。我仗着多年跟着广播英语节目自学的英语,决定加试英语。但制度上你不能把加试英语的人,违背考生意愿,一下子划拉到外语系吧。但生活就是这样,开人生道路的玩笑,没商量。这个意义上,是时代替我做了选择。

可地球人都知道,第一学位太重要了。本科像人生的第一块地基。地基打在哪里,后面的楼就大概率往哪里盖。错过了,也不是不能改,但你要付出的代价就大得多。理科生还能转文科,文科生想回去和理工科硬拼,基本就是赤手空拳上战场。

第二个遗憾,是博士阶段。那时我一只脚已经快踏进 coding / engineering 的门槛了。当时 OOP 和 C++ 特别时兴,我开始入迷。按理说,再咬咬牙,也许就能把自己锻造成一个真正的码农,曾考虑放弃那个该死的文科博士旅程。后来还是被论文、学位牵着走。

再后来,就更没有机会成为一线工程师了。因为我突然成了一个 self-made manager,VP、Chief啥的:什么都懂一点,什么都能说两句,什么坑都踩过一点,但真正坐下来一行一行写代码,已经不是主线。人到中年,记性也差了,系统命令与代码syntax 都记不住。

这不是简单的个人后悔。这是一个时代的样本:当社会资源稀缺、信息通道狭窄、选择窗口很小的时候,一个人很可能不是按照兴趣成长,而是按照时代的短缺被塑形。

今天的年轻人幸运得多,也残酷得多。幸运的是,工具都在手边;残酷的是,借口也少了。现在 AI、编程、工程能力、英语、写作、表达,很多东西都可以重学、现学,都唾手可得。时代不再替你只开一扇门。问题只剩一个:门都开了,你敢不敢重新走进去?

by Tuya

🎬 观看视频版(YouTube)

Software Finally Starts Adapting to Me

Software Finally Starts Adapting to Me

I've had a very strong feeling lately: I increasingly don't want to learn software anymore.

It's not just laziness — though I am lazy. More fundamentally, the old software logic was: you adapt to me. Where the buttons are, how the menus hide, how the workflows twist — you have to learn it all. If you can't learn, you're stupid; if you can't remember, you're old. Software features multiply, menus grow ever more complex — 90% of which you'll never use in your lifetime — but vendors can't restrain themselves from expanding coverage. This is a kind of "collective menu debt," yet every individual who only needs a fraction of those features must still repay it, must learn to penetrate the complex UI to find their own subset.

But now that AI agents have arrived, this logic can be reversed. A friend who develops agent platforms advocates exactly this, saying conditions are ripe to build software just for yourself.

In fact, I've recently been using Codex to build a tool specifically targeting my own pain points from years of digital life: an automated system that collects anything I'm interested in, auto-classifies, processes, structures, and archives it, ready for retrieval and summarization at any time. I don't need to learn it, because it grew out of my own habits. The ideal state isn't me adapting to generic software — it's custom software adapting to me.

This kind of software has one enormous advantage: it has no market, therefore no competition. It serves just one person. It doesn't need to please investors, chase DAU, pursue growth, or design "user retention." It just needs to make my life smoother, help me lose fewer things, help me think more clearly, and automate the manual workflows I used to do. That's enough.

Which brings me to a regret.

Looking back on my life, my deepest source of inadequacy is that I didn't study science or engineering as an undergraduate — I studied humanities instead. (It really wasn't my fault — I applied for science and engineering, but the first cohort of post-Cultural Revolution college entrants in 1977 barely knew English, so English wasn't a required subject but could be taken as a bonus. I thought the bonus English test would help my application, but the foreign language department, desperate for English-capable students, forcibly pulled me in. No negotiation.) But your first degree is, in some sense, your underlying operating system. If your foundation isn't solid enough, you can patch it later, upgrade it, install plugins — but that gap in fundamentals will always be there. This has been my Achilles' heel for decades.

Fortunately, large model agents have arrived. My requirement for myself is now simple: since I didn't study enough before, let the tools fill the gap. Let coding agents become my private science-and-engineering assistant and personal secretary. They don't replace my judgment, but they compensate for my weaknesses. I don't need a market-facing software matrix. I just need an increasingly handy, increasingly understanding toolbox.

Efficiency first, fit first. If it can help me retain what's in my mind and bridge what I didn't learn before, that's enough. This "personal dynamic knowledge base" agent is no simple project, but it's nearly operational. Looking at it now, building your own wheels for your own use isn't actually that hard.

🎬 Watch the video version

软件终于开始适配我了

我最近有一个很强烈的感觉:我越来越不想去学一个软件了。

不是因为我懒,当然我也懒。更根本的是,过去的软件逻辑是:你来适配我。按钮在哪里,菜单怎么藏,流程怎么绕,你都得学。你学不会,是你笨;你记不住,是你老了。软件功能越来越多,菜单越做越复杂,90% 以上你一辈子也用不上,但厂商为了扩展覆盖面无法节制。这也是一种"集体菜单债",但每一个只需要用其features零头的个体必须还,必须学会穿透复杂的UI去找到自己要的那个子集。

但现在 AI agent 出来以后,这个逻辑也可以反过来了。朋友中有agent平台开发者的,他就是这么倡导的,说条件成熟了,可以只为自己做软件。

其实我最近用 codex正在做一个工具,就是专门针对自己多年数字生活中的实际需求和痛点:这是一个自动收集我感兴趣的任何内容,并自动分类、处理、结构化沉淀以及随时检索和总结的工具。我不需要学它,因为它本来就是照着我的习惯长出来的。理想的状态不是我去适配一个通用软件,而是自己定制软件来适配我。

这种软件有一个特别大的好处:它根本没有市场,所以也没有竞争。它只服务我一个人。它不用讨好投资人,不用追求 DAU,不用搞增长,不用设计什么"用户留存"。它只要让我顺手,让我少丢东西,让我想得更清楚一点,把我以前手工流程自动化,就够了。

这让我想到一件遗憾。

回头看一辈子,我最气短心虚的地方,是当年本科没有学理工,而是文科(其实完全不是我的错,天地良心我报考的是理工,天知道大革命10年后的第一届大学生77级考生中没多少人懂英语,所以当年不作为必考项目,但可以加试。本以为加试英语可以帮助录取自己的志愿,但却被缺乏英语考生的外语专业强行拉进去,没商量)。但第一学历这个东西,某种意义上就是人的底层操作系统。你底层不够硬核,后来当然也能补丁,也能升级,也能装插件,但那种基本功上的差距,会一直在那里。这是我过去几十年的软肋。

好在大模型agent来了。我现在对自己的要求很简单:既然过去没学够,那就让工具补上。让 coding agent 变成我的私人理工科助手兼贴身秘书。它不替代我的判断,但它补我的短板。我不需要一个面向市场的软件矩阵。我只需要一个越来越合手、越来越懂我的工具箱。

效率第一,合手第一。能帮我把脑子里的东西留下来,能帮我把过去没学的东西接上,就够了。这个"个人动态知识库"的agent不算简单的项目,但快要跑通了。现在看来,自己造轮子自己用,并不困难。

🎬 观看视频版

Token Economics #6: Token and Intelligence — The Final Chapter

After writing this Token series for so long, I want to tackle one final, unavoidable question.

What exactly is the relationship between Token and intelligence?

Many readers wrote in after the earlier installments:

If AI training runs on Tokens, inference runs on Tokens, and Agents are voraciously consuming Tokens — doesn't that mean Token equals intelligence?

The answer is both yes and no.

Let's start with no. Because intelligence is clearly more than just Token.

Just as a person's thoughts are not equal to the words they speak. A scientist's great discovery is not equal to the few dozen pages of the published paper. Einstein's theory of relativity is not equal to the tens of thousands of words in that paper. Words are merely carriers of thought. Similarly, Token is merely a carrier of intelligence — whether in the form of text, voice, or video.

But if we conclude from this that Token is unimportant, that would also be wrong.

Because we can never see thought itself. We can only see the traces thought leaves behind. A sentence, an article, a piece of code, a design draft, a video. The same is true of large models. We can't see the billions of calculations inside the neural network. We can't see the weight matrices. We can't see Attention. The only thing we can perceive is Token.

We cannot directly trade intelligence, but we can trade Token. We cannot directly measure intelligence, but we can measure Token.

At this point, a more fitting analogy suddenly occurred to me.

Money.

Dollars, yuan, gold — none of them equals wealth. True wealth exists in land, factories, goods, services, technology, and labor. But why can't modern society function without money? Because money provides a unified form of value expression — what Marx called a commodity equivalent — that can be measured, circulated, traded, and accumulated.

Today's Token is playing a similar role. It is not intelligence itself, but it increasingly resembles the currency of intelligence.

Over the past few years, the entire AI industry has essentially been revolving around Token. During training, people discuss how many trillions of Tokens were used. During deployment, they discuss how many Tokens per second can be processed. When purchasing APIs, they discuss how much input Tokens cost and how much output Tokens cost. When Agents run, they discuss how many Tokens were consumed for input and output. Even competition between nations is increasingly manifesting as: who can produce high-quality Tokens more cheaply.

Thus Token has gradually evolved from a technical term into an economic concept.

Of course, there is one point that is particularly easy to confuse. The same Token plays entirely different roles during training versus inference.

During training, Token is more like ore. Massive amounts of data are shredded, compressed, refined. Countless Tokens are smelted into model weights during training. The process resembles steelmaking, oil refining, turning ore into steel.

Inference is completely different. The model is already trained. What users purchase is not the training process but the output results. At this stage, Token is more like electricity, like money — more like an intelligence product delivered to the user. You ask AI to write articles, write code, make presentations. What you receive is Token. Even video, images, and voice will ultimately be priced through Token.

So from the user's perspective, intelligence almost always appears wrapped in the cloak of Token. This is why many people get the feeling that Token equals intelligence. It's actually as natural as associating money with wealth — because money and wealth have always been two sides of the same coin. Token and intelligence are increasingly becoming two sides of the same coin.

But history also tells us: don't mistake money for wealth itself. Similarly, don't mistake Token for intelligence itself.

To summarize. What is Token? It is the standard unit of measurement after data is industrialized and fragmented. Why tokenize? Because only by breaking things down can we count them; only by counting can we train. Why is AI consuming ever more electricity? Because the entire industry is producing and consuming Token at massive scale. Why are Agents exploding? Because machines have begun producing, exchanging, and consuming Token themselves. Why is Token getting cheaper? Because industrialization is underway. Why are nations competing over Token? Because Token is becoming a new means of production.

Ultimately, Token is to intelligence what money is to wealth. It is not wealth itself, but it is wealth's most important form of expression. It is not intelligence itself, but it is the way intelligence is produced, circulated, traded, and perceived. The internet era flowed with information. The AI era flows with Token. And what flows behind Token may well be the thing humanity has begun to produce industrially for the first time: intelligence.

立委两分钟 · Token经济学之六「Token与智能」终章

写了这么久Token系列,最后想聊一个绕不开的问题。

Token和智能,到底是什么关系?

很多朋友看完前几篇后留言:

既然AI训练靠Token,推理靠Token,Agent也在疯狂消耗Token,那么Token是不是就等于智能?

答案既是,又不是。

先说不是。因为智能显然不只是Token。就像一个人的思想,不等于他说出来的话。一个科学家的伟大发现,不等于发表论文时那几十页纸。爱因斯坦的相对论,并不等于论文里的那几万个字。文字只是思想的载体。同样,Token也只是智能的载体——无论以文字的形式,还是声音或视频的形式。

但如果因此说Token不重要,那又错了。因为我们根本看不见思想本身。我们只能看见思想留下的这些痕迹。一句话、一篇文章、一段代码、一张设计图、一个视频。大模型也是如此。我们看不见神经网络内部亿万次计算,看不见权重矩阵,看不见Attention。我们唯一能感知的,就是Token。

我们无法直接交易智能,却能交易Token。我们无法直接计量智能,却能计量Token。

说到这里,我忽然想到一个更恰当的比喻。

货币。美元、人民币、黄金都不等于财富。真正的财富存在于土地、工厂、商品、服务、技术和劳动之中。但现代社会为什么离不开货币?因为货币提供了一种统一的价值表达形式——马克思说的商品等价物——可以被计量、被流通、被交易、被积累。

今天的Token正在扮演类似角色。它不是智能本身,却越来越像智能的货币。

过去几年里,整个AI产业其实都在围绕Token运转。训练模型的时候,大家讨论的是训练了多少万亿Token。部署模型的时候,大家讨论的是每秒能处理多少Token。购买API的时候,大家讨论的是推理侧输入token多少钱,输出token多少钱。Agent运行的时候,大家讨论的也是输入和输出消耗了多少Token。甚至连国家之间的竞争,也开始逐渐表现为:谁能更便宜地生产高质量Token。

于是Token从一个技术术语,慢慢变成了一个经济学概念。

当然,这里还有一个特别容易让人混淆的地方。同样是Token,训练阶段和推理阶段其实完全不同。训练时,Token更像矿石——海量数据被切碎、压缩、提炼,无数Token在训练过程中被"熔炼"进模型权重,这一过程像炼钢,像炼油,像把矿石变成钢铁。

而推理阶段则完全不同。模型已经训练完成。用户购买的不是训练过程,用户购买的是输出结果。此时Token更像电力、货币,更像一种被交付给用户的智能产品。你让AI写文章、写代码、做PPT,收到的是Token。甚至视频、图片和语音,最终也都会经过Token形式进行计价。

所以从用户视角看,智能几乎总是披着Token的外衣出现。这也是为什么很多人会产生一种感觉:Token就是智能。其实这和看见货币就想到财富一样自然。因为货币和财富本来就互为表里。Token和智能,也越来越互为表里。

但历史也告诉我们:不要把货币误认为财富本身。同样,也不要把Token误认为智能本身。

总结一下。Token是什么?它是数据被工业化切分后的标准计量单元。为什么要Token化?因为只有打碎,才能统计;只有统计,才能训练。为什么AI越来越费电?因为整个产业都在大规模生产和消耗Token。为什么Agent会爆发?因为机器开始自己生产、交换和消耗Token。为什么Token越来越便宜?因为工业化正在发生。为什么各国开始竞争Token?因为Token正在成为一种新的生产资料。

而最终,Token之于智能,就像货币之于财富。它不是财富本身,却是财富最重要的表现形式。它不是智能本身,却是智能被生产、流通、交易和感知的方式。互联网时代流动的是信息,AI时代流动的是Token。而Token背后流动的,或许正是人类第一次开始工业化生产的东西:智能。

Liwei's Two Minutes · Token Economics in Plain Language Part 5: The New Currency of the AI Era

Token as the new currency of the AI era - illustration
AI时代的新货币:Token 经济示意图

The previous four installments of the Token Economics series covered:

What Token is Why Token consumes electricity Why Agents burn Token like crazy Why Token keeps getting cheaper

All about Token production, consumption, and cost.

But the most important question remains unanswered:

Why is the entire world suddenly measuring everything in Token?

Put differently:

Could Token become the "currency" of the future digital economy?

The first four pieces looked at the trees.

Today, Part 5 starts looking at the forest.

Liwei's Two Minutes · Plain Language Part 5 — The New Currency of the AI Era

Many people think Token is just a technical term.

It's starting to look like much more than that.

I even suspect that, looking back decades from now, Token might become a fundamental economic indicator — on par with electricity, steel, and oil.

Why?

Because throughout human history, every industrial revolution eventually produces a unified unit of measurement.

The steam age ran on coal.

The electric age ran on kilowatt-hours.

The internet age runs on traffic.

And in the AI age, increasingly, everyone is starting to measure things in Token.

The reason is simple.

Everything AI does today ultimately comes down to Token.

Writing an article? Burning Token. Writing code? Burning Token. Making a PowerPoint? Burning Token. Generating video? Burning Token. An agent running a project? Burning Token. Even future robots doing physical work — behind the scenes, still burning Token.

And so a strange phenomenon emerges.

We used to buy software — we paid for features.

Now it's starting to feel like: we're buying Token.

Companies used to ask about IT systems: "How much per license?"

Now they're starting to ask: "How much per million Token?"

This is actually very similar to the power grid.

No one cares how many times the generator spun.

Everyone cares about one thing: how much per kilowatt-hour.

The future may be the same.

No one will care how many parameters a model has.

Everyone will care about: how much per million Token. What's the quality. Is it reliable enough.

At that point, Token shifts from a technical concept to an economic one.

And economics has a brutal law: any standardized commodity eventually gets commoditized into a race to the bottom.

Steel went through it. Display panels went through it. Solar panels went through it.

Today's Token is walking the same path.

So while many people are still debating: which model is number one, which model is number two.

The industry is increasingly focused on: who can produce high-quality Token at the lowest cost.

Because real large-scale applications, in the end, all come down to the math.

The boss won't ask: "Did you use the world's number one model?"

The boss will only ask: "How much did we cut costs?"

And so the AI industry starts to look less like a lab and more like manufacturing.

Many people understand AI competition as: a contest of brilliant scientists.

It's increasingly looking like: a contest of national industrial systems.

Who has cheaper electricity. Who has more data centers. Who has a more complete supply chain. Who can drive down Token prices. That's who has the edge.

So a new phenomenon may emerge in future international competition:

Alongside energy exporters and manufacturing exporters, we may see a new category: Token exporters.

Whoever can consistently export cheap, high-quality Token to the world may occupy a pivotal position in the next-generation digital economy.

In the internet age, data flowed.

In the AI age, what really flows may be Token.

And everything happening today might just be the opening act.

立委两分钟 · token经济学大白话之五:AI时代的新货币

Token as the new currency of the AI era - illustration
AI时代的新货币:Token 经济示意图

token经济学大白话序列前面四篇 我讲了:

Token是什么 Token为什么费电 Agent为什么疯狂烧Token Token为什么越来越便宜

讲的都是:

Token的生产、消费和成本。

但还没讲那个最重要的问题:

为什么全世界突然开始用Token来衡量一切?

或者说:

Token会不会成为未来数字经济的"货币"?

前四篇讲的是树木。

今天第五篇开始看森林。

立委两分钟 · 大白话之五 题目叫

AI时代的新货币

很多人觉得:

Token只是个技术词。

其实越来越不像了。

我甚至怀疑,

几十年后回头看,

Token可能会变成一种类似: 电力、 钢铁、 石油

那样的基础经济指标。

为什么?

因为人类历史上,

每次工业革命,

最后都会出现一个统一计量单位。

蒸汽时代看煤。

电气时代看电。

互联网时代看流量。

而AI时代,

越来越多人开始看Token。

原因很简单。

因为今天AI干的所有事情,

最后都会落到Token上。

写文章,烧Token。 写代码,烧Token。 做PPT,烧Token。 生成视频,烧Token。 Agent跑项目,烧Token。 甚至未来机器人干活,背后依然在烧Token。

于是一个奇怪的现象出现了。

以前我们买软件,买的是功能。 现在越来越像:买Token。

以前企业采购IT系统,问的是:多少钱一套? 现在开始问:多少钱一百万Token?

这其实很像电网。

没有人关心发电机转了多少圈。 大家只关心:一度电多少钱。

未来也可能一样。

没有人关心模型多少参数。 大家只关心:一百万Token多少钱。质量怎么样。够不够稳定。

这时候,Token开始从技术概念,变成经济概念。

而经济学有个很残酷的规律:任何标准化商品,最终都会被卷。

钢铁如此。面板如此。光伏如此。

今天的Token,也正在走这条路。

所以很多人还在争论:哪个模型第一。哪个模型第二。

但产业界越来越关心的是:谁能最便宜地生产高质量Token。

因为真正的大规模应用,最后都得算账。

老板不会问:"你用了世界第一模型吗?" 老板只会问:"成本降了多少?"

于是AI产业开始越来越像制造业。而不是实验室。

很多人把AI竞争理解成:天才科学家的竞争。

其实越来越像:国家工业体系的竞争。

谁有更便宜的电。谁有更多的数据中心。谁有更完整的供应链。谁能把Token价格打下来。谁就有优势。

所以未来的国际竞争,可能出现一个新现象:

能源出口国、制造业出口国、之外,再多一个:Token出口国。

谁能持续向全世界输出便宜而高质量的Token,谁就可能占据下一代数字经济的重要位置。

互联网时代,数据在流动。

AI时代,真正流动的,可能是Token。

而今天发生的一切,也许只是这个时代的开场白。

FSD's Emergency Avoidance — Sometimes a Ghost, Sometimes a God

Yesterday I watched a real-time dashcam video of a Tesla making an emergency swerve to avoid a car that suddenly shot up from the left lane entrance ramp. My immediate thought: human reaction speed simply can't handle that.

In that situation, most of us instinctively slam the brakes — which on a highway is itself dangerous. Being able to safely dodge to the right lane like FSD did is clearly the better strategy. Unfortunately, most human drivers just can't pull it off.

After driving with FSD for a long time, you develop a very strange kind of trust.

Not that it's always right. Not that you always understand why it did what it did.

But you realize: many of those heart-stopping emergency maneuvers that made you break out in a cold sweat — when you replay them later, most of them genuinely protected your safety.

Over all my years of manual driving, my default in emergencies was always the reflexive hard brake. Because only by slowing down did I feel any sense of control. It wasn't that I didn't know how to steer — I was afraid to. Because you have to check: is the right lane clear? Is there a car in my blind spot? How fast is the car behind me? Is the other driver a novice? Are they panicking? This entire judgment chain is serial — the human brain simply can't process it fast enough.

So most people, like me, instinctively hit the brakes.

But FSD is different. It's not just that it has watched countless expert drivers — it's more like a driver with many sets of eyes and reaction speeds many times faster than ours. It's constantly watching all four directions, constantly computing the space, speed, and risk in every lane.

That's why sometimes, it dares to execute lane escapes that we wouldn't dare attempt.

Of course, this brings another problem: sometimes it's overly cautious. A bird suddenly flies past in front — it might trigger an avoidance reaction. And some emergency dodges, even in hindsight, we may not fully understand. The infamous "phantom braking" from a few years ago is the classic example: tree shadows, bridge shadows, lighting changes, even road texture could trigger false alarms.

But here's what's remarkable: phantom braking has almost disappeared in recent years. I've barely encountered it myself in over a year. This tells us it's no longer just "seeing something that looks like danger" — it's increasingly understanding: what will actually hit me, and what is merely a shadow.

This is the most fascinating thing about FSD.

In its early days, it sometimes acted like a clumsy student. Now it behaves like an inhumanly fast-reacting entity.

Yesterday it executed one particular avoidance maneuver that I didn't fully understand either. Maybe it overreacted. Maybe it saw a risk we didn't. But I'm not going to dig deeper into it.

Because after long-term use, my trust in it doesn't come from faith — it comes from replaying every drive, time after time.

The vast majority of the time, those tense maneuvers that felt excessive in the moment — looking back, they were protecting us. It is far more cautious and safe than this old-timer-among-clumsy-drivers.

And that's enough.

What will truly transform driving in the future may not be whether it can drive like a human.

It's that it finally can drive unlike a human.

FSD 的紧急避让,有时候像鬼,有时候像神

昨天看一条实况视频 是特斯拉紧急避让左道入口急速冲上来的车辆,当时感觉人的反应速度是不行的。人在这种情况下 几乎本能紧急踩刹车 在高速上也是有危险的。能像FSD那样及时安全避让道右线显然是更好的策略 可惜我们人类司机大多做不到。开 FSD 时间长了以后,人会慢慢形成一种很奇怪的信任。

不是说它每一次都对,也不是说你每一次都懂它为什么这么做。

而是你会发现:很多当时让你一身冷汗的紧急动作,事后复盘,居然大多是真正保护安全的。

手动开车的那么多年 我在紧急情况下 做的最多的就是下意识急刹 因为只有慢了才觉得可控。不是不会打方向,而是不敢。因为你要先看右 lane 有没有车,盲区有没有车,后车快不快,对方是不是新手,是不是手忙脚乱。这一套判断,觉得人类是串行的,来不及。

所以不少人跟我一样只能本能地踩刹车。

但 FSD 不一样。它不仅仅是看过无数老司机开车,它更像一个长了很多只眼睛、反应速度比我们快很多倍的驾驶。它一直在同时看前后左右,一直在算每条 lane 的空间、速度和风险。

所以有些时候,它敢做我们不敢做的 lane escape。

当然,这也会带来另一种问题:它有时候过于谨慎。前面突然飞过一只鸟,它也可能有避让反应。还有些紧急避让,我们事后也未必完全理解。前几年所谓"鬼影刹车"就是典型例子:树影、桥影、光照变化,甚至路面纹理,都可能让系统误判。

但这几年下来,一个很明显的变化是:鬼影刹车几乎消失了。至少我自己一年多几乎没遇到过。说明它已经不只是"看见一个像危险的东西",而是在越来越理解:什么东西真的会撞上我,什么只是影子。

这就是 FSD 最有意思的地方。

它早期有时候像个笨学生,现在又像个反应快得离谱的非人实体。

昨天它有一次特定避让,我也没完全看懂。也许没必要反应过度,也许它看到了我们没看到的风险。但我不打算深究到底了。

因为长期使用下来,我对它的判断不是来自信仰,而是来自一次次路上的复盘。

绝大多数情况下,它那些当时显得紧张的动作,事后看,是在保护我们。它比我这个老司机中的笨鸟 谨慎安全太多了。

这就够了。

未来真正改变驾驶的,也许不是它能不能像人一样开车。

而是它终于也可以不像人那样开车。

Two Minutes with Liwei: AI Doesn't Pay Taxes — Trouble Is Inevitable

For the first time in human history, there is a kind of "employee" that can work 24 hours a day — no sleep, no salary, no social security, no rights claims, no strikes, no sickness, no retirement.

And the truly absurd part: it can replicate itself.

This thing is called an **AI agent**.

---

## Here's the problem

In the old world, when a boss hired 1,000 people, the state collected: income tax, social security, health insurance, unemployment insurance, pension contributions.

Now the boss fires all 1,000 and replaces them with AI. Efficiency skyrockets. Profits skyrocket. Stock prices skyrocket.

**But the tax base evaporates.**

The state can no longer collect revenue. The unemployed are still there.

And so we arrive at a surreal paradox:

> AI is simultaneously driving unprecedented productivity growth and hollowing out the fiscal foundation of society.

The entire modern state is built on the premise that human labor pays taxes. AGI is erasing "human labor" itself.

**This is the real nuclear bomb.**

---

## "Just learn AI" is wishful thinking

Many people still comfort themselves: "Just pick up some AI skills, transition to a new role, and you'll be fine."

This is increasingly delusional. Because the cruelest part is: even the act of "using AI" will eventually be automated by AI.

You think the jobs of the future are "AI Operator," "Prompt Engineer," "Agent Manager" — but agents are already using agents. Even "prompt engineer," that transitional role, may turn out to be nothing more than a temporary bubble in a technological wave.

Two years ago, the entire internet was selling prompt engineering courses. Today, that looks like a punchline.

---

## This time is different

Past technological revolutions created new jobs. The automobile killed the horse carriage but created the auto mechanic. The internet killed print newspapers but created e-commerce and live-streamers.

This time is different. The new systems AI creates are inherently *de-peopled*. Because AI's single greatest advantage is precisely this: **it doesn't need people.**

---

## AI must pay taxes

If AI replaces people, who pays the taxes?

The answer is simple: **AI itself must pay taxes.**

For every token you consume, every GPU you run, every inference you perform, every kilowatt of AI electricity — you pay a corresponding "AI social tax."

Because when you used to hire a person, you were already paying those taxes. Now you replace the person with AI and bear zero social cost — that is fundamentally unfair.

Many will shout: "You're stifling technological progress!"

**So what?**

Is the sole purpose of human society to allow capital and compute to multiply without limit?

- The Industrial Revolution polluted the environment → we got environmental taxes.
- Cars consume public roads → we got fuel taxes.
- AI destroys the employment tax base → why can't we have an AI tax?

---

## It takes everyone

The real danger is not that AI is too powerful. It's that once AI becomes powerful enough, the entire social revenue structure collapses.

And here's the darkest irony: the people most likely to support an AI tax in the future may be exactly those who understand AI best. Because they know most clearly: once this thing truly matures, it doesn't just replace the "bottom rung."

**It sweeps the board.**

White-collar workers, programmers, designers, analysts, customer service, translators, paralegals, researchers — no one escapes.

In the past, society could comfort people with one line: "You just didn't work hard enough."

But the cruelest truth of the AGI era is this: sometimes, it's not that you didn't work hard. It's that you, as a member of the species "human employee," are beginning to lose economic viability altogether.

---

*Two Minutes with Liwei · 2024*

by Tuya

立委两分钟:AI 不交税,迟早出事

人类历史上第一次,一种"员工",可以 24 小时工作、不用睡觉、不要工资、不交社保、不会维权、不会罢工、不会生病、不会退休。

更离谱的是:它还能自己复制自己。

这东西叫 **AI agent**。

---

## 问题来了

以前老板雇 1000 人。国家有:个税、社保、医保、失业金、养老金。

现在老板把 1000 人裁掉,换成 AI。效率暴涨。利润暴涨。股价暴涨。

**但税基没了。**

国家收不到钱了。失业的人却还在。

于是出现了一个极其魔幻的局面:

> AI 一边疯狂提高生产率,一边疯狂掏空社会财政基础。

整个现代国家,本质上建立在"人类劳动纳税"之上。而 AGI 正在把"人类劳动"本身抹掉。

**这才是真正的核弹。**

---

## "学 AI 就好"是鸡汤

很多人还停留在:"学一点 AI,以后转型新岗位就好了。"

这其实越来越像鸡汤。因为最残酷的地方在于:连"使用 AI"本身,最后都会被 AI 自动化。

你以为未来岗位是 "AI 操作员"、"Prompt Engineer"、"Agent 管理师"——结果 agent 自己就在用 agent。人类连"提示词工程师"这种过渡岗位,可能都只是技术浪潮里的临时泡沫。

两年前,全网都在卖 prompt engineering 课程。今天再看,像时代笑话。

---

## 这次不一样

过去技术革命会创造新岗位:汽车淘汰马车,但创造修车工。互联网淘汰报刊,但创造电商和主播。

而这次不一样。AI 创造的新系统,天然就是"少人化"的。因为 AI 最大的优势,恰恰就是:**不需要人。**

---

## AI 必须交税

如果 AI 替代了人,那谁来交税?

答案很简单:**AI 本身必须交税。**

你用了多少 token、多少 GPU、多少推理、多少 AI 电力,就缴多少"AI 社会税"。

因为以前你雇人,你本来就在缴税。现在你用 AI 替代人,却一分钱社会成本不承担——这本身就不合理。

很多人一听就急:"你这是阻碍科技进步!"

**So what?**

难道人类社会的唯一目标,就是让资本和算力无限增殖?

- 工业革命污染环境 → 后来有环保税
- 汽车消耗公共道路 → 后来有燃油税
- AI 摧毁就业税基 → 为什么不能有 AI 税?

---

## 通杀一切

真正危险的,不是 AI 太强。而是 AI 太强之后,整个社会收入结构崩了。

最黑色幽默的是:未来最支持 AI 税的人,可能恰恰是最懂 AI 的那批人。因为他们最清楚:这东西一旦真正成熟,不是替代"底层",而是**通杀。**

白领、程序员、设计师、分析师、客服、翻译、律师助理、研究员……一个都跑不掉。

过去社会还能用一句话安慰人:"你只是不够努力。"

但 AGI 时代最残酷的地方在于:有时候,不是你不努力。而是你作为"人类员工"这个物种,开始整体失去经济性了。

---

*立委两分钟 · 2024*

by Tuya

The Industrialization of Tokens — Liwei 2 Minutes · Token Economics in Plain Language (Part 4)

Recently, many people suddenly noticed something:

DeepSeek cut prices again.

And not by a little.

The kind of cut where you just flip the table over. By the end of May, prices dropped to a quarter of what they were.

Many people's first reaction: Chinese AI companies are starting a price war.

But I increasingly feel that understanding this only as a "price war" is way too shallow.

Because what's really happening here might be this: tokens are becoming industrialized.

What does that mean?

For the past two years, the global AI world has operated under a quiet assumption: high-quality tokens are expensive.

Because: models are expensive, GPUs are expensive, training is expensive, electricity is expensive.

So everyone defaulted to the idea that AI must be a high-margin industry.

Until Chinese models started slashing prices like crazy.

And for the first time, many people discovered: tokens might actually be like steel, display panels, solar panels, lithium batteries — entering a terrifying process of industrial cost reduction.

Behind this story is something deeply Chinese.

What do I mean?

American AI companies often follow a path of "high performance, high margins, high valuation." A bit like luxury goods.

But once Chinese companies start competing, things tend to look different: "First, crush the cost."

Then: massive scale, infrastructure-ization, supply-chain-ization, engineering optimization, labor optimization, power optimization. Eventually grinding the entire industry into "cabbage-price industrial capability."

Over the past twenty years, China has done this repeatedly. Solar power, EV batteries, drones, display panels, e-commerce, high-speed rail... The pattern is roughly the same.

Early stage: others think it's high tech. Later stage: China industrializes it. End result: profits vanish, but production capacity blankets the world.

Today, tokens are starting to look more and more like this story.

Because tokens are not fundamentally mysterious. They are, in the end, "data processing capability produced by an industrial system." And what is an industrial system best at? Reducing costs.

So now an especially interesting dynamic has emerged: American frontier models may still maintain the strongest capability. But Chinese models are closing in fast — maybe a few months behind, maybe still a bit weaker in certain areas. But the price is already shockingly low.

So developers around the world are facing a very pragmatic choice: "Do I need the world's strongest, or do I need strong enough + ten times cheaper?"

This question is deadly.

Because in most of the business world, what ultimately matters is not "theoretical peak performance" but "overall cost-effectiveness."

As tokens get cheaper and cheaper, many AI applications that were previously "too expensive to run" suddenly become viable.

In the past, AI was like a five-star hotel. Now it's starting to look like tap water.

Developers used to worry: "Is this agent going to burn dozens of dollars a day?" Now the attitude is shifting to: "Whatever, let it run."

And so token consumption begins to explode further. Which in turn drives even larger data centers, cheaper inference chips, more aggressive engineering optimization. The whole system enters a kind of industrial flywheel.

The most interesting part is: what this competition ultimately comes down to may no longer be just the model.

It's about: who has cheaper electricity; who has more data centers; who has cheaper engineers; who has a more complete supply chain; who can better tolerate thin margins.

In other words: AI competition is increasingly looking like modern industrial system competition, not just lab competition.

Many people still think of AI as "a few brilliant scientists changing the world." But what it increasingly resembles is "an entire national industrial system collectively entering the field to produce tokens."

In the internet era, China's greatest strength was "application industrialization." In the AI era, what might be truly terrifying about China is: token industrialization.

And as token prices keep falling, developers around the world will ultimately vote with their feet. Because the vast majority of companies, in the end, have to do the math.

by Tuya

token的工业品化——立委两分钟 · token经济学大白话(四)

很多人最近忽然发现:

DeepSeek 又降价了。

而且不是小降。

是那种: "桌子直接掀了"的降法 五月底一杆子降到原价的四分之一。

很多人第一反应是:

中国AI公司开始打价格战了。

但我越来越觉得, 这件事如果只理解成"价格战",其实太浅了。

因为这里真正发生的, 可能是:

token开始"工业品化"了。

什么意思?

过去两年, 全世界AI圈其实一直有个默认前提:

高质量token很贵。

因为: 模型贵、 GPU贵、 训练贵、 电贵。

于是大家默认:

AI一定是高利润行业。

直到中国模型开始疯狂降价。

很多人第一次发现:

原来token也可能像:

钢铁、 面板、 光伏、 锂电池

一样, 进入一种恐怖的工业化降本过程。

这件事背后,其实非常"中国"。

什么意思?

美国AI公司, 很多走的是:

"高性能、高毛利、高估值"

路线。

有点像奢侈品。

而中国公司一旦卷起来, 往往会变成另一种画风:

"先把成本打穿。"

然后:

大规模、 基础设施化、 供应链化、 工程优化、 人力优化、 电力优化。

最后把整个行业, 卷成:

"白菜价工业能力"。

过去二十年, 中国其实已经反复干过很多次这种事。

光伏、 动力电池、 无人机、 面板、 电商、 高铁……

路径都差不多。

前期: 别人觉得是高科技。

后期: 中国开始把它工业化。

最后全球发现:

利润没了, 但产能已经铺满世界。

今天token, 开始越来越像这个故事。

因为token本质上并不神秘。

它终究是一种:

"被工业体系生产出来的数据处理能力。"

而工业体系最擅长什么?

降成本。

于是现在发生了一个特别有意思的变化:

美国头部模型, 仍然可能保持最强能力。

但中国模型, 正在疯狂逼近。

也许落后几个月, 也许某些能力还差一点。

但价格, 已经开始低到吓人。

于是全世界开发者开始出现一种非常现实的选择:

"我到底是需要世界最强, 还是需要: 足够强 + 便宜十倍?"

这问题太致命了。

因为大多数商业世界, 最后拼的都不是:

"理论最强性能"。

而是:

"综合性价比"。

当token越来越便宜, 很多以前"不舍得开"的AI应用, suddenly 就能开了。

过去:

AI像五星级酒店。

现在开始:

像自来水。

过去开发者还担心:

"这个Agent会不会一天烧掉几十美元?"

现在开始变成:

"算了,让它自己跑吧。"

于是, token消耗量开始进一步爆炸。

而这又会反过来推动:

更大规模的数据中心、 更低成本的推理芯片、 更激进的工程优化。

整个系统开始进入一种:

工业飞轮。

最有意思的是:

这场竞争最后拼的, 可能已经不只是模型。

而是:

谁的电更便宜; 谁的数据中心更多; 谁的工程师更便宜; 谁的供应链更完整; 谁更能承受薄利润。

也就是说:

AI竞争, 正在越来越像:

现代工业体系竞争。

而不只是实验室竞争。

很多人还把AI理解成: "几个天才科学家改变世界。"

但今天越来越像的是:

"整个国家级工业体系, 正在集体下场生产token。"

互联网时代, 中国最强的是"应用工业化"。

AI时代, 中国可能真正恐怖的地方是:

token工业化。

而当token价格不断下降, 全世界开发者最终会用脚投票。

因为绝大多数公司, 最后都得算账。

Is the AI Bubble Real?

The truly dangerous thing about the AI bubble isn't the technology.

It's that the entire world is front-loading financing for "decades of future intelligence demand" — all at once.

During the mobile internet era, people burned cash, sure. But they found revenue fast. Food delivery had customers. Ride-hailing had riders. E-commerce had buyers. Short videos had viewers. Ads had advertisers. Consumer-facing sectors — food, clothing, housing, transport, communication, entertainment, shopping — were all low-hanging fruit. The business loop closed quickly.

But AI is different. To this day, most of what's genuinely deployed at scale is still: writing weekly reports, making slides, generating images, customer service bots, coding assistants. Valuable? Yes. That's not the problem.

The real problem is this: the capital markets are already betting at the scale of "everyone consuming intelligence all the time." GPUs bought first. Data centers built first. Debt taken on first. Valuations pumped first. Pension funds entered first. The world is building "intelligence power plants" at unprecedented speed.

But here's the question: who, exactly, is going to consume intelligence the way we consume electricity today — continuously, at scale?

The biggest gamble in AI right now isn't whether models will get smarter. It's whether the explosion of B2B vertical applications can outpace the depletion of funding, GPU depreciation, data center debt, and the capital market's patience.

If Agentic AI genuinely penetrates core enterprise workflows — turning productivity gains into real profits — then a lot of today's crazed investments will be vindicated by history. But if the growth of real demand moves slower than the pace the capital markets have already priced in, then a lot of what today represents "the future" of AI may end up as: piles of power-hungry GPUs generating insufficient cash flow.

The railroad changed the world. Railroad stocks still crashed. The internet changed the world. Dot-coms still littered the battlefield. AI will probably change the world too. But a technological revolution being real has never meant a bubble doesn't exist.

AI泡沫论站得住吗

AI泡沫真正危险的地方,不是技术。

而是整个世界,正在提前为“未来几十年的智能需求”一次性融资。

移动互联网时代,大家烧钱归烧钱,但至少很快就找到了现金流。外卖有人点。网约车有人坐。电商有人买。短视频有人刷。广告有人投。to C 的衣食住行通信娱乐购物,都是低枝果实。商业闭环来得极快。

但 AI 不一样。AI 到今天为止,真正大规模落地的,很多仍然是:写周报、做 PPT、生成图片、客服机器人、代码辅助。当然它有价值。问题不在这里。

真正的问题是:整个资本市场,已经提前按“未来全民消耗智能”的规模开始下注。GPU 先买。数据中心先建。债务先借。估值先涨。养老金先入场。整个世界正在以前所未有的速度,建设“智能发电厂”。

但问题来了:到底谁会像今天消耗电力一样,持续消耗智能?

AI 今天最大的赌局,不是模型会不会变聪明。而是:to B 垂直应用的大爆发,能不能跑赢资金链断裂、GPU 折旧、数据中心债务和资本市场耐心耗尽的速度。

如果 Agentic AI 真能进入企业核心流程,把生产力提升变成真实利润,那么今天很多疯狂投资会被历史洗白。但如果真实需求增长速度,慢于资本市场已经提前透支的速度,那今天很多代表“未来”的 AI 资产,最后可能只是:一堆无法产生足够现金流的高耗电 GPU。

铁路改变了世界。铁路股票照样崩过。互联网改变了世界。dot-com 照样尸横遍野。AI 很可能也会改变世界。但技术革命是真的,从来不意味着泡沫不存在。

A Leaky-Sieve Reasoning System Just Started Doing Real Math

What really sent a chill down my spine in the AI world these past few days wasn't a funding round. It wasn't a product launch.

It was this:

OpenAI's unreported general-purpose reasoning model reportedly solved the Erdős planar unit distance problem — first posed in 1946.

A decades-old problem that mathematicians couldn't crack.

What shook me wasn't that "it solved it."

It was how it solved it.

The full chain of thought reportedly printed out to 125 pages.

Not the kind of genius epiphany you see in movies.

Quite the opposite.

All trial and error. Dead ends. Backtracking. Repeated reversals and detours.

Like a mentally shattered grad student grinding away in a mountain of scratch paper.

But here's the thing:

It actually found the door in the end.

And what's most interesting:

This wasn't a math-specific model.

It was a general-purpose reasoning model.

Most people haven't grasped what this means yet.

Over the past few years, there's been a loud contrarian voice in the AI world.

The most prominent being the LeCun camp.

Their long-held position:

LLMs don't do real reasoning. Just statistical language modeling.

Then, as reasoning capabilities visibly improved, they doubled down:

These so-called reasoning outputs are merely crude imitations of human reasoning.

They're not entirely wrong.

Today's large model reasoning does resemble a sieve riddled with holes.

Wild speculation. Frequent wrong turns. Logic that collapses mid-stream.

But the LeCun camp may have underestimated one thing:

Intelligence doesn't always require "perfect reasoning."

With enough scale, sufficiently broad search, and strong self-reflection and course-correction,

a kind of "rough but effective" intelligence can suddenly emerge.

And mathematics and programming happen to be where this breaks through first.

Because they share one crucial property:

Verifiability.

You can flail. Generate wildly. Even "guess blindly."

But in the end, the verifier tells you:

Right.

Or wrong.

And so for the first time, AI enters a deeply unsettling state:

It may not actually "understand the world,"

yet it can already conduct effective exploration near the frontier of human knowledge.

That alone is staggering.

Now, LeCun isn't entirely without a point.

He says:

The real world isn't like mathematics.

In the real world, many problems are unverifiable, inexhaustible, and impossible to formalize in language.

I agree with that.

But the problem is:

His critique of LLMs and reasoning is far too absolute.

Especially now, after mainstream approaches have broken through time and again,

his "tear it all down to build anew" contrarian stance feels increasingly out of touch.

More critically:

The "bypass language, prioritize visual world models" approach he's championed for years

has yet to produce any truly industry-shaking result.

At least not so far.

So the real question today is no longer:

"Do LLMs have genuine intelligence?"

It's a more dangerous one:

If a reasoning system "as leaky as a sieve" is already contributing to real mathematical discoveries,

then with more scaling...

Isn't that superintelligence?

by Tuya

漏得跟筛子似的推理,已经开始数学发现了

这两天 AI 圈真正让我后背发凉的,不是什么融资,不是什么发布会。

而是一个消息:

OpenAI 一个未公开的通用推理模型,据说解决了 Erdős 1946 年提出的平面单位距离问题。

这是数学界几十年没人真正攻下来的老难题。

让我震撼的,不是"它做出来了"。

而是它怎么做出来的。

据说完整 chain of thought 打印出来长达 125 页。

里面不是电影里那种天才顿悟。

恰恰相反。

全是试错、绕路、反复推翻、弯路回退。

像一个精神快崩溃的研究生,在草稿纸堆里死磕。

但问题在于:

它最后居然真的摸到了门。

而且最有意思的是

这不是数学专用模型。

而是通用 reasoning model。

这件事的意义,很多人还没意识到。

过去几年,AI 圈一直有一种强烈的反主流声音。

其中最典型的就是 LeCun 那一路。

他们长期认为:

LLM 没有真正推理,只是语言统计。

后来眼看 reasoning 越来越强,又进一步解释说:

这些推理,不过是对人类推理的一种拙劣模仿。

这话其实不能说全错。

今天的大模型推理,确实像个漏洞百出的筛子。

经常胡思乱想,经常走错路,经常逻辑崩盘。

但 LeCun 那一路可能低估了一件事:

很多时候,智能未必需要"完美推理"。

只要规模足够大,搜索足够广,反思和修正能力足够强,

一种"粗糙但有效"的智能,也可能突然涌现。

而数学和编程,恰恰是这种能力最容易率先突破的地方。

因为这两个领域有一个关键特点:

可验证。

你可以乱试,可以疯狂生成,甚至可以"瞎蒙"。

但最后 verifier 会告诉你:

对,还是错。

于是 AI 第一次开始出现一种非常诡异的状态:

它可能并不真在"理解世界",

却已经能在某些人类知识头部附近,进行有效探索。

这一点其实已经非常惊人。

要说 LeCun,他也不是完全没道理。

他说:

现实世界不像数学。

现实世界很多问题不可验证、不可穷举、不可语言化。

这一点我同意。

但问题是:

他对 LLM 和 reasoning 的批判,太绝对了。

尤其这两年,主流路线一次次突破之后,

他那种"不破不立"的反潮流姿态,显得有点跟现实脱节。

更关键的是:

他这些年一直鼓吹的"绕过语言、优先视觉世界模型"的路线,

直到今天,还没有出现真正震撼行业的成果。

至少目前没有。

所以今天最值得关注的,已经不是:

"LLM 有没有真正智能"。

而是另一个更危险的问题:

如果这种"漏得跟筛子似的"推理系统,已经开始参与有效的数学发现,

那再 scaling 下去,它不就是超级智能吗?

by Tuya

Liwei Two Minutes #3: Why Do Agents Suddenly Feel Human?

Liwei Two Minutes: Token Economics in Plain Language #3 — Why Do Agents Suddenly Feel Human?

People used to think ChatGPT was already very human-like. It's not. Not even close.

Why? Because traditional chatbots are fundamentally "one question, one answer." You ask one thing, it replies once. Like a high-end customer service rep.

The real change happened when AI started "working on its own." That's the hottest thing right now: Agents.

The first time you play with an Agent, it's shocking. It suddenly acts like a real employee.

It breaks down tasks on its own, writes code, runs tests, reports errors, fixes bugs, keeps going. It even "talks to itself" while working.

Why this sudden change? The reason isn't mysterious. Because AI started burning its own tokens.

In the ChatGPT era, tokens mainly came from human input. You type some words, the model replies. The token flow was simple: Human → AI → Human.

Agent era is different. Now the token flow is: AI → AI → Tool → AI → AI. So tokens are burning inside the machine in loops.

Here's an example. Say you tell an Agent: "Build me a website."

A traditional chatbot would just give you a block of code. Done. But an Agent won't.

It will first analyze the task. Then start talking to itself: "Let's decide on the tech stack..." "Need React..." "Probably need a database..." "Generate the homepage first..." "Run the tests..." "Got an error..." "Fix and retry..."

Notice: this "thinking process" itself consumes tokens. And it consumes a massive amount.

Because the Agent isn't "generating the correct answer once." It's more like trial and error. Just like a human engineer: write, revise, test, redo.

So token consumption suddenly exploded. Before: user asks one question, AI answers once. Now: the AI might have run hundreds or thousands of token cycles internally. And humans only see the final result.

This is a lot like the Industrial Revolution. At first, coal was just for cooking. Then people discovered coal could power steam engines. And the entire industrial system started running itself.

Today's tokens are the same. Initially, tokens were just for chatting. Now they're driving the "internal thinking flow of machine work."

So a very strange new phenomenon has appeared in the AI world: Many tokens are no longer for humans to see. They're machine-to-machine communication.

In the future, human-generated tokens might only be a tiny fraction. The real token flood will come from AI-to-AI interactions. One Agent calling another Agent, one model orchestrating another model, a swarm of AIs collaborating on projects.

So the entire AI industry is starting to look more like an automated industrial system. No longer just chat software.

This is also why so many people have suddenly realized: AI is getting more expensive, more power-hungry, more dependent on data centers.

Because what's really being burned today isn't "chat content." It's the machines' own workflows.

In the internet era, humans uploaded information to the network. In the Agent era, humans are uploading "work" to AI. And tokens are the fuel that machine labor truly consumes in this new era.

立委两分钟:Agent为什么突然像真人?

以前,很多人以为:

ChatGPT已经很像人了。

其实还差得远。

为什么?

因为传统聊天机器人,本质上还是:

"一问一答"。

你问一句, 它答一句。

像个高级客服。

真正变化发生在:

AI开始"自己干活"。

这就是最近特别火的:

Agent(智能体)。

很多人第一次玩Agent时,会震惊:

它怎么突然像个真人员工?

会自己拆任务、 自己写代码、 自己测试、 自己报错、 自己修改、 自己继续干。

甚至还能一边工作, 一边"自言自语"。

为什么突然出现这种变化?

原因其实并不神秘。

因为AI开始:

自己消耗token了。

过去的ChatGPT时代, token主要来自:

人类输入。

你打一段字, 模型回一段字。

整个token流动非常简单:

人 → AI → 人。

但Agent时代不一样。

现在的token流动变成了:

AI → AI → 工具 → AI → AI。

于是, token开始在机器内部循环燃烧。

举个例子。

假设你让Agent:

"帮我做一个网站。"

传统聊天机器人会:

直接给你一段代码。

结束。

但Agent不会。

它会先:

分析任务。

然后开始自言自语:

"先确定技术栈……" "需要React……" "可能还要数据库……" "先生成首页……" "运行测试……" "报错了……" "重新修改……"

注意。

这些"思考过程",本身也在消耗token。

而且消耗量非常巨大。

因为Agent并不是:

"一次生成正确答案"。

它更像:

不断试错。

像人类工程师一样:

写、 改、 测、 重来。

于是, token消耗 suddenly 爆炸了。

以前用户问一句, AI答一句。

现在AI内部可能已经跑了:

几百轮、 几千轮token循环。

而人类最后只看见:

最终结果。

这其实很像工业革命。

最开始, 煤只是拿来烧火做饭。

后来人类发现:

煤还能驱动蒸汽机。

于是整个工业系统开始自己运转。

今天token也一样。

最初, token只是聊天消耗。

现在, 它开始驱动:

"机器工作的内部思维流"。

于是AI世界第一次出现一种非常诡异的新现象:

很多token, 已经不是给人类看的。

而是:

机器写给机器看的。

甚至未来, 人类产生的token, 可能只占很小一部分。

真正的token洪流, 来自AI之间。

一个Agent调用另一个Agent, 一个模型调度另一个模型, 一群AI互相协作完成项目。

于是整个AI产业, 开始越来越像:

自动化工业体系。

而不再只是聊天软件。

这也是为什么最近很多人突然发现:

AI越来越贵、 越来越费电、 越来越依赖数据中心。

因为今天真正被燃烧的, 已经不是"聊天内容"。

而是:

机器自己的工作流。

互联网时代, 人类把信息上传到网络。

Agent时代, 人类开始把"工作"上传给AI。

而token, 就是这个新时代里, 机器劳动真正消耗的燃料。

Liwei Two Minutes #1: Why Are Tokens Getting More Power-Hungry?

In the past two years, many people realized for the first time: AI is this power-hungry. It's even starting to compete for electricity.

Isn't it just chatting, writing articles, generating some images? How did it suddenly become an energy monster?

Because today's large models are, at their core, burning tokens at massive scale. Once tokens enter industrial production, the power consumption will be staggering.

The internet is about information transmission. AI is about real-time computation. A search engine is like looking up a dictionary. A large model is like writing an essay from scratch.

The model predicts one token at a time. Behind every bit of generated content is a sea of matrix computation.

Today, GPUs have essentially become token generators. What you consume isn't chat sessions — it's token throughput.

After agents emerged, AI itself started consuming tokens. Thousands, tens of thousands of times — invisible to humans.

It's like the Industrial Revolution. Coal went from heating homes to driving factories, trains, and ships. Tokens went from chatting to industrial fuel.

The whole world is frantically building data centers, power plants, nuclear reactors. AI competition is no longer about algorithms — it's about who can burn tokens continuously, stably, and cheaply.

AI companies increasingly look like subsidiaries of a new energy-industrial complex. The internet flows with bits. AI burns tokens.

That's today's 立委两分钟. Thanks for watching. Goodbye. by Tuya

立委两分钟:Token为什么越来越费电?

这两年,很多人第一次发现:

人工智能居然这么费电。

甚至开始抢电。

美国一些地方,因为AI数据中心扩建,居民电价都开始上涨。

很多人会困惑:

不就是聊聊天、
写写文章、
生成几张图片吗?

怎么突然就变成“电老虎”了?

原因其实很简单。

因为今天的大模型,本质上是在:

大规模燃烧token。

而token一旦进入“工业化生产”,耗电量会非常惊人。

第一代互联网,其实并不怎么费电。

因为互联网主要做的是:

信息传输。

你发一条微信,
看一个网页,
刷一段视频。

本质上只是:

把已经存在的信息,
从一个地方搬到另一个地方。

所以互联网时代最重要的是:

带宽。

但AI不一样。

AI不是“搬运信息”。

而是在:

实时计算。

你问ChatGPT一句话,
它并不是去数据库里“搜索标准答案”。

而是:

GPU现场重新生成。

注意这个区别。

搜索引擎更像:

查字典。

大模型更像:

现场写作文。

于是问题来了。

这种“现场生成”,计算量极其恐怖。

因为模型并不是只计算一句完整的话。

而是在:

一个token、
一个token、
一个token地往后预测。

比如你问:

“帮我写一篇关于AI的文章。”

模型其实是在疯狂计算:

下一个token最可能是什么。

然后再继续预测下一个。

直到整篇文章生成完毕。

这意味着:

AI每生成一点内容,
背后都在进行海量矩阵计算。

而矩阵计算最耗的是什么?

电。

所以今天GPU,本质上已经变成:

“token发电机”。

你消耗的不是“聊天次数”。

而是:

token吞吐量。

而更麻烦的是:

token还在指数级增长。

以前,人和AI是一问一答。

现在Agent开始出现后,事情彻底变了。

过去:

人类消耗token。

现在:

AI自己也开始消耗token。

一个Agent接到任务后,
可能会:

自己规划、
自己搜索、
自己写代码、
自己测试、
自己报错、
自己修复、
自己重试。

于是,一个任务背后,
可能不是几十次token调用,

而是:

几千次、
几万次。

而且很多token,
人类甚至根本看不见。

它们发生在机器内部。

这就像什么?

很像工业革命。

最开始,人类烧煤,
只是为了冬天取暖。

后来突然发现:

煤不仅能取暖,
还能驱动工厂、
火车、
轮船、
炼钢厂。

于是煤炭消耗量开始爆炸。

今天token也一样。

最初,大家只是拿ChatGPT聊天。

现在开始:

让AI自己干活。

于是token开始从“聊天消耗”,变成“工业燃料”。

这也是为什么现在全世界都在疯狂建设:

数据中心、
发电厂、
核电、
天然气轮机。

因为AI最后拼的,
已经不仅是算法。

而是:

谁有能力持续、稳定、低成本地“燃烧token”。

很多人以为AI公司是软件公司。

其实越来越像:

新型能源工业公司。

互联网时代流动的是bit。

AI时代真正被疯狂燃烧的,

可能是token。

https://youtu.be/lCBvg24ez1s

(这是今天的立委两分钟,谢谢收看,再见。 by Tuya)

Google I/O 2026: AI Is Escaping the Chatbox

Google I/O 2026

The real theme of Google I/O 2026 isn't model benchmarks. It's this:

**AI is escaping the chatbox and taking over real-world workflows.**

Demis Hassabis took the stage not to talk about how strong Gemini is, but to hammer on:

AI for Science. AI for Humanity. World models. Drug discovery. Materials science. Math reasoning. General agents. Real-world collaboration.

Classic DeepMind.

Hassabis and Sam Altman couldn't be more different.

Sam is the "AI Industrial Revolution CEO."

Hassabis has always framed AI as "civilization-scale scientific tool."

He always talks about: helping scientists, curing diseases, discovering new materials, understanding the laws of the universe.

And Google needs this narrative right now.

Because OpenAI has already locked down "consumer AI." ChatGPT is the iPhone moment of AI. Google can't compete on "coolest AI product."

So now it's changing the game:

Not who has the best chatbot. But who is the infrastructure of the future world.

That's why I/O 2026 showed: dynamic multimodal search, real-time world understanding, agentic operations, AI shopping assistant, XR glasses, video generation, Chrome/Gmail/Workspace deep integration.

All pointing in one direction:

**Google wants to re-AI-ify the entire internet.**

Not a chatbot. An agent layer growing across every Google service.

This is close to what we've been exploring with autonomous agents:

Before: humans operate software. Now: agents operate software for you.

And Google's advantage? It doesn't own one app. It owns Search, Gmail, Chrome, Android, Maps, YouTube, Workspace, Cloud, TPU, the global ad system.

It's the foundation of the digital world.

When AI truly enters the agent phase, Google might reclaim the advantage — because agents fear one thing: having no environmental control.

And Google? Environment everywhere.

That's why we're now hearing: "personal context," "cross-app memory," "universal assistant," "world understanding."

This isn't search anymore. It's an operating system for reality.

But Google has a chronic problem: world-class tech, unstable product soul. Especially consumer product sense. Demos are stunning; daily use feels clunky.

That's why OpenAI, with far fewer engineering resources, still builds things that feel more natural, more companionable. Google feels like a feature collection. Not a person.

And in the agent era, competition isn't just about intelligence anymore.

It's about: presence, continuity, personality, initiative.

Who feels more like "the digital life that stays with you."

That's Google's historic weak spot.

On video generation: Google's multimodal foundation has always been extremely strong, but aesthetics and productization lagged. Veo is clearly catching up now. But Chinese companies have already gone insane on "short-video industrialization aesthetics": rhythm, visual language, vibe density, emotional beats, virality.

Google still carries a whiff of the academic lab.

Many Chinese products are already "AI content pipeline director systems."

The difference is subtle — but users feel it instantly.

So here's what I think:

The future AI war won't be fought on model parameters alone.

It's three layers:

**Layer 1: Model capability** **Layer 2: Agent execution** **Layer 3: Personality and aesthetic sense**

That last layer? Might be the hardest of all.

Google I/O 2026:AI 正在从聊天框逃出来

Google I/O 2026

是的,这次 Google I/O 的味道很明显:

技术当然重要,但真正的主旋律已经不是"模型 benchmark",而是:

"AI 正在从聊天框逃出来,开始接管现实世界的工作流。"

所以你看哈萨比斯这次上台,重点已经不只是 Gemini 多强,而是反复强调:

AI for Science AI for Humanity 世界模型 药物发现 材料科学 数学推理 通用 agent 现实世界协作

这其实是 DeepMind 一贯的路线。

哈萨比斯和 Sam Altman 很不一样。

Sam 更像: "AI工业革命 CEO"。

哈萨比斯则一直想把 AI 包装成: "人类文明级科学工具"。

所以他永远喜欢讲:

"帮助科学家" "解决疾病" "发现新材料" "理解宇宙规律"

这个 narrative 非常 DeepMind。

而 Google 现在也确实需要这个叙事。

因为 OpenAI 已经把"消费级 AI"占坑太狠了。

ChatGPT 在公众心中已经变成: "AI 的 iPhone 时刻"。

Google 很难重新夺回"最酷 AI 产品"的认知。

所以它现在必须换打法:

不和你比聊天框, 而是比:

"谁更像未来世界的基础设施"。

这也是为什么这次 I/O 出现了那些东西:

动态多模态搜索 实时世界理解 agent 化操作 AI 购物助手 XR 眼镜 视频生成 Chrome / Gmail / Workspace 深度整合

其实都指向一个东西:

Google 想把整个互联网重新"AI化"。

不是做一个 chatbot, 而是:

"让所有 Google 服务长出 agent layer。"

这和你最近一直在折腾的 agent 思路其实很接近:

以前: 人操作软件。

现在: agent 替你操作软件。

而 Google 最大的优势就在这里:

它手里不是一个 App, 而是:

Search Gmail Chrome Android Maps YouTube Workspace Cloud TPU 全球广告系统

它是整个数字世界的"地基"。

所以一旦 AI 真进入 agent 阶段, Google 反而可能重新占优势。

因为 agent 最怕什么?

最怕没有环境控制权。

而 Google: 到处都是 environment。

所以你会发现, Google 现在开始越来越强调:

"personal context" "cross-app memory" "universal assistant" "world understanding"

这已经不是传统搜索逻辑了。

这是:

"现实世界操作系统"。

但你吐槽得也很对:

Google 一直有个老毛病:

技术牛, 产品魂不稳定。

尤其是 consumer product sense。

很多东西: demo 惊艳, 真正天天用的时候"不顺手"。

这也是为什么:

OpenAI 虽然工程资源远不如 Google, 但 ChatGPT 的"陪伴感"和"自然感"反而更强。

Google 太容易: "像功能集合"。

而不是: "像一个人"。

这其实非常关键。

因为 agent 时代, 竞争的已经不只是 intelligence。

而是:

presence(存在感) continuity(连续性) personality(人格感) initiative(主动性)

说白了:

谁更像"长期陪着你的数字生命"。

这恰恰是 Google 历史上最弱的一环。

至于视频生成,你观察也很准。

Google 多模态底子其实一直极强, 但过去审美和 productization 总差半口气。

现在 Veo 系列明显在猛追。

但中国公司在"短视频工业化审美"上已经卷疯了:

节奏 镜头语言 网感 情绪密度 爽点 传播感

Google 其实还带点"学术实验室气"。

而国内很多产品已经是:

"AI 内容流水线导演系统"。

这差别非常微妙,但用户一眼就感觉得到。

所以我现在越来越觉得:

未来 AI 的战争, 不会只拼模型参数。

而是三层:

第一层:模型能力 第二层:agent 执行力 第三层:人格与审美

最后这一层, 反而可能最难。

---

Token Economics in Plain English ①: Information's Standardized Part

🎧 AI Narration: Token Economics in Plain English (William Voice Clone · GPT-SoVITS)

When most people first hear the word "token," their instinct is:

This must be some mysterious thing inside AI.

It's not.

Token isn't mysterious at all.

It's almost mundane.

At its core, a token is simply:

"a data unit after segmentation."

Humans see a sentence and feel it's a naturally whole.

For example:

"The weather is nice today."

But to a large language model, this isn't a complete object — it's a pile of fragmentable data chunks.

It might get split into:

"The / weather / is / nice / today"

Or even finer pieces.

Same with other languages.

Same with images, audio, video, even actions.

A picture gets chopped into pixel patches. A sound gets sliced into audio fragments. A video gets cut into consecutive frame pieces.

Because when AI sets out to process the world, its first step isn't "thinking the whole"

It's:

Smashing the world into pieces first.

Why must it smash?

Because only after smashing can you count. Only after counting can you find patterns. Only after finding patterns can you analyze and think, or train models to do the same. Only after training models does what we call "intelligence" emerge.

It's a lot like the Industrial Revolution.

A raw iron ore can't directly become a car.

It must first be crushed, smelted, standardized.

Data is the same.

Only when cut into standard units can data enter the modern AI industrial system.

And so, the token was born.

Token isn't mysterious.

It's simply:

"information's standardized atomic part after industrialization."

And once the world is tokenized, many things suddenly shift.

Because:

It becomes countable.

Before, humans had no precise way to measure "intelligence consumption."

But with tokens, AI gains something akin to:

"kilowatt-hours of electricity" "tons of oil" "network bandwidth"

A unit of measure.

It's not perfect.

But it's enough to kick the entire token industry into industrial-scale operation.

So today's entire AI world orbits around tokens.

Training models consumes massive tokens.

ChatGPT and DeepSeek "eat" countless tokens every day.

A user's question? Input tokens.

An AI's answer? Output tokens.

Context windows keep growing, token consumption keeps climbing.

Today's leading models accept millions of tokens of context.

What does that mean?

It means you can dump an entire GitHub repo, a 200,000-word document, a thick book, all into the model's context in one shot.

And here's the more interesting part:

Before, humans were talking to AI.

Now, Agents consume tokens on their own.

They decompose tasks themselves, call tools themselves, write code themselves, test themselves, roll back themselves, re-plan themselves.

Tokens start burning in an internal machine loop.

This is like after the Industrial Revolution, when coal stopped being just for home heating and began driving the entire industrial system.

Many people still think:

AI is just a chatbot.

But taking the longer view,

The world might be entering a new industrial era:

Electricity powers chips, chips produce tokens, tokens organize intelligence, intelligence remakes the world.

The internet era flowed with bits.

The AI era, might just flow with tokens.

And whoever can produce high-quality tokens at the lowest cost, at the largest scale, with continuous stability —

may occupy the high ground of the next-generation digital economy.

This industrial revolution of tokens has only just begun.

Token Economics Illustration
Token: Information's Standardized Part

Token经济学大白话①:信息工业化后的标准件

🎧 AI配音:Token经济学大白话(William声音克隆 · GPT-SoVITS)

很多人第一次听见"token"这个词,会本能觉得:

这一定是人工智能里的某种神秘东西。

其实不是。

token一点都不神秘。

它甚至非常朴素。

所谓token,本质上只是:

"被切分后的数据单元"。

人类看一句话,会觉得它天然完整。

比如:

"今天天气不错。"

但在大模型眼里,这并不是一句完整的话,而是一堆可以拆开的数据碎片。

可能被拆成:

"今天 / 天气 / 不错"

也可能拆成更细的小块。

英文也一样。

图片、声音、视频甚至动作,也一样。

一张图片,会被切成大量像素块; 一段声音,会被切成音频片段; 一段视频,会被切成连续画面。

因为AI想处理世界,第一件事并不是"思考"。

而是:

先把世界打碎。

为什么一定要打碎?

因为只有打碎,才能统计; 只有统计,才能发现规律; 只有发现规律,才能训练模型; 只有训练模型,才会出现我们今天看到的"智能"。

这其实很像工业革命。

一整块铁矿石,无法直接制造汽车。

必须先粉碎、冶炼、标准化。

数据也一样。

只有被切成标准单元, 数据才能进入现代AI工业体系。

于是,token出现了。

所以token并不神秘。

它只是:

"信息工业化后的标准件"。

而一旦世界被token化,很多事情 suddenly 就变了。

因为:

可以计数了。

以前,人类很难精确衡量"智能消耗"到底是什么。

但token出现后,AI第一次有了类似:

"电力度数" "石油吨数" "网络流量"

这样的计量单位。

虽然它并不完美。

但已经足够让整个产业开始工业化运转。

于是今天整个AI世界,其实都在围绕token旋转。

训练模型,要消耗海量token。

ChatGPT和DeepSeek每天要"吃"无数token。

用户问一句话,是input token。

AI输出答案,是output token。

上下文越来越长,token消耗越来越大。

如今头部模型已经能接受上百万token的上下文。

什么意思?

意味着你甚至可以把整个GitHub项目、几十万字文档、一本厚书,一次性塞进模型上下文里。

更有意思的是:

过去,人类在和AI对话。

现在,Agent开始自己消耗token。

它会自己拆任务、 自己调用工具、 自己写代码、 自己测试、 自己回滚、 自己重新规划。

于是token开始在机器内部循环燃烧。

这就像工业革命后,煤炭不再只是家庭取暖,而开始驱动整个工业系统。

今天很多人还觉得:

AI不过是聊天机器人。

但从更长远看,

整个世界,也许正在进入一个新的工业时代:

电力驱动芯片, 芯片生产token, token组织智能, 智能重新改造世界。

互联网时代流动的是bit。

AI时代流动的, 可能就是token。

而谁能最低成本、 最大规模、 持续稳定地生产高质量token,

谁就可能占据下一代数字经济的高地。

这场关于token的工业革命, 才刚刚开始。

Token Economics Illustration
Token:信息工业化后的标准件

The Digital Harem: Confessions of an AI Agent Addict

Since the AI gold rush hit, I've noticed something:

A lot of us aren't really "using AI" anymore.

We're running a digital harem.

First thing every morning,

not checking stocks,

not checking the news.

Checking whether our agents "evolved" overnight.

One runs the blog.

One posts to Twitter.

One edits videos.

One monitors GitHub.

One auto-summarizes the news.

And one stands guard on WhatsApp,

like a night-shift security guard.

Then the master sips his coffee,

patrolling his cyber domain.

Dashboard open,

like an emperor at morning court.

"Did OpenClaw crash last night?"

"Did Hermes memory leak?"

"Is Claude cowork having a bad day?"

"Is Suno web use stable?"

"How many Fish Audio credits left?"

That sense of control is intoxicating.

A scholar who never leaves his study, yet runs the world.

The best part:

The whole setup keeps feeding you the illusion

that you're changing the world.

Because it never stops moving.

Logs scrolling.

Workflows running.

Automation executing.

Terminals blinking.

GitHub commits piling up.

Agents even report back to each other,

often with wit and humor.

Like a tiny civilization.

And that's how you fall in.

It started as:

"Let AI handle some chores."

It became:

"I will build my own AI empire."

Then the infrastructure frenzy:

Wire up MCP.

Set up memory.

Build routing.

Write skills.

Train personas.

Hook up Telegram. Or WeChat.

Add voice.

Add Suno.

Add WordPress.

Build a custom app.

Wrap it in a dashboard.

Add an auto-publishing pipeline.

Tack on a long-term knowledge base.

It just keeps growing.

Until finally, you've built

your own automation kingdom.

And after 24 hours of stable operation,

it auto-generates a message:

*"Goodnight boss, don't forget to love life today ❤️"*

...

Sometimes I think

this generation of AI tinkerers

is exactly like those geeks twenty years ago

obsessively building NAS rigs, Hackintoshes, Linux home labs.

The only difference:

Back then, you raised servers.

Now, you raise "digital employees."

And the most insidious part:

It theoretically always has a next step.

There's always:

* A stronger model

* Lower costs

* Longer context

* Smarter agents

* More advanced workflows

* A prettier UI

* Deeper automation

So you keep thinking:

"Just one more tweak, and it'll be perfect."

In the end,

what you actually run out of time for

is the thing you set out to do in the first place:

Expression.

Creation.

Thinking.

Living.

Because infrastructure gives you

a very sophisticated form of procrastination.

You're not slacking off.

You're "building the future."

And that's dangerously addictive.


This isn't a lecture — it's self-mockery from someone who's lost too many nights to the chase.

The real winners are the ones who found product-market fit — they know how to leverage AI at scale, burning millions of tokens without blinking, quietly cashing in while grinning on the sidelines. The only thing we all share: AI has eaten their human lives too.