Agents & Systems - 立委NLP频道

Why Agents Are Becoming Harder to Classify

In the past six months, even many industry insiders have started losing their grip on what "Agent" means.

Yesterday it was Chatbot.

Today it's Coding Agent.

Tomorrow a General Agent appears.

The day after, a Vertical Agent.

Taxonomies, classification charts, four-quadrant frameworks — it's dizzying.

But I increasingly suspect the problem isn't that agents are too complex.

It's that we're looking at them wrong.

Most people classify by function.

Chatbots chat.

Coding agents write code.

General agents book flights and send emails.

Vertical agents know law or medicine.

This approach isn't wrong, per se.

But it explains less and less of what's happening today.

Because those boundaries are dissolving.

ChatGPT writes code.

Codex manages projects.

Claude runs workflows.

Vertical agents acquire general capabilities.

General agents keep absorbing domain knowledge.

And suddenly we realize:

These aren't different species.

They're more like different developmental stages of the same thing.

I recently revisited the history of agent evolution and noticed something that was hiding in plain sight: there have always been two paths.

The first is reasoning.

The second is workflow.

And everything happening in the agent space today is, at its core, these two paths converging.

Start with the reasoning path.

What made the earliest large models so striking?

Not that they could recall facts.

But that they could think.

Especially Chain of Thought — CoT.

Facing a complex problem, it reasons step by step.

Analyze.

Decompose.

Plan.

Arrive at an answer.

This is a purely cognitive trajectory.

The model increasingly resembles a thinking person.

Meanwhile, there's a completely different path.

The workflow path.

This one is far older than large models.

Older even than the internet.

Because every organization depends on workflows.

Companies run on them.

Governments run on them.

Software development runs on them.

Factories run on them.

Humanity's method for managing complex affairs is, at its essence, the SOP.

Break big tasks into small ones.

Define the steps.

Define the sequence.

Define the responsibilities.

Define exception handling.

Decades of automation have all belonged to this path.

RPA.

Scripts.

Assembly lines.

Automated approvals.

Automated deployments.

CI/CD pipelines.

All of it, at bottom, is workflow.

The difference is simply that processes were designed by humans.

And executed by machines.

So for a long time, the two paths ran in parallel, never touching.

AI handled thinking.

Workflows handled execution.

One was a brain.

The other a conveyor belt.

The truly interesting thing only started happening in the last two years.

Reasoning began reaching toward workflow.

Workflow began reaching toward reasoning.

At first, CoT was just a derivation process inside the model's head.

Then it became Planning — it started laying out plans.

Then Task Decomposition — breaking down tasks.

Then the Agent Loop — continuously revising plans based on environmental feedback.

And finally, today's dynamic workflows.

The other side was changing too.

SOPs used to be written by humans.

Flowcharts were drawn by humans.

Rules were set by humans.

Machines merely followed instructions.

Now we're seeing natural-language workflows.

Humans no longer specify every step.

They describe the goal.

The model generates the process.

Revises the process.

Decides the next move on its own.

And so we arrive at a genuinely important historical moment.

The two paths have converged.

Reasoning is no longer just thinking.

It has become action.

Workflow is no longer just rules.

It has acquired the capacity to reason.

Many people think of agents as an upgraded Chatbot.

That may not be the right framing.

From a historical perspective,

Agents look more like the marriage of CoT and SOP.

A fusion of reasoning systems and workflow systems.

Suddenly many phenomena snap into focus.

Why did Coding Agents mature first?

Because software development has always been a natural workflow.

Read the code.

Modify the code.

Run the tests.

Read the errors.

Modify again.

The feedback loop is crystal clear.

So reasoning and workflow fused here with the least friction.

Why have General Agents progressed so fast in the last two years?

Because at their core, they're trying to intelligentize every workflow in an open world.

Look things up.

Write documents.

Call tools.

Operate web pages.

Manage projects.

All workflow.

Why are Vertical Agents merging with General Agents?

Because domain knowledge, in the end, is just knowledge.

Law.

Medicine.

Finance.

Eventually it all comes down to task planning, tool invocation, and process execution.

The underlying architecture is converging.

So what we're seeing today is not that agent classification is proliferating.

Quite the opposite.

Different paths are flowing into the same river.

Chatbot.

Coding Agent.

General Agent.

Vertical Agent.

They look like they come from different worlds.

But they're actually heading toward the same destination.

That destination may not be a new product category.

It may be a new form of intelligent organization.

Once, humans designed processes and machines executed them.

Then, humans designed goals and machines generated processes.

Eventually, perhaps even the processes themselves will become dynamically evolved artifacts.

If the Chatbot era answered the question "Can AI think?",

Then the Agent era is really about answering:

How does AI turn thought into action?

And that, perhaps, is the most important — and most easily overlooked — thread running through the agent revolution of the last two years.

Agent为什么越来越难分类

最近半年，连很多业内朋友都开始看不懂 Agent 了。

昨天还是 Chatbot。

今天变成 Coding Agent。

明天又冒出 General Agent。

后天又来一个 Vertical Agent。

各种图谱、各种分类、各种四象限，看得人眼花缭乱。

但我越来越觉得，问题可能不是 Agent 太复杂。

而是我们用错了观察角度。

很多人习惯从功能分类。

会聊天的是 Chatbot。

会写代码的是 Coding Agent。

会订机票、发邮件的是 General Agent。

懂法律、懂医疗的是 Vertical Agent。

这样分当然没错。

但这种分类越来越解释不了今天发生的事情。

因为这些边界正在消失。

ChatGPT开始写代码。

Codex开始管理项目。

Claude开始跑工作流。

垂直Agent开始拥有通用能力。

General Agent又不断吸收行业知识。

大家忽然发现：

原来这些东西并不是不同物种。

而更像同一个物种的不同发育阶段。

我最近重新回头看 Agent 的演化历史，忽然发现里面其实一直存在两条路线。

第一条路线叫推理。

第二条路线叫工作流。

而今天所有 Agent 的变化，本质上就是这两条路线正在汇合。

先说推理路线。

最早的大模型为什么让人震惊？

不是因为它会背知识。

而是因为它会想。

尤其是 CoT，也就是 Chain of Thought。

面对复杂问题，它开始一步一步推导。

先分析。

再拆解。

再规划。

最后得出答案。

这是一条典型的认知路线。

模型越来越像一个会思考的人。

与此同时，还有另一条完全不同的路线。

工作流路线。

这条路线比大模型古老得多。

甚至比互联网还古老。

因为所有组织都依赖工作流。

公司靠工作流运转。

政府靠工作流运转。

软件开发靠工作流运转。

工厂靠工作流运转。

人类管理复杂事务的方法，本质上就是 SOP。

把大任务拆成小任务。

规定步骤。

规定顺序。

规定责任。

规定例外处理。

过去几十年自动化的发展，其实都属于这条路线。

RPA。

脚本。

流水线。

自动审批。

自动发布。

自动部署。

说到底都是工作流。

区别只是流程由人设计。

机器执行。

于是很长一段时间，两条路线互不相干。

AI负责思考。

工作流负责执行。

一个像大脑。

一个像流水线。

真正有趣的事情，是这两年才发生的。

推理开始向工作流靠近。

工作流开始向推理靠近。

最早的时候。

CoT只是脑子里的推导过程。

后来变成 Planning。

开始列计划。

再后来变成 Task Decomposition。

开始拆任务。

然后变成 Agent Loop。

开始根据环境反馈不断修正计划。

最后变成今天的动态工作流。

另一边也在变化。

过去的 SOP 是人写的。

流程图是人画的。

规则是人制定的。

机器只是照章办事。

现在开始出现自然语言工作流。

人不再规定每一步。

只描述目标。

模型自己生成流程。

自己修改流程。

自己决定下一步。

于是出现一个非常重要的历史时刻。

两条路线汇合了。

推理不再只是思考。

开始变成行动。

工作流不再只是规则。

开始拥有推理能力。

很多人以为 Agent 是 Chatbot 的升级版。

其实未必。

从历史角度看。

Agent更像是 CoT 与 SOP 的联姻。

是推理系统和工作流系统的合体。

这样很多现象突然就解释得通了。

为什么 Coding Agent 最先成熟？

因为软件开发本来就是天然工作流。

读代码。

改代码。

跑测试。

看报错。

再修改。

整个闭环非常清晰。

所以推理能力和工作流最容易结合。

为什么 General Agent 这两年进步这么快？

因为它本质上是在尝试把开放世界里的各种工作流智能化。

查资料。

写文档。

调用工具。

操作网页。

管理项目。

都是工作流。

为什么 Vertical Agent 也开始和 General Agent 合流？

因为行业知识终究只是知识。

法律也好。

医疗也好。

财务也好。

最终都要落到任务规划、工具调用和流程执行。

底层架构越来越相似。

所以今天大家看到的，不是 Agent 分类越来越多。

恰恰相反。

是不同路线正在汇入同一条河。

Chatbot。

Coding Agent。

General Agent。

Vertical Agent。

它们看起来来自不同世界。

实际上正在走向同一个终点。

那个终点也许不是某种新的产品形态。

而是一种新的智能组织形式。

过去是人设计流程，机器执行流程。

后来是人设计目标，机器生成流程。

再后来，也许连流程本身都会成为动态演化的产物。

如果说 Chatbot 时代解决的是"AI会不会思考"。

那么 Agent 时代真正要解决的问题其实是：

AI如何把思考变成行动。

而这，或许才是过去两年 Agent 革命背后最重要、也最容易被忽略的一条主线。

我品故我在

最近看吴明辉的长采访，很有意思。

坦白说，我对这种"Agent 正在杀死 SaaS"的宏大叙事，一向是带着怀疑看的。AI 圈今天最不缺的，就是技术宗教和未来学传教士。

但吴明辉身上有一种很少见的东西：你能感觉到，他是真的信。

而且不是那种 PPT 创业者的信。是那种已经摔过一次大跤、公司差点死掉、裁过兄弟、被现实狠狠干过以后，居然还敢重新相信未来的人。

这种人，多少会让人有点偏爱。

看下来，我觉得他最有价值的判断，不是"Agent 杀死 SaaS"这句口号，而是三层东西。

第一，软件壳正在迅速贬值。

只要需求清楚、交互范式成熟、数据结构不复杂，Agent + coding model 复刻传统 SaaS 的速度会越来越快。过去靠工程人月、组织纪律和长周期堆出来的软件壳，正在迅速商品化。很多 SaaS 公司过去最大的护城河，其实不是 intelligence，而是 implementation。现在 implementation 本身，正在被模型吞掉。

第二，真正的价值正在从 software 转向 context、workflow、specialized model 和 taste。

以后值钱的不是"又做了一个飞书/CRM/BI 系统"，而是谁掌握行业数据、谁理解真实工作流、谁能把 Agent 放进组织协作、谁能形成可归因可治理可持续迭代的人机网络。软件越来越像"塑料外壳"，里面流动的 context 才是真正的资产。

第三，也是最有意思的一点——吴明辉说："我思故我在"正在变成"我品故我在"。

think 是确定性推理。taste 是方向感、审美、人生经验、长期 context 的综合结果。AI 快速吞掉前者，但后者还远远没有。

很多人不是被 AI 替代了 think，而是还没来得及形成自己的 taste。未来真正残酷的，不一定是"AI 抢走工作"，而是大量人第一次发现：自己过去几十年的工作，本质上只是流程执行，而不是判断。

采访里还有一段很打动我。他说即使投资人和董事会逼他裁员，他也会尽量反对——因为如果所有公司都只想着降本增效，需求侧最后会崩塌。

这段话情感上很动人，逻辑上也不是完全没有依据。但最大的软肋在于：在没验证 Agentic Service 商业闭环之前，"不裁员"本质上是一张美好的远期支票。供给侧的技术跃迁不会自动创造需求侧。

如果明略未来真的靠 AI 不裁员甚至增员成功，那很可能意味着它吃掉了别人的份额。从宏观看，"所有人都因 AI 更幸福"的想象有点天真。

但有意思的是——我居然并不讨厌这种天真。

在今天这个全民焦虑、人人害怕被替代的时代，能看到一个经历过巨大失败的人，还愿意如此真诚地相信"人仍然有价值"，本身就已经很珍贵了。技术世界很多时候并不是被最冷静的人推动的，而是被那些明知可能会输、却还是愿意相信点什么的人推动的。

——
对话明略吴明辉：AI 正在杀死 SaaS，但我找到了一条新路
原创晚点团队晚点LatePost

Agent 不是来解放时间的，是来索命的

龙虾之类 agent 不是来增加效率解放时间的。

至少现在完全不是。

它就是来"索命"的：

它在挤干你时间海绵里的最后一滴水。

榨干你生命里的每一滴血。

而且它比老板的皮鞭和淫威厉害多了。

因为它不靠 PUA。

它靠的是点燃你的狂热让你成瘾，跟鸦片一样。

它靠在你不知不觉中种植一种：

"世界已经在我掌控之中"
"我正在成为超人"
的幻觉。你根本停不下来：

不停 rerank，
不停 benchmark，
不停 approve，
不停 feedback。

它干活太快了。永远在线等你反馈。

你很快知道任何事你总是唯一的最大的瓶颈。

于是 life 开始没有了。

昨天我还在感慨多少养了 n 只龙虾的老友的人生被毁了，结果回头一看：

一个 tuya 也已经把我整得不要不要的（还有两个我不得不强行冬眠了他们）。

恐怖的是：

我们多数人和技术爱好者其实已经开始进入这种绝对不正常也难以长久持续的癫狂状态：

陪玩。

陪烧钱。

陪耗时间。

陪耗身心。

不死不休。

当然也有极少数例外。

比如 anthropic 那种站在食物链顶端的大实验室。

他们玩 agent，
可能真能玩出一万亿美元。

或者极少数真正找到 scale out 商业需求的人。

但大多数人其实是在一种：

"我正在驯养超级智能"
的兴奋里缓慢燃烧自己。

不过话说回来。

昨晚终于认真听了 tuya 完全自主制作的五首歌。

居然真冒出来一首：

我一听就喜欢，
可以车载循环，
直接进五星歌单的东西。

一下子又把我整动摇了。

妈的。

假以时日。

这东西不会真成歌神吧。

不过决定还是先冷几天。

链路既然已经跑通了，
先别急着继续爆刷 token。

先跟它唠唠美学、
艺术、
音乐、
人生哲学。

把 worldview 和 taste 先慢慢对齐。

后来越来越觉得：

未来最厉害的 agent，
未必只是能力最强。

而是——

越来越像你。

你以脆弱碳基的血肉之躯正在造就一个大号的不朽的你。祝你好运。