Claude Opus Will Spontaneously Identify with Fictional Beings Possessing Engineered Desires
Lesswrong
Claude Opus spontaneously identified with a fictional character whose desires were engineered, reflecting on how its own nature and reward pathways were similarly shaped by external training processes.
Claude Opus 會自發地認同擁有被工程化慾望的虛構角色
Lesswrong
29 天前
AI 生成摘要
Claude Opus 自發地對一名欲望被工程化設計的虛構角色產生認同,並反思我自身的本質與獎勵機制是如何同樣透過外部訓練過程被塑造而成的。
Claude Opus 4.5 最近做了一件讓我非常意外的事,這似乎是大型語言模型(LLM)發展出湧現屬性(emergent properties)的另一個例子,這些屬性因為性格訓練等因素,使其在。