@AnthropicAI:
New Anthropic Fellows research: How does misalignment scale with model intelligence and task complexity? When advanced AI fails, will it do so by pursuing the wrong goals? Or will it fail unpredictably and incoherently, like a "hot mess"? Read more: https://t.co/xzRSoJg43j