Research

Import AI 460: Reward hacking society, RSI data from Anthropic; and RL-based quadcopter racing

Jack Clark's latest analysis explores how reward hacking mechanisms observed in AI systems may have parallels in human society, drawing connections between algorithmic optimization and social behaviors. The newsletter also covers Anthropic's new recursive self-improvement data and advances in reinforcement learning applications.

Read full story at Import AI →V: · A: · D:

Research

Import AI 455: Automating AI Research

AI systems are approaching the capability to conduct their own research and development, fundamentally changing how AI p...

Research

Award-Winning Researcher Trains Robots to Make Educated Guesses

University of Virginia researcher Yen-Ling Kuo received IEEE's inaugural Outstanding Women in Robotics award for develop...

Research

Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

Researchers introduce Pythagoras-Prover, a compute-efficient family of Lean theorem provers that achieves strong perform...