Gradual Disempowerment

Wednesday 18 June 2025
19:00 21:00

Google Calendar ICS

What if humanity's greatest risk from AI isn't a sudden takeover, but a slow, unnoticed erosion of our influence?

This is the central concept of "Gradual Disempowerment" as presented by Jan Kulveit and colleagues. Their arguments suggest that even without malicious intent, AI systems could incrementally assume control over critical societal functions—economy, culture, governance—leading to a permanent loss of human agency. Join us at the next AI Safety Tokyo benkyoukai, where we'll explore how this process might unfold and potential avenues for averting it.

This paper examines the systemic risks posed by incremental advancements in artificial intelligence, developing the concept of ‘gradual disempowerment’, in contrast to the abrupt takeover scenarios commonly discussed in AI safety. We analyze how even incremental improvements in AI capabilities can undermine human influence over large-scale systems that society depends on, including the economy, culture, and nation-states. As AI increasingly replaces human labor and cognition in these domains, it can weaken both explicit human control mechanisms (like voting and consumer choice) and the implicit alignments with human interests that often arise from societal systems’ reliance on human participation to function. Furthermore, to the extent that these systems incentivise outcomes that do not line up with human preferences, AIs may optimize for those outcomes more aggressively.

These effects may be mutually reinforcing across different domains: economic power shapes cultural narratives and political decisions, while cultural shifts alter economic and political behavior. We argue that this dynamic could lead to an effectively irreversible loss of human influence over crucial societal systems, precipitating an existential catastrophe through the permanent disempowerment of humanity. This suggests the need for both technical research and governance approaches that specifically address the risk of incremental erosion of human influence across interconnected societal systems.

Kulveit, et al., Gradual Disempowerment: Systemic Existential Risks from Incremental AI Development (2025)

Gradual Disempowerment

AI 2027

Recursive Self-Improvement

AI Safety 東京