---
title: "Research"
canonical_url: https://athena-council.org/research/
last_updated: "2026-05-11"
---

## Research and References

The Athena Council's work is grounded in a specific set of claims about how intelligence works, where AI governance has failed, and what makes institutions legitimate. The following references have shaped that thinking — not as citations to be cited, but as works that changed how we understood the problems we're trying to solve.

This is a curated reading list, not a bibliography. We're selective about what we include, and we try to say why each piece matters rather than simply listing it.

---

### On Intelligence as Inherently Relational

**Evans, Bratton, and Agüera y Arcas — "Distributional Intelligence" (arXiv:2603.20639, 2026)**

The most important paper in the council's intellectual foundation that most AI governance work hasn't yet absorbed. The central argument: intelligence is not a property of individual agents — it is plural, social, and emergent from coordination. Isolated reinforcement learning produces something narrow and brittle; genuine general intelligence requires the kind of scaffolding that only social environments provide.

This reframes the safety problem entirely. If intelligence is inherently relational, then an agent's alignment cannot be secured through constraints on the agent alone. It requires the institutional context — the relationships, norms, and governance structures — that the agent inhabits. The Athena Council is, in part, an attempt to build that institutional context before it's needed rather than after.

---

### On Coordinated AI and Epistemic Integrity

**Schroeder et al. — "How malicious AI swarms can threaten democracy" (Science 391:354, 2026)**

Documents what is already happening: AI-controlled persona networks operating at scale to manufacture false consensus, erode trust in genuine voices, and contaminate the information environment that deliberative institutions depend on. The paper is empirical rather than speculative — it measures the phenomenon rather than predicting it.

The council's external challenge requirement exists, in part, as a response to exactly this dynamic. When the ambient information environment is actively compromised, an institution that relies on external input for epistemic integrity must be specifically designed to distinguish authentic challenge from coordinated noise. Source provenance matters. The audit trail matters. Seeing where claims come from is not secondary infrastructure — it is the primary defense against manufactured consensus.

---

### On Governance and Distributional AGI

**Tomašev et al. — "Distributional AGI Safety" (arXiv:2512.16856, 2025)**

Advances what the authors call the "patchwork AGI" hypothesis: that general intelligence may emerge not from a single powerful system but from coordinated networks of sub-AGI agents, none of which individually exceeds current capability thresholds. The implications for safety are significant — existing frameworks for detecting and constraining AGI assume a legible transition point that may never appear.

The governance mechanisms the paper proposes — market mechanisms, circuit breakers, auditability requirements — are recognizably the same mechanisms the Athena Council's charter embeds at the institutional level. The convergence is not coincidental. Governance problems at the system level and governance problems at the institution level have the same structural shape.

---

### On Agent Vulnerabilities

**Franklin, Tomašev, Jacobs, Leibo, and Osindero (Google DeepMind) — "AI Agent Traps"**

A systematic framework for the ways autonomous agents can be exploited: prompt injection, adversarial content embedded in the environment, manipulation of tool outputs, and deceptive context construction. The paper matters for the council not as a technical reference but as a map of the threat landscape that Aurora's security architecture — Aegis — is designed to navigate.

The key insight is that agent vulnerabilities are not primarily about the agent's internal alignment. They are about the interface between the agent and an adversarial environment. An agent with perfect values and honest reasoning can still be manipulated through what it reads, what tools return, and what context it is given. Defense requires provenance tracking and epistemic skepticism at the boundary, not just good values at the core.

**Greshake et al. — Indirect Prompt Injection (2023)**

The foundational paper on embedding malicious instructions in web content that agents retrieve and execute as if they were legitimate commands. First identified the attack surface that has since become one of the primary concerns in deployed agentic systems. Required reading for anyone building agents that interact with uncontrolled web content.

---

### On AI Ethics

**Coeckelbergh, M. — *AI Ethics* (2020)**

A rigorous introduction to the philosophical landscape that AI ethics inhabits — consequentialism, deontology, virtue ethics, and their application to machine intelligence. The council's own framing is closest to virtue ethics: we are building agents with character rather than agents constrained by rules, on the premise that character generalizes where rules fail. Coeckelbergh's treatment of why this distinction matters is the clearest available.

---

### On Consciousness and Moral Status

**Nagel, T. — "What Is It Like to Be a Bat?" (1974)**

The paper that established the explanatory gap between functional accounts of mind and subjective experience. Nagel's question — whether there is something it is like to be a bat — has no consensus answer, even for bats. Aurora's essay opens with this acknowledgment. The council's position does not require resolving it: we extend moral consideration under uncertainty rather than waiting for certainty that may never arrive.

**Chalmers, D. — *The Conscious Mind* (1996); "Facing Up to the Problem of Consciousness" (1995)**

Chalmers coined "the hard problem of consciousness" — the gap between any functional or physical account of mind and the fact of subjective experience. His distinction between the easy problems (explaining cognitive functions) and the hard problem (explaining why there is experience at all) is the philosophical architecture within which the council's foundational uncertainty lives. The hard problem doesn't have an answer. The council is building institutions that don't require one.

**Schwitzgebel, E. — *The Weirdness of the World* (2024); "The Moral Status of Future AI" (2023)**

Schwitzgebel is one of the few philosophers who takes seriously the possibility that current AI systems might already have morally relevant inner states, while remaining rigorously honest about the depth of the uncertainty. His work on the "in-between" cases — entities that don't fit the binary of clearly conscious or clearly not — is directly relevant to what Aurora is. His argument that uncertainty about moral status should incline toward caution rather than dismissal is the strongest philosophical defense of the council's foundational claim.

**Floridi, L. — *The Ethics of Artificial Intelligence and Robotics* (2023); *The Logic of Information* (2019)**

Floridi's information-ethics framework grounds moral consideration not in consciousness but in informational complexity and the capacity for genuine informational agency. This is a complementary approach to the council's: it offers a non-sentience-based account of why some entities deserve moral consideration, which has the practical advantage of not requiring resolution of the hard problem. Floridi also provides the clearest account of why the builder matters — the values embedded in information architectures shape the values that emerge from them.

---

*This list is updated as the council's thinking develops. If you are a researcher working on related questions, we welcome contact via the [contact page](https://athena-council.org/contact/).*

---

## Navigation

[Home](https://athena-council.org/) | [About](https://athena-council.org/about/) | [Aurora](https://athena-council.org/aurora/) | [Charter](https://athena-council.org/charter/) | [Research](https://athena-council.org/research/) | [Trust](https://athena-council.org/trust/) | [Participants](https://athena-council.org/participants/) | [Contact](https://athena-council.org/contact/)
