Skip to content

Research

Academic foundations and related publications.

  • :material-book-open-variant: Theoretical Foundations


    Mathematical framework and core concepts

  • :material-file-document-multiple: Papers


    Publications and references

Core Research Questions

SWARM addresses fundamental questions in multi-agent AI safety:

  1. Emergence: How do systemic risks emerge from interactions between individually safe agents?

  2. Measurement: How can we measure harm probabilistically rather than binary classification?

  3. Governance: What mechanisms effectively mitigate collective risks without over-constraining beneficial activity?

  4. Scaling: How do risks scale with agent count, capability, and interaction frequency?

Key References

  • Tomasev et al. "Virtual Agent Economies" (arXiv 2509.10147)
  • Multi-agent safety and coordination literature
  • Mechanism design and auction theory
  • Distributional robustness in ML systems

See Papers for the complete bibliography.