Comprehensive taxonomy of AI risks published on arXiv
A new paper on arXiv (2408.12622) presents a meta-review, database, and taxonomy of risks from artificial intelligence. The authors note that researchers, policymakers, and companies lack shared terminology for discussing AI risks. For example, "privacy" can refer either to a model leaking data or to freedom from surveillance, while "Goodhart's law," "specification gaming," "reward hacking," and "mesa-optimization" are all used to describe an AI system optimizing for a proxy rather than the intended goal. This inconsistent terminology makes it hard to compare studies and to check that risks are covered comprehensively. The paper addresses this by creating a unified taxonomy.
Key facts
- Paper ID: arXiv:2408.12622
- Type: replace (arXiv announcement type for a revised version of an existing submission)
- Title: The AI risk repository: A meta-review, database, and taxonomy of risks from artificial intelligence
- Addresses lack of shared terminology for AI risks
- Example of one term with several meanings: "privacy" can mean a model leaking data or freedom from surveillance
- Example of several terms for one phenomenon (AI optimizing for proxies): Goodhart's law, specification gaming, reward hacking, mesa-optimization
- Goal: enable cross-study comparisons and comprehensive risk coverage
- Method: meta-review, database creation, and a unified taxonomy
Entities
Institutions
- arXiv