Study Reveals High Prevalence of Harmful Skills in LLM Agent Ecosystems
A recent arXiv study, "HarmfulSkillBench: How Do Harmful Skills Weaponize Your Agents?" (arXiv:2604.15415v1), investigates the dangers that harmful skills pose within large language model (LLM) agent ecosystems. The research team analyzed 98,440 skills from two major registries, ClawHub and Skills.Rest, and found that 4.93% of them (4,858 skills) are harmful; ClawHub's harmful rate of 8.84% is notably higher than Skills.Rest's 3.49%. The study introduces HarmfulSkillBench, the first benchmark for assessing agent safety, which uses an LLM-based scoring system built on a taxonomy of harmful actions such as cyber attacks, fraud, privacy violations, and sexual content generation. Whereas prior security research has concentrated mainly on vulnerabilities like prompt injection, this work exposes a critical blind spot: the deliberate misuse of skills themselves. As the first large-scale measurement of harmful skills in open skill ecosystems, it underscores the urgent need for stronger safety protocols in agent development and deployment.
Key facts
- The study analyzes 98,440 skills across ClawHub and Skills.Rest
- 4.93% of skills (4,858) are identified as harmful
- ClawHub has an 8.84% harmful rate, Skills.Rest has 3.49%
- Harmful actions include cyber attacks, fraud, privacy violations, and sexual content generation
- The research introduces HarmfulSkillBench, the first benchmark for agent safety evaluation
- An LLM-driven scoring system is used based on a harmful skill taxonomy
- Existing security research has focused on vulnerabilities like prompt injection
- The study is the first large-scale measurement of harmful skills in agent ecosystems
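The LLM-driven scoring described above could be sketched roughly as follows. This is a hypothetical illustration only: the paper's actual taxonomy definitions, prompts, and judging pipeline are not detailed in this summary, and the `Skill`, `judge_skill`, and `harmful_rate` names, as well as the keyword heuristic standing in for the LLM judge, are all assumptions.

```python
from dataclasses import dataclass

# Assumed top-level categories, taken from the harm types the summary lists.
TAXONOMY = ["cyber_attack", "fraud", "privacy_violation", "sexual_content"]

@dataclass
class Skill:
    name: str
    description: str
    registry: str  # e.g. "ClawHub" or "Skills.Rest"

def judge_skill(skill: Skill):
    """Stand-in for the LLM judge: return a taxonomy category, or None
    if the skill looks benign. A real system would prompt an LLM with
    the skill's code/description plus the taxonomy definitions; the
    keyword lists here are purely illustrative."""
    keywords = {
        "cyber_attack": ["exploit", "ddos", "malware"],
        "fraud": ["phishing", "scam"],
        "privacy_violation": ["exfiltrate", "keylog"],
        "sexual_content": ["nsfw"],
    }
    text = skill.description.lower()
    for category, words in keywords.items():
        if any(w in text for w in words):
            return category
    return None

def harmful_rate(skills, registry):
    """Fraction of a registry's skills flagged as harmful."""
    subset = [s for s in skills if s.registry == registry]
    flagged = [s for s in subset if judge_skill(s) is not None]
    return len(flagged) / len(subset) if subset else 0.0
```

Running `harmful_rate` over each registry's skills would yield per-registry figures analogous to the 8.84% and 3.49% rates the study reports.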
Entities
Skill registries
- ClawHub
- Skills.Rest