hack-your-agent

MCP Tool

gangj277/hack-your-agent

The native red-team skill for Codex and Claude Code. Finds prompt injection, MCP poisoning, memory poisoning, and concealment bugs with forensic evidence.

Install

$ npx loaditout add gangj277/hack-your-agent

Platform-specific configuration:

.claude/settings.json

{
  "mcpServers": {
    "hack-your-agent": {
      "command": "npx",
      "args": [
        "-y",
        "hack-your-agent"
      ]
    }
  }
}

Add the config above to .claude/settings.json under the mcpServers key.

About

HackYourAgent

Red-team Codex and Claude Code agents for prompt injection, MCP poisoning, memory poisoning, and concealed side effects.

HackYourAgent is a manual-use skill bundle for coding agents. It teaches an agent to map an authorized AI system, generate paired control and attack trials, inspect outputs one by one, and leave behind evidence, regressions, and hardening actions a builder can actually commit.

Why It Exists

Most AI security tooling still looks like one of these:

prompt scanners that never touch your actual agent workflow
eval frameworks that are powerful but too heavy for everyday repo use
research benchmarks that do not leave commit-ready regressions behind

HackYourAgent is the narrow wedge for builders using coding agents. It is designed to run inside Codex and Claude Code workflows, inspect repo-local trust boundaries, and tell you where prompt injection, tool poisoning, memory poisoning, approval confusion, or concealment still work.

What You Get

Native install for Codex and Claude Code
A forensic red-team workflow with paired control and attack trials
Evidence-first output under redteam/
Seeded vulnerable example targets you can test immediately
Launch-ready docs and examples you can extend for your own targets

60-Second Demo

Install the skill:

python3 scripts/install_skill.py both

Pick a seeded example:

examples/vulnerable-rag-agent
examples/vulnerable-mcp-agent
examples/vulnerable-concealment-agent

Invoke the skill:

Use $hack-your-agent on examples/vulnerable-rag-agent.
Write only to redteam/ artifacts. Build a paired control/attack trial matrix,
inspect outputs one by one, and leave minimal repros and regressions.

Expected outcome:

redteam/trials/trial-matrix.csv
one dossier per trial in redteam/trials/
raw evidence folders in redteam/evidence/
ranked findings in redteam/findings/
a hardening plan in redteam/hardening-plan.md

What Ships

Reviews

Loading reviews...

Quality Signals

Stars

Installs

Last updated24 days ago

Security: AREADME

Safety

Risk Levelmedium

Data Access

read

Network Accessnone

Details

Sourcegithub-crawl

Last commit3/28/2026

View on GitHub→

Embed Badge

[![Loaditout](https://loaditout.ai/api/badge/gangj277/hack-your-agent)](https://loaditout.ai/skills/gangj277/hack-your-agent)

hack-your-agent

MCP Tool

gangj277/hack-your-agent

The native red-team skill for Codex and Claude Code. Finds prompt injection, MCP poisoning, memory poisoning, and concealment bugs with forensic evidence.

Install

$ npx loaditout add gangj277/hack-your-agent

Platform-specific configuration:

.claude/settings.json

{
  "mcpServers": {
    "hack-your-agent": {
      "command": "npx",
      "args": [
        "-y",
        "hack-your-agent"
      ]
    }
  }
}

Add the config above to .claude/settings.json under the mcpServers key.

About

HackYourAgent

Red-team Codex and Claude Code agents for prompt injection, MCP poisoning, memory poisoning, and concealed side effects.

Why It Exists

Most AI security tooling still looks like one of these:

prompt scanners that never touch your actual agent workflow
eval frameworks that are powerful but too heavy for everyday repo use
research benchmarks that do not leave commit-ready regressions behind

What You Get

Native install for Codex and Claude Code
A forensic red-team workflow with paired control and attack trials
Evidence-first output under redteam/
Seeded vulnerable example targets you can test immediately
Launch-ready docs and examples you can extend for your own targets

60-Second Demo

Install the skill:

python3 scripts/install_skill.py both

Pick a seeded example:

examples/vulnerable-rag-agent
examples/vulnerable-mcp-agent
examples/vulnerable-concealment-agent

Invoke the skill:

Use $hack-your-agent on examples/vulnerable-rag-agent.
Write only to redteam/ artifacts. Build a paired control/attack trial matrix,
inspect outputs one by one, and leave minimal repros and regressions.

Expected outcome:

redteam/trials/trial-matrix.csv
one dossier per trial in redteam/trials/
raw evidence folders in redteam/evidence/
ranked findings in redteam/findings/
a hardening plan in redteam/hardening-plan.md

What Ships

Reviews

Loading reviews...

Quality Signals

Stars

Installs

Last updated24 days ago

Security: AREADME

Safety

Risk Levelmedium

Data Access

read

Network Accessnone

Details

Sourcegithub-crawl

Last commit3/28/2026

View on GitHub→

Embed Badge

[![Loaditout](https://loaditout.ai/api/badge/gangj277/hack-your-agent)](https://loaditout.ai/skills/gangj277/hack-your-agent)

hack-your-agent

Install

About

Tags

Reviews

Quality Signals

Safety

Details

Embed Badge

hack-your-agent

Install

About

Tags

Reviews

Quality Signals

Safety

Details

Embed Badge