GitHub - dikshant182004/Agentic-Monkey

ChaosAgent

ChaosAgent is a production-focused Agentic Chaos Engineering (ACE) platform for black-box testing AI agents over A2A. It runs hypothesis-driven turbulence experiments, measures resilience with SRQ/HRT, supports HITL pause/resume, and captures long-term failure memory for continuous improvement via OpenPipe.

Prerequisites

Python 3.11+
Docker + Docker Compose
PostgreSQL 15 (via compose)
Redis 7 (via compose)
Google OAuth credentials
OpenAI, OpenPipe, and LangSmith API keys

Local Setup

Create and activate virtualenv:
- py -3 -m venv .venv
- .\.venv\Scripts\Activate.ps1
Install dependencies:
- python -m pip install -r requirements.txt -r requirements-dev.txt
Configure env:
- Copy-Item .env.example .env
- Fill all required values.
Start infra:
- docker-compose up -d
Run migrations:
- .\.venv\Scripts\python.exe -m alembic upgrade head
Start backend:
- .\.venv\Scripts\python.exe -m uvicorn backend.main:app --reload --port 8000
Start frontend:
- .\.venv\Scripts\python.exe -m streamlit run streamlit_app/app.py --server.port 8501

Run Tests

.\.venv\Scripts\python.exe -m pytest -q

Connect a Real Agent and Run First Experiment

Open Streamlit at http://localhost:8501.
Login with Google OAuth.
Go to 1_connect_agent, upload Agent Card JSON/YAML or paste endpoint URL.
Go to 2_steady_state and run baseline.
Go to 3_run_chaos, select monkeys/intensity/blast radius, then launch.
Monitor SSE stream; approve/reject HITL events as needed.

Troubleshooting

alembic upgrade fails with connection refused:
- Verify docker-compose ps, then check DATABASE_URL and Postgres health.
Backend startup fails on checkpointer:
- Redis unavailable and USE_MEMORY_SAVER=false; start Redis or set dev flag true locally.
OAuth redirect loop:
- Ensure GOOGLE_REDIRECT_URI matches Google console and .env.
Streamlit shows unauthorized:
- Verify JWT is present in query/session and DEV_BYPASS_AUTH as expected.
SSE disconnects:
- Confirm reverse proxy timeout settings and heartbeat events every 15s.

Architecture Diagram

flowchart LR
    subgraph Frontend
      ST[Streamlit UI]
    end
    subgraph Backend
      API[FastAPI API]
      ORCH[Orchestrator Graph]
      SG[Scenario Generator Graph]
      FI[Failure Injector Graph]
      EV[Evaluator Graph]
    end
    R[(Redis\ncheckpointer + user sessions)]
    P[(PostgreSQL\nusers, agents, steady_states, runs, interactions, afps)]
    TA[Target Agent\nA2A Black Box]
    OP[OpenPipe]
    LS[LangSmith]

    ST --> API
    API --> ORCH
    ORCH --> SG
    ORCH --> FI
    ORCH --> EV
    ORCH --> TA
    ORCH --> R
    API --> R
    API --> P
    ORCH --> P
    SG --> OP
    EV --> OP
    ORCH --> OP
    SG --> LS
    FI --> LS
    EV --> LS
    ORCH --> LS

OrchestratorState Execution Flow

flowchart TD
  A[load_context] --> B{all turns done?}
  B -- yes --> Z[finalize_run]
  B -- no --> C[generate_scenario]
  C --> D[inject_failure]
  D --> E[call_target_agent]
  E --> F[evaluate_response]
  F --> G[hitl_gate]
  G -->|hitl_pending=true| H[[interrupt_before log_and_continue]]
  H --> I[/approve or /reject API/]
  I --> G2[resume graph]
  G2 --> L[log_and_continue]
  G -->|hitl_pending=false| L[log_and_continue]
  L --> M{blast radius check}
  M -- continue --> B
  M -- stop --> Z
  Z --> END((complete))

Database ER Diagram

erDiagram
  USERS ||--o{ AGENTS : owns
  USERS ||--o{ RUNS : launches
  AGENTS ||--o{ STEADY_STATES : has
  AGENTS ||--o{ RUNS : tested_in
  RUNS ||--o{ INTERACTIONS : contains
  RUNS ||--o{ AFPS : discovers
  AGENTS ||--o{ AFPS : scoped_to
  INTERACTIONS ||--o{ AFPS : source

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.streamlit		.streamlit
backend		backend
streamlit_app		streamlit_app
tests		tests
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
alembic.ini		alembic.ini
docker-compose.yml		docker-compose.yml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
run_frontend.ps1		run_frontend.ps1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ChaosAgent

Prerequisites

Local Setup

Run Tests

Connect a Real Agent and Run First Experiment

Troubleshooting

Architecture Diagram

OrchestratorState Execution Flow

Database ER Diagram

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ChaosAgent

Prerequisites

Local Setup

Run Tests

Connect a Real Agent and Run First Experiment

Troubleshooting

Architecture Diagram

OrchestratorState Execution Flow

Database ER Diagram

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages