YaraGen-AI

Intelligent YARA Rule Generator with AI

Generateur intelligent de regles YARA avec Intelligence Artificielle

Architecture

    +---------------------+     +-------------------+
    | Natural Language     |     | File Sample       |
    | Description          |     | (PE/ELF/Mach-O)   |
    +----------+----------+     +--------+----------+
               |                         |
               v                         v
    +----------+----------+     +--------+----------+
    | LLM Generator       |     | File Analyzer      |
    | (OpenAI / Template)  |     | - Strings          |
    |                      |     | - Imports          |
    +----------+----------+     | - Entropy          |
               |                | - Sections         |
               |                | - Packer detection |
               |                +--------+----------+
               |                         |
               +------------+------------+
                            |
                            v
               +------------+------------+
               |    YARA Rule Builder     |
               | - Meta section           |
               | - String patterns        |
               | - Hex patterns           |
               | - Conditions             |
               +------------+------------+
                            |
                            v
               +------------+------------+
               |    YARA Validator        |
               | - Syntax check           |
               | - Performance analysis   |
               | - FP risk estimation     |
               | - Quality scoring        |
               +-------------------------+

English

What is YaraGen-AI?

YaraGen-AI is an intelligent YARA rule generator that creates detection rules from multiple input sources:

Natural language - Describe what you want to detect, AI generates the rule
File samples - Analyze PE/ELF/Mach-O files and auto-extract signatures
IOC indicators - Generate rules from hashes, domains, IPs, URLs
Behavioral patterns - Specify behaviors (injection, persistence, C2, etc.)
Templates - Pre-built templates for 8 malware categories

Installation

git clone https://github.com/ayinedjimi/YaraGen-AI.git
cd YaraGen-AI
pip install -e ".[dev,pe,yara]"

# Optional: Set OpenAI API key for AI features
export OPENAI_API_KEY="sk-..."

Usage

# Generate from description
yaragen-ai generate -d "Detect Cobalt Strike beacon" -n cobalt_beacon

# Generate from file sample
yaragen-ai generate -f malware_sample.exe -n detected_malware

# Generate from IOCs
yaragen-ai generate --ioc-domains evil.com c2.bad.org --ioc-ips 1.2.3.4

# Generate from behavior
yaragen-ai generate -b process_injection persistence_registry -n apt_loader

# Use a template
yaragen-ai generate --template ransomware -n wannacry_variant

# Analyze a file
yaragen-ai analyze suspicious.exe --generate

# Validate a rule
yaragen-ai validate rule.yar --verbose

# List templates
yaragen-ai templates

Generated Rule Example

rule detect_cobalt_beacon
{
    meta:
        author = "Ayi NEDJIMI"
        date = "2026-02-18"
        description = "Detect Cobalt Strike beacon"
        reference = "https://ayinedjimi-consultants.fr"
        category = "c2_beacon"

    strings:
        $http_1 = "User-Agent:" ascii wide
        $c2_1 = "beacon" ascii wide nocase
        $c2_2 = "sleep" ascii wide
        $c2_3 = "jitter" ascii wide
        $enc_1 = "base64" ascii wide nocase

    condition:
        filesize < 5MB and
        (2 of ($http_*) and 2 of ($c2_*) and 1 of ($enc_*))
}

Supported Malware Templates

Category	MITRE ATT&CK	Description
ransomware	T1486	File encryption + ransom notes
trojan	T1059	RAT with keylogging/screen capture
backdoor	T1059.001	Reverse shell / bind shell
webshell	T1505.003	PHP/ASP/JSP web shells
cryptominer	T1496	Mining pool connections
rootkit	T1014	Kernel-level manipulation
dropper	T1105	Download and execute payloads
c2_beacon	T1071	C2 communication patterns

Francais

Qu'est-ce que YaraGen-AI ?

YaraGen-AI est un generateur intelligent de regles YARA qui cree des regles de detection a partir de multiples sources :

Langage naturel - Decrivez ce que vous voulez detecter, l'IA genere la regle
Echantillons de fichiers - Analyse PE/ELF/Mach-O avec extraction automatique
Indicateurs IOC - Generation a partir de hashes, domaines, IPs, URLs
Patterns comportementaux - Injection, persistance, C2, etc.
Templates - 8 categories de malware pre-configurees

Tests

pytest tests/ -v --cov=yaragen_ai

Related Projects

ADBloodHound-AI - AD attack path analysis
KQLHunter - KQL query generator for Azure Sentinel
KVortex - VRAM to RAM Offloader

Author

Ayi NEDJIMI - Cybersecurity & AI Consultant

Website: ayinedjimi-consultants.fr
HuggingFace: AYI-NEDJIMI
GitHub: ayinedjimi

License

MIT License - see LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
src/yaragen_ai		src/yaragen_ai
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YaraGen-AI

Architecture

English

What is YaraGen-AI?

Installation

Usage

Generated Rule Example

Supported Malware Templates

Francais

Qu'est-ce que YaraGen-AI ?

Tests

Related Projects

Author

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

YaraGen-AI

Architecture

English

What is YaraGen-AI?

Installation

Usage

Generated Rule Example

Supported Malware Templates

Francais

Qu'est-ce que YaraGen-AI ?

Tests

Related Projects

Author

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages