Welcome to the TinyRecursiveModels-AES repository! This index helps you find the right documentation for your needs.
- GETTING_STARTED.md - 5-minute quick start guide
- README_AES.md - Complete AES documentation
- Run
./quickstart.sh- Automated setup and training
- COMPARISON.md - Compare Original TRM vs AES
- CHANGES.md - Technical details of adaptations
- README.md - Original TRM documentation
| File | Purpose | Read Time | Audience |
|---|---|---|---|
| GETTING_STARTED.md | Quick setup and first training run | 5 min | Beginners |
| README_AES.md | Complete guide to AES adaptation | 15 min | All users |
| README.md | Original TRM documentation | 10 min | TRM users |
| File | Purpose | Read Time | Audience |
|---|---|---|---|
| COMPARISON.md | Side-by-side comparison | 10 min | Decision makers |
| CHANGES.md | Technical implementation details | 15 min | Developers |
| instructions.md | Original adaptation requirements | 3 min | Context |
| INDEX.md | This file - navigation hub | 2 min | Everyone |
| File | Purpose | Usage |
|---|---|---|
| quickstart.sh | Automated setup and training | ./quickstart.sh |
| example_usage.py | Usage examples and guides | python example_usage.py |
β Run ./quickstart.sh or read GETTING_STARTED.md
β Read README_AES.md sections 1-3
β GETTING_STARTED.md - Option 2: Manual Setup
β README_AES.md - Dataset Preparation section
β Or see dataset/build_asappp_dataset.py --help
β GETTING_STARTED.md - Step 3 β Or README_AES.md - Training section
β GETTING_STARTED.md - Step 4
β Or run python evaluate_aes.py --help
β README_AES.md - Model Architecture section β GETTING_STARTED.md - Tune Hyperparameters
β README_AES.md - Evaluation Metrics section β GETTING_STARTED.md - Understanding the Metrics
β README_AES.md - Tips for M1 Mac
β GETTING_STARTED.md - Common Issues
β Run python example_usage.py - section 7
β COMPARISON.md - Complete comparison β CHANGES.md - Technical differences
β CHANGES.md - Comprehensive change log β COMPARISON.md - Quick reference
β Run python example_usage.py
β README_AES.md - Training section
β COMPARISON.md - Migration Guide β CHANGES.md - Architecture Adaptations
| File | Purpose |
|---|---|
train_aes_m1.py |
M1-optimized training script for AES |
evaluate_aes.py |
Evaluation script with AES metrics |
pretrain.py |
Original TRM training script |
| File | Purpose |
|---|---|
dataset/build_asappp_dataset.py |
Build ASAPPP datasets from HuggingFace |
dataset/build_arc_dataset.py |
Build ARC-AGI datasets (original) |
dataset/common.py |
Shared dataset utilities |
| File | Purpose |
|---|---|
models/recursive_reasoning/ |
TRM model implementations |
models/ema.py |
Exponential Moving Average |
models/layers.py |
Neural network layers |
evaluators/aes_evaluator.py |
AES-specific metrics (QWK, MSE, etc.) |
| File | Purpose |
|---|---|
config/cfg_aes.yaml |
AES training configuration |
config/cfg_pretrain.yaml |
Original TRM configuration |
| File | Purpose |
|---|---|
puzzle_dataset.py |
Dataset loading and batching |
requirements.txt |
Python dependencies |
- Read GETTING_STARTED.md (5 min)
- Run
./quickstart.sh(5 min setup) - Wait for training (20 min - let it run)
- Check results
- Read GETTING_STARTED.md (5 min)
- Read README_AES.md (15 min)
- Manual setup and training (1.5 hours)
- Experiment with hyperparameters (30 min)
- Read README.md - Original TRM (10 min)
- Read COMPARISON.md (10 min)
- Read CHANGES.md (15 min)
- Read README_AES.md (15 min)
- Run
python example_usage.py(10 min)
- Read all documentation (1 hour)
- Set up environment manually (30 min)
- Build dataset (30 min)
- Run training with different configs (1 hour)
- Evaluate and analyze results (30 min)
| File | Lines | Purpose |
|---|---|---|
| README_AES.md | ~350 | Main documentation |
| CHANGES.md | ~420 | Technical details |
| COMPARISON.md | ~345 | Side-by-side comparison |
| GETTING_STARTED.md | ~290 | Quick start guide |
| train_aes_m1.py | ~575 | Training implementation |
| evaluate_aes.py | ~380 | Evaluation implementation |
| build_asappp_dataset.py | ~375 | Dataset builder |
| aes_evaluator.py | ~250 | Evaluation metrics |
- Start: GETTING_STARTED.md
- Learn: README_AES.md
- Practice:
python example_usage.py
- Context: README.md
- Adaptation: CHANGES.md
- Results: README_AES.md - Expected Results
- Architecture: CHANGES.md - Architecture Adaptations
- Code: Review Python files
- API:
--helpflags on scripts
- Overview: COMPARISON.md - At a Glance
- Cost: COMPARISON.md - Performance & Resources
- ROI: COMPARISON.md - When to Use Each
Installation
- GETTING_STARTED.md - Step 1
- README_AES.md - Installation section
Dataset
- README_AES.md - Dataset Preparation
- CHANGES.md - Dataset Differences
dataset/build_asappp_dataset.py
Training
- GETTING_STARTED.md - Step 3
- README_AES.md - Training section
train_aes_m1.py
Evaluation
- README_AES.md - Evaluation Metrics
- GETTING_STARTED.md - Step 4
evaluate_aes.py
Model Architecture
- README_AES.md - Model Architecture
- CHANGES.md - Architecture Adaptations
- COMPARISON.md - Model Architecture table
Performance
- COMPARISON.md - Performance & Resources
- CHANGES.md - Performance Expectations
- README_AES.md - Expected Results
Troubleshooting
- GETTING_STARTED.md - Common Issues
- README_AES.md - Tips for M1 Mac
python example_usage.py- section 7
Hardware Requirements
- COMPARISON.md - Hardware & Environment
- README_AES.md - Requirements section
- Start with GETTING_STARTED.md if you're new
- Use COMPARISON.md to understand differences
- Reference README_AES.md for detailed information
- Check CHANGES.md for technical implementation details
- Run example_usage.py for interactive help
- Original TRM Paper: https://arxiv.org/abs/2510.04871
- Original Repository: https://github.com/AlexiaJM/TinyRecursiveModels
- ASAPPP Dataset: https://huggingface.co/datasets/llm-aes
- Kaggle Competition: https://www.kaggle.com/c/asap-aes
- Quick questions: Check GETTING_STARTED.md - Common Issues
- Technical issues: See README_AES.md - Troubleshooting
- Understanding code: Run
python example_usage.py - Bug reports: Open a GitHub issue
- Feature requests: Open a GitHub issue
Before training:
- Read GETTING_STARTED.md
- Environment set up
- Dataset prepared
- Understand target metrics
After training:
- Model evaluated (QWK > 0.60)
- Results documented
- Checkpoints saved
If you use this work, cite the original TRM paper:
@misc{jolicoeurmartineau2025morerecursivereasoningtiny,
title={Less is More: Recursive Reasoning with Tiny Networks},
author={Alexia Jolicoeur-Martineau},
year={2025},
eprint={2510.04871},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2510.04871},
}Last Updated: Compatible with the current repository structure
Maintained By: Repository contributors
License: Same as original TRM repository
Feedback: Open an issue or pull request