Proteus is a computational research pipeline designed to simulate the self-assembly and nanoprecipitation of polymer chains. By automating the transition from chemical text (SMILES) to physical simulation (Molecular Dynamics), it allows researchers to rapidly screen and analyze polymer folding behaviors.
- Text-to-Structure: Instantly converts SMILES strings into 3D molecular geometries with explicit hydrogens.
- Automated Topology: Generates LAMMPS-compliant data files with generic force field parameters (Lennard-Jones).
- Physics Engine: Runs implicit solvent simulations using Langevin dynamics with optimized viscosity for realistic molecular drifting.
-
Built-in Analytics: Automatically parses simulation logs to calculate the final Radius of Gyration (
$R_g$ ) and generates stability plots. - Automated Visualization: Uses Ovito to render high-quality, color-coded GIF animations of the simulation.
Proteus uses uv for high-performance dependency management and virtual environment creation.
If you don't have uv installed, you can install it using the following command (Linux/macOS):
curl -LsSf https://astral.sh/uv/install.sh | shFor other platforms or methods, refer to the uv documentation.
git clone <repository-url>
cd proteusmake setupsource venv/bin/activateThe easiest way to run the pipeline is using the run.sh helper script.
chmod +x run.sh
./run.sh "<SMILES>" "<NAME>" [STEPS]- Example (3-unit PEO):
./run.sh "CCOCCOCCO" "PEO_3" 10000 - Example (Multi-chain):
./run.sh "CCO.CCO.CCO" "Triple_Chain" 10000
SMILES: The chemical structure. Use a dot.to separate independent molecules.NAME: The folder name for your results inoutput/.STEPS: (Optional) Simulation duration. Default is 10,000.
Results are saved in output/<NAME>/:
animation.gif: High-quality video of the simulation (color-coded by molecule).polymer.data: LAMMPS topology file (3D structure).simulation.in: The generated LAMMPS input script.simulation.log: Raw simulation data (energies, temperatures).stability.png: Equilibrium graph showing Temperature and Potential Energy over time.trajectory.dump: Raw atom positions (viewable in external tools like VMD/Ovito).
-
Simulation Core (
src/):-
Topology Architect (
topology.py): SMILES$\to$ 3D Topology via RDKit & UFF. -
Simulation Engine (
simulation.py): Langevin dynamics (300K) with tuned viscosity. -
Analytics (
analysis.py): Log parsing and$R_g$ calculation. -
Visualization (
visualization.py): Headless rendering of trajectories via Ovito. -
Reporting (
report.py): PDF report generation. -
HTS (
hts.py): High-Throughput Screening automation.
-
Topology Architect (
The simulation uses a Generic Hydrophobic Interaction model:
- Force Field: Lennard-Jones (epsilon=0.105, sigma=2.5).
- Solvent: Implicit solvent via Langevin thermostat.
- Viscosity: Tuned damping for realistic drifting rather than high-frequency vibrations.