Skip to content

add end-to-end benchmark suite#677

Open
KimBioInfoStudio wants to merge 1 commit intoOpenGene:masterfrom
KimBioInfoStudio:feat/benchmark-suite
Open

add end-to-end benchmark suite#677
KimBioInfoStudio wants to merge 1 commit intoOpenGene:masterfrom
KimBioInfoStudio:feat/benchmark-suite

Conversation

@KimBioInfoStudio
Copy link
Copy Markdown
Member

Summary

  • Modular benchmark package (benchmark/) with 8 focused modules
  • 9 I/O format modes (PE/SE x fq/gz combinations + stdin-stdout)
  • 7 feature test modes: merge, correction, dedup-pe/se, qualcut-pe/se, high-thread (32 workers)
  • Overlapping PE data generator for merge/correction testing (insert ~220bp, injected mismatches with skewed quality)
  • --r1/--r2 flags for custom FASTQ input with automatic format conversion (gz<->fq)
  • Mode groups: all (9 I/O), all-feat (7 feature), everything (16 total)

Test plan

  • python3 -m benchmark --help shows all modes and options
  • python3 -m benchmark --mode all-feat --opt ./fastp runs all feature modes
  • python3 -m benchmark --r1 real_R1.fq.gz --r2 real_R2.fq.gz --mode merge --opt ./fastp works with custom data
  • python3 -m benchmark --mode everything --opt ./fastp --orig ./fastp_baseline full regression test with verification

🤖 Generated with Claude Code

Split the monolithic bench_e2e.py into a modular benchmark/ package with
focused modules (modes, datagen, runner, verify, report, sysinfo). Added
7 feature test modes (merge, correction, dedup, qualcut, high-thread)
that exercise fastp code paths beyond default filtering, including
overlapping PE data generation for merge/correction testing. Supports
custom FASTQ input via --r1/--r2 with automatic format conversion.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant