Skip to content

How to make reproducible runs of the pipeline? #170

@IanCodes

Description

@IanCodes

I noticed that after running the same workflow (v1.7.1) on the same data that output files differ. For example 'wc -l' results:
111935 de_analysis/filtered_transcript_counts_with_genes.tsv #Run 1
111922 de_analysis/filtered_transcript_counts_with_genes.tsv #Run 2

I am not sure when the analyses start diverging (probably during Stringtie). However, output BAM files are the same.

Is there a way of being able to rerun the workflow and get exactly the same output? The intention is to be able to delete large fastq and fasta files, and recreate in the future if needed.

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions