Eel Pond¶
This is a lightweight protocol for assembling up to a few hundred million mRNAseq reads, annotating the resulting assembly, and doing differential expression analysis. The input is short-insert paired-end Illumina reads. This protocol can be run in a single command because it uses the snakemake automated workflow management system.
Previous versions of this protocol included line-by-line commands that the user could follow along with using a test dataset provided in the instructions. Since the recent development of snakemake workflow management tool and snakemake-wrappers to manage sofware installation of commonly-used bioinformatics tools, we have re-implemented the Eel Pond Protocol to make it easier for users to install software and run a de novo transcriptome assembly, annotation, and quick differential expression analysis on a set of short-read Illumina data using a single command.
The software for this protocol can be found here.
To run the protocol on your own computer system (requires Ubuntu 16.04):
Install miniconda (for Ubuntu 16.04 Jetstream image).
wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh bash Miniconda3-latest-Linux-x86_64.sh -b echo export PATH="$HOME/miniconda3/bin:$PATH" >> ~/.bash_profile source ~/.bash_profile
Then install snakemake:
conda install -c bioconda -c conda-forge -y snakemake
And finally, grab the eelpond code and test data
git clone https://github.com/dib-lab/eelpond.git cd eelpond git submodule update --init --recursive #download test data submodule
Now, run the tiny test data, to make sure things are working:
#run eelpond snakemake --use-conda --configfile rna_testdata/nema_config.yaml
Each step of the pipeline is highly customizable via the configuration file: see Configuration.
References: original eel-pond protocol docs, last updated 2015 eel-pond protocol docs, last updated 2016 DIBSI, nonmodel RNAseq workshop, July 2017 SIO-BUG, nonmodel RNAseq workshop, October 2017
intended workflows: - Read Quality Trimming and Filtering - Digital Normalization - Assembly - Quality Assessment - Annotation - Transcript Quantification - Differential Expression
snakemake style follows rna-seq-star example workflow