De Novo Genome Assembly
Introduction
De novo assembly is used to piece together shorter sequenced fragments (reads) into more extensive continuous segments (contigs and scaffolds), which are often required for subsequent analyses. This is a key step in many analyses where a reference genome is unavailable.
Workflow
An overview of our de novo assembly pipeline is shown below. Key steps include sequence quality evaluation using FastQC, read trimming and filtering using Trimmomatic, and de novo assembly using SPAdes.
The final results of the pipeline are outlined in blue.
- An HTML formatted sequencing quality report produced by FastQC.
- A FASTA formatted file containing assembled contigs produced by SPAdes.
- A FASTA formatted file containing assembled scaffolds produced by SPAdes.
- A final analysis report in PDF format containing a description of the methods used and some additional QC and summary figures.
Pricing
Our current price for this analysis is $60 per sample.
References
Please refer to these resources and tools that form an integral part of the pipeline process.
- FastQC – A quality control tool for high throughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
- Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014 Aug 1;30(15):2114-20. doi: 10.1093/bioinformatics/btu170
- Prjibelski A, Antipov D, Meleshko D, Lapidus A, Korobeynikov A. Using SPAdes De Novo Assembler. Curr Protoc Bioinformatics. 2020 Jun;70(1):e102. doi: 10.1002/cpbi.102
Contact
Please contact us for a free consultation. We look forward to working with you on your analysis projects.
Phone: 509-368-6668
Email: Spok.GenomicsCore@wsu.edu