TIGRA: Targeted Iterative Graph Routing Assembler for breakpoint[s ]http://GENOME.CSHLP.org/content/24/2/310.long key steps: read extraction & de Bruijn #assembly
This presents a breakpoint assembler used for many projects including 1000 Genomes. It uses a targeted iterative graph routing approach. The program consists of two steps: read extraction and then assembly. The assembly step uses a de Bruin graph-based approach to create contigs from the selected reads. A shortcoming of TIGRA is it depends on the success of the first step of the program, selection of reads that span breakpoints. Thus TIGRA is sensitive to the breakpoint annotation accuracy input. Breakpoints determined from discordant paired-end or split-end alignments and by predictors like breakdancer, delly, genomestrip are excellent for TIGRA, but those determined only by read-depth such as CNVnator and RDX are poor performers.
As input TIGRA requires putative breakpoints annotation/prediction (preferably at nucleotide level or at least within 100bp resolution) and BAM files (sequence reads aligned to reference genome).
In the read extraction TIGRA tries to select all the reads that are likely associated with the breakpoint as long ass they have at least one ned or subsegment that is confidently mapped. For known SV types, TIGRA extract reads selectively to reduce the over representation of the reference allele. The assembly step uses the a de Bruin graph-based approach to create contigs from the selected reads. For this TIGRA first uses an iterative procedure to explore multiple k-mers and thus increases the chance of assembling of low coverage reads. Next it records alternative path in the contain graph