Skip to content

andreagrioni/NanoFG

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

285 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Forked from NanoFG

NanoFG is a complete pipeline that process Nanopore/Pac-Bio Long-Reads to detect fusion genes.

To date, minor changes to the original NanoFG pipelines:

  • maf-convert.mod.py: utilities from LAST tool, update for python3 compatibility.
  • RegionSelection.py: regions provided as BED file format; reads isolated by pybedtools.
  • mv paths.ini > paths.config.
  • add BED file with breakpoint regions for demo.

INSTALL

clone repository from GitHub

git clone https://github.com/andreagrioni/NanoFG.git
cd NanoFG

create anaconda environment

anaconda environment can be created from the environment.yml file. Note that The first line of the yml file sets the new environment's name.

conda env create -f environment.yml

config file

Update the path.config file with absolute paths for tools and reference genome.

How to run

bash NanoFG.sh -f </path/to/fastqdir>  [-n SAMPLE_NAME ] [-r REGIONS] [-cf] [-cc] [-df] [-dc]

OR

bash NanoFG.sh -b </path/to/bam> [-v </path/to/vcf>] [-n SAMPLE_NAME ] [-r REGIONS] [-cf] [-cc] [-df] [-dc]

Citation

Original NanoFG project at https://github.com/SdeBlank/NanoFG/wiki

Original citation: https://www.biorxiv.org/content/10.1101/807545v2

Required Tools:

Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).

Li, H. (2018). Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics, 34:3094-3100.

Kielbasa, S. M., Wan, R., Sato, K., Horton, P. & Frith, M. C. Adaptive seeds tame genomic sequence comparison. Genome Research 21, 487–493 (2011).

Cretu Stancu, M. et al. Mapping and phasing of structural variation in patient genomes using nanopore sequencing. Nat. Commun. 8, 1326 (2017).

Ruan, J. and Li, H. (2019) Fast and accurate long-read assembly with wtdbg2. Nat Methods

About

Third-generation fusion gene detection

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 84.6%
  • Shell 15.4%