-
Notifications
You must be signed in to change notification settings - Fork 91
Description
Are you using the latest release?
v1.8.1 in conda env, one downgraded library (perl DB fix) "libdb=6.1.26"
Describe the bug
at the predict step funannotate fails, just after aligning (or during) proteins with diamnd/exonerate. It complains that the predict_misc/protein_alignments.gff3 is not found/doesnt exist. Its strange because the last couple of days (before downgrading libdb=6.1.26), I used the same set of proteins and the same conda env to annotate successully some other genomes. These genomes are much more contiguous with few Chromosome-sized sequences (up to 16 Mb). It runs fro quite a while so it is probably busy aligning. The current DIR has the tmp folder "p2g_9595" containing only diamond.dmnd, an empty "diamond.matches.tab" and empty folder "failed" and "scaffolds".
What command did you issue?
export OPENBLAS_NUM_THREADS=1
(
funannotate predict
-i $ASM
-o $OUT
-s "$SPECIES"
--name $NAME"_"
--busco_seed_species bombus_impatiens1
--busco_db hymenoptera
--cpus $CPUs
--keep_no_stops
--keep_evm
--SeqCenter XXXX
--organism other
--max_intronlen 10000
--optimize_augustus
--ploidy 1
--repeat_filter overlap blast
--soft_mask 5000
--protein_evidence
$INPUT2
$INPUT4
$INPUT5
$INPUT6
$INPUT7
$INPUT8
$INPUT9
$INPUT10
$INPUT11
$INPUT12
$INPUT13
$INPUT14
--transcript_evidence
$INPUT16
$INPUT17
)
Logfiles
cat funannotate-p2g.log
[10/16/20 22:00:50]: /home/ek/.conda/envs/funnotate181/lib/python3.7/site-packages/funannotate/aux_scripts/funannotate-p2g.py -p 2020_10_16_TetragonulaCarbonaria/predict_misc/proteins.combined.fa -g /scratch/ek/stingless.bee.genomics/annotation/funannotate/TetragonulaCarbonaria/2020_10_16_TetragonulaCarbonaria/predict_misc/genome.softmasked.fa -o 2020_10_16_TetragonulaCarbonaria/predict_misc/protein_alignments.gff3 --maxintron 10000 --cpus 20 --exonerate_pident 80 --ploidy 1 -f diamond --tblastn_out 2020_10_16_TetragonulaCarbonaria/predict_misc/p2g.diamond.out --logfile 2020_10_16_TetragonulaCarbonaria/logfiles/funannotate-p2g.log
[10/16/20 22:00:51]: Mapping 754,312 proteins to genome using diamond and exonerate
[10/16/20 22:00:51]: Diamond v2.0.4; Exonerate v2.4.0
[10/16/20 22:00:51]: diamond makedb --threads 8 --in /scratch/ek/stingless.bee.genomics/annotation/funannotate/TetragonulaCarbonaria/2020_10_16_TetragonulaCarbonaria/predict_misc/proteins.combined.fa --db diamond
[10/16/20 22:00:57]: diamond blastx --threads 8 -q /scratch/ek/stingless.bee.genomics/annotation/funannotate/TetragonulaCarbonaria/2020_10_16_TetragonulaCarbonaria/predict_misc/genome.softmasked.fa --db diamond -o diamond.matches.tab -e 1e-10 -k 0 --more-sensitive -f 6 sseqid slen sstart send qseqid qlen qstart qend pident length evalue score qcovhsp qframe
[10/17/20 06:43:09]: CMD ERROR: diamond blastx --threads 8 -q /scratch/ek/stingless.bee.genomics/annotation/funannotate/TetragonulaCarbonaria/2020_10_16_TetragonulaCarbonaria/predict_misc/genome.softmasked.fa --db diamond -o diamond.matches.tab -e 1e-10 -k 0 --more-sensitive -f 6 sseqid slen sstart send qseqid qlen qstart qend pident length evalue score qcovhsp qframe
OS/Install Information
Ubuntu 16.04
** error message**
[10:00 PM]: Mapping 754,312 proteins to genome using diamond and exonerate
[06:43 AM]: CMD ERROR: diamond blastx --threads 8 -q /scratch/ek/stingless.bee.genomics/annotation/funannotate/TetragonulaCarbonaria/2020_10_16_TetragonulaCarbonaria/predict_misc/genome.softmasked.fa --db diamond -o diamond.matches.tab -e 1e-10 -k 0 --more-sensitive -f 6 sseqid slen sstart send qseqid qlen qstart qend pident length evalue score qcovhsp qframe
b'diamond v2.0.4.142 (C) Max Planck Society for the Advancement of Science\nDocumentation, support and updates available at http://www.diamondsearch.org\n\n#CPU threads: 8\nScoring parameters: (Matrix=BLOSUM62 Lambda=0.267 K=0.041 Penalties=11/1)\nTemporary directory: \nOpening the database... [0.159s]\n#Target sequences to report alignments for: unlimited\nReference = diamond.dmnd\nSequences = 754312\nLetters = 341564446\nBlock size = 2000000000\nOpening the input file... [
....
]\nSearching alignments... [7.341s]\nProcessing query block 1, reference block 1/1, shape 16/16, index chunk 4/4.\nBuilding reference seed array... [1.046s]\nBuilding query seed array... [1.506s]\nComputing hash join... [0.695s]\nBuilding seed filter... [0.085s]\nSearching alignments... [7.513s]\nDeallocating buffers... [0.078s]\nClearing query masking... [0.234s]\nComputing alignments... '
Traceback (most recent call last):
File "/home/ek/.conda/envs/funnotate181/bin/funannotate", line 688, in
main()
File "/home/ek/.conda/envs/funnotate181/bin/funannotate", line 678, in main
mod.main(arguments)
File "/home/ek/.conda/envs/funnotate181/lib/python3.7/site-packages/funannotate/predict.py", line 983, in main
lib.exonerate2hints(Exonerate, hintsP)
File "/home/ek/.conda/envs/funnotate181/lib/python3.7/site-packages/funannotate/library.py", line 3877, in exonerate2hints
with open(file, 'r') as input:
FileNotFoundError: [Errno 2] No such file or directory: '/scratch/ek/stingless.bee.genomics/annotation/funannotate/TetragonulaCarbonaria/2020_10_16_TetragonulaCarbonaria/predict_misc/protein_alignments.gff3'