Skip to content

Empty eggnogg.annotations.txt - EggNog Parse ERROR #892

@CesarBertinetti

Description

@CesarBertinetti

Are you using the latest release? Yes

Describe the bug
I am missing the annotation of certain genes which seem to be not properly parsed by EggNog. Please any advice would be helpful I'm almost there just missing some important genes

What command did you issue?
funannotate annotate -i ${tmp_dir} --cpus $NSLOTS --busco_db actinopterygii --iprscan bluegill1.proteins.fa.xml
-s "bluegill"

Interproscan is run externally using:

./interproscan.sh -i /scratch365/cbertine/funanotate/BLG_EYE/fun/predict_results/bluegill1.proteins.fa
-f XML -goterms -pa --cpu $NSLOTS

Logfiles
- Annotate command seems fine:

[Mar 13 12:15 PM]: OS: Red Hat Enterprise Linux 8.7, 24 cores, ~ 263 GB RAM. Python: 3.8.15
[Mar 13 12:15 PM]: Running 1.8.14
[Mar 13 12:15 PM]: No NCBI SBT file given, will use default, however if you plan to submit to NCBI, create one and pass it here '--sbt'
[Mar 13 12:15 PM]: Parsing input files
[Mar 13 12:15 PM]: Existing tbl found: /tmp/185292.1.long/BLG_EYETRANS_ANNOT_V1/update_results/Lepomis_machrochirus.tbl
[Mar 13 12:19 PM]: Adding Functional Annotation to Lepomis machrochirus, NCBI accession: None
[Mar 13 12:19 PM]: Annotation consists of: 44,391 gene models
[Mar 13 12:19 PM]: 44,064 protein records loaded
[Mar 13 12:19 PM]: Running HMMer search of PFAM version 35.0
[Mar 13 12:32 PM]: 44,578 annotations added
[Mar 13 12:32 PM]: Running Diamond blastp search of UniProt DB version 2022_05
[Mar 13 12:33 PM]: 9,394 valid gene/product annotations from 12,206 total
[Mar 13 12:33 PM]: Running Eggnog-mapper
[Mar 13 01:17 PM]: Parsing EggNog Annotations
[Mar 13 01:17 PM]: EggNog version parsed as 2.1.10
[Mar 13 01:17 PM]: Combining UniProt/EggNog gene and product names using Gene2Product version 1.88
[Mar 13 01:17 PM]: 9,394 gene name and product description annotations added
[Mar 13 01:17 PM]: Running Diamond blastp search of MEROPS version 12.0
[Mar 13 01:18 PM]: 1,413 annotations added
[Mar 13 01:18 PM]: Annotating CAZYmes using HMMer search of dbCAN version 11.0
[Mar 13 01:26 PM]: 364 annotations added
[Mar 13 01:26 PM]: Annotating proteins with BUSCO actinopterygii models
[Mar 13 01:36 PM]: 4,482 annotations added
[Mar 13 01:36 PM]: Skipping phobius predictions, try funannotate remote -m phobius
[Mar 13 01:36 PM]: Skipping secretome: neither SignalP nor Phobius searches were run
[Mar 13 01:36 PM]: 0 secretome and 0 transmembane annotations added
[Mar 13 01:36 PM]: Parsing InterProScan5 XML file
[Mar 13 01:57 PM]: Found 0 duplicated annotations, adding 999,840 valid annotations
[Mar 13 01:58 PM]: Converting to final Genbank format, good luck!
[Mar 13 02:03 PM]: Creating AGP file and corresponding contigs file
[Mar 13 02:04 PM]: Writing genome annotation table.
[Mar 13 02:22 PM]: Funannotate annotate has completed successfully!

    We need YOUR help to improve gene names/product descriptions:
       0 gene/products names MUST be fixed, see /tmp/185292.1.long/BLG_EYETRANS_ANNOT_V1/annotate_results/Gene2Products.must-fix.txt
       0 gene/product names need to be curated, see /tmp/185292.1.long/BLG_EYETRANS_ANNOT_V1/annotate_results/Gene2Products.need-curating.txt
       749 gene/product names passed but are not in Database, see /tmp/185292.1.long/BLG_EYETRANS_ANNOT_V1/annotate_results/Gene2Products.new-names-passed.txt

    Please consider contributing a PR at https://github.com/nextgenusfs/gene2product

- But many genes show the EggNog Parse ERROR (just one example):

[03/28/23 11:40:11]: EggNog Parse ERROR: FUN_057458-T1 215358.XP_010734348.1 2.11e-289 792.0 COG2126@1|root,KOG3713@2759|Eukaryota,38HR0@33154|Opisthokonta,3BC0S@33208|Metazoa,3CX9N@33213|Bilateria,48437@7711|Chordata,497IX@7742|Vertebrata,4A0YR@7898|Actinopterygii 33208|Metazoa P Potassium voltage-gated channel, subfamily G, member KCNG3 GO:0003674,GO:0005215,GO:0005216,GO:0005244,GO:0005249,GO:0005251,GO:0005261,GO:0005267,GO:0005575,GO:0005622,GO:0005623,GO:0005737,GO:0005783,GO:0005886,GO:0006810,GO:0006811,GO:0006812,GO:0006813,GO:0008150,GO:0008324,GO:0009987,GO:0012505,GO:0015075,GO:0015077,GO:0015079,GO:0015267,GO:0015318,GO:0015672,GO:0016020,GO:0022803,GO:0022832,GO:0022836,GO:0022838,GO:0022839,GO:0022843,GO:0022857,GO:0022890,GO:0030001,GO:0034220,GO:0043226,GO:0043227,GO:0043229,GO:0043231,GO:0044424,GO:0044444,GO:0044464,GO:0046873,GO:0051179,GO:0051234,GO:0055085,GO:0071804,GO:0071805,GO:0071944,GO:0098655,GO:0098660,GO:0098662 - ko:K04902 - - - - ko00000,ko04040 1.A.1.2.17 - - BTB_2,Ion_trans

- The eggnog.annotations.txt in predict_misc is empty

OS/Install Information

Checking dependencies for 1.8.14

You are running Python v 3.8.15. Now checking python packages...
biopython: 1.76
goatools: 1.2.3
matplotlib: 3.4.3
natsort: 8.2.0
numpy: 1.24.2
pandas: 1.5.3
psutil: 5.7.0
requests: 2.28.2
scikit-learn: 1.2.1
scipy: 1.10.0
seaborn: 0.12.2
All 11 python packages installed

You are running Perl v b'5.032001'. Now checking perl modules...
Carp: 1.50
Clone: 0.46
DBD::SQLite: 1.72
DBD::mysql: 4.046
DBI: 1.643
DB_File: 1.855
Data::Dumper: 2.183
File::Basename: 2.85
File::Which: 1.24
Getopt::Long: 2.54
Hash::Merge: 0.302
JSON: 4.10
LWP::UserAgent: 6.67
Logger::Simple: 2.0
POSIX: 1.94
Parallel::ForkManager: 2.02
Pod::Usage: 1.69
Scalar::Util::Numeric: 0.40
Storable: 3.15
Text::Soundex: 3.05
Thread::Queue: 3.14
Tie::File: 1.06
URI::Escape: 5.12
YAML: 1.30
threads: 2.25
threads::shared: 1.61
ERROR: local::lib not installed, install with cpanm local::lib

Checking Environmental Variables...
$FUNANNOTATE_DB=/scratch365/cbertine/funanotate/funannotate_db
$PASAHOME=/afs/crc.nd.edu/user/c/cbertine/.conda/envs/funannotate/opt/pasa-2.5.2
$TRINITY_HOME=/afs/crc.nd.edu/user/c/cbertine/.conda/envs/funannotate/opt/trinity-2.8.5
$EVM_HOME=/afs/crc.nd.edu/user/c/cbertine/.conda/envs/funannotate/opt/evidencemodeler-1.1.1
$AUGUSTUS_CONFIG_PATH=/afs/crc.nd.edu/user/c/cbertine/.conda/envs/funannotate/config/
ERROR: GENEMARK_PATH not set. export GENEMARK_PATH=/path/to/dir

Checking external dependencies...
ERROR: pslDnaFiler found but error running: pslCDnaFilter: error while loading shared libraries: libssl.so.1.0.0: cannot open shared object file: No such file or directory

PASA: 2.5.2
CodingQuarry: 2.0
Trinity: 2.8.5
augustus: 3.5.0
bamtools: bamtools 2.5.1
bedtools: bedtools v2.30.0
blat: BLAT v35
diamond: 2.0.15
emapper.py: 2.1.1
ete3: 3.1.2
exonerate: exonerate 2.4.0
fasta: 36.3.8g
glimmerhmm: 3.0.4
gmap: 2021-08-25
hisat2: 2.2.1
hmmscan: HMMER 3.3.2 (Nov 2020)
hmmsearch: HMMER 3.3.2 (Nov 2020)
java: 17.0.3-internal
kallisto: 0.46.1
mafft: v7.515 (2023/Jan/15)
makeblastdb: makeblastdb 2.2.31+
minimap2: 2.24-r1122
pigz: 2.6
proteinortho: 6.1.7
salmon: salmon 0.14.1
samtools: samtools 1.16.1
snap: 2006-07-28
stringtie: 2.2.1
tRNAscan-SE: 2.0.11 (Oct 2022)
tantan: tantan 40
tbl2asn: 25.8
tblastn: tblastn 2.2.31+
trimal: trimAl v1.4.rev15 build[2013-12-17]
trimmomatic: 0.39
ERROR: gmes_petap.pl not installed
ERROR: pslCDnaFilter not installed
ERROR: signalp not installed

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions