-
Notifications
You must be signed in to change notification settings - Fork 91
Description
Are you using the latest release? Yes
Describe the bug
I am missing the annotation of certain genes which seem to be not properly parsed by EggNog. Please any advice would be helpful I'm almost there just missing some important genes
What command did you issue?
funannotate annotate -i ${tmp_dir} --cpus $NSLOTS --busco_db actinopterygii --iprscan bluegill1.proteins.fa.xml
-s "bluegill"
Interproscan is run externally using:
./interproscan.sh -i /scratch365/cbertine/funanotate/BLG_EYE/fun/predict_results/bluegill1.proteins.fa
-f XML -goterms -pa --cpu $NSLOTS
Logfiles
- Annotate command seems fine:
[Mar 13 12:15 PM]: OS: Red Hat Enterprise Linux 8.7, 24 cores, ~ 263 GB RAM. Python: 3.8.15
[Mar 13 12:15 PM]: Running 1.8.14
[Mar 13 12:15 PM]: No NCBI SBT file given, will use default, however if you plan to submit to NCBI, create one and pass it here '--sbt'
[Mar 13 12:15 PM]: Parsing input files
[Mar 13 12:15 PM]: Existing tbl found: /tmp/185292.1.long/BLG_EYETRANS_ANNOT_V1/update_results/Lepomis_machrochirus.tbl
[Mar 13 12:19 PM]: Adding Functional Annotation to Lepomis machrochirus, NCBI accession: None
[Mar 13 12:19 PM]: Annotation consists of: 44,391 gene models
[Mar 13 12:19 PM]: 44,064 protein records loaded
[Mar 13 12:19 PM]: Running HMMer search of PFAM version 35.0
[Mar 13 12:32 PM]: 44,578 annotations added
[Mar 13 12:32 PM]: Running Diamond blastp search of UniProt DB version 2022_05
[Mar 13 12:33 PM]: 9,394 valid gene/product annotations from 12,206 total
[Mar 13 12:33 PM]: Running Eggnog-mapper
[Mar 13 01:17 PM]: Parsing EggNog Annotations
[Mar 13 01:17 PM]: EggNog version parsed as 2.1.10
[Mar 13 01:17 PM]: Combining UniProt/EggNog gene and product names using Gene2Product version 1.88
[Mar 13 01:17 PM]: 9,394 gene name and product description annotations added
[Mar 13 01:17 PM]: Running Diamond blastp search of MEROPS version 12.0
[Mar 13 01:18 PM]: 1,413 annotations added
[Mar 13 01:18 PM]: Annotating CAZYmes using HMMer search of dbCAN version 11.0
[Mar 13 01:26 PM]: 364 annotations added
[Mar 13 01:26 PM]: Annotating proteins with BUSCO actinopterygii models
[Mar 13 01:36 PM]: 4,482 annotations added
[Mar 13 01:36 PM]: Skipping phobius predictions, try funannotate remote -m phobius
[Mar 13 01:36 PM]: Skipping secretome: neither SignalP nor Phobius searches were run
[Mar 13 01:36 PM]: 0 secretome and 0 transmembane annotations added
[Mar 13 01:36 PM]: Parsing InterProScan5 XML file
[Mar 13 01:57 PM]: Found 0 duplicated annotations, adding 999,840 valid annotations
[Mar 13 01:58 PM]: Converting to final Genbank format, good luck!
[Mar 13 02:03 PM]: Creating AGP file and corresponding contigs file
[Mar 13 02:04 PM]: Writing genome annotation table.
[Mar 13 02:22 PM]: Funannotate annotate has completed successfully!
We need YOUR help to improve gene names/product descriptions:
0 gene/products names MUST be fixed, see /tmp/185292.1.long/BLG_EYETRANS_ANNOT_V1/annotate_results/Gene2Products.must-fix.txt
0 gene/product names need to be curated, see /tmp/185292.1.long/BLG_EYETRANS_ANNOT_V1/annotate_results/Gene2Products.need-curating.txt
749 gene/product names passed but are not in Database, see /tmp/185292.1.long/BLG_EYETRANS_ANNOT_V1/annotate_results/Gene2Products.new-names-passed.txt
Please consider contributing a PR at https://github.com/nextgenusfs/gene2product
- But many genes show the EggNog Parse ERROR (just one example):
[03/28/23 11:40:11]: EggNog Parse ERROR: FUN_057458-T1 215358.XP_010734348.1 2.11e-289 792.0 COG2126@1|root,KOG3713@2759|Eukaryota,38HR0@33154|Opisthokonta,3BC0S@33208|Metazoa,3CX9N@33213|Bilateria,48437@7711|Chordata,497IX@7742|Vertebrata,4A0YR@7898|Actinopterygii 33208|Metazoa P Potassium voltage-gated channel, subfamily G, member KCNG3 GO:0003674,GO:0005215,GO:0005216,GO:0005244,GO:0005249,GO:0005251,GO:0005261,GO:0005267,GO:0005575,GO:0005622,GO:0005623,GO:0005737,GO:0005783,GO:0005886,GO:0006810,GO:0006811,GO:0006812,GO:0006813,GO:0008150,GO:0008324,GO:0009987,GO:0012505,GO:0015075,GO:0015077,GO:0015079,GO:0015267,GO:0015318,GO:0015672,GO:0016020,GO:0022803,GO:0022832,GO:0022836,GO:0022838,GO:0022839,GO:0022843,GO:0022857,GO:0022890,GO:0030001,GO:0034220,GO:0043226,GO:0043227,GO:0043229,GO:0043231,GO:0044424,GO:0044444,GO:0044464,GO:0046873,GO:0051179,GO:0051234,GO:0055085,GO:0071804,GO:0071805,GO:0071944,GO:0098655,GO:0098660,GO:0098662 - ko:K04902 - - - - ko00000,ko04040 1.A.1.2.17 - - BTB_2,Ion_trans
- The eggnog.annotations.txt in predict_misc is empty
OS/Install Information
Checking dependencies for 1.8.14
You are running Python v 3.8.15. Now checking python packages...
biopython: 1.76
goatools: 1.2.3
matplotlib: 3.4.3
natsort: 8.2.0
numpy: 1.24.2
pandas: 1.5.3
psutil: 5.7.0
requests: 2.28.2
scikit-learn: 1.2.1
scipy: 1.10.0
seaborn: 0.12.2
All 11 python packages installed
You are running Perl v b'5.032001'. Now checking perl modules...
Carp: 1.50
Clone: 0.46
DBD::SQLite: 1.72
DBD::mysql: 4.046
DBI: 1.643
DB_File: 1.855
Data::Dumper: 2.183
File::Basename: 2.85
File::Which: 1.24
Getopt::Long: 2.54
Hash::Merge: 0.302
JSON: 4.10
LWP::UserAgent: 6.67
Logger::Simple: 2.0
POSIX: 1.94
Parallel::ForkManager: 2.02
Pod::Usage: 1.69
Scalar::Util::Numeric: 0.40
Storable: 3.15
Text::Soundex: 3.05
Thread::Queue: 3.14
Tie::File: 1.06
URI::Escape: 5.12
YAML: 1.30
threads: 2.25
threads::shared: 1.61
ERROR: local::lib not installed, install with cpanm local::lib
Checking Environmental Variables...
$FUNANNOTATE_DB=/scratch365/cbertine/funanotate/funannotate_db
$PASAHOME=/afs/crc.nd.edu/user/c/cbertine/.conda/envs/funannotate/opt/pasa-2.5.2
$TRINITY_HOME=/afs/crc.nd.edu/user/c/cbertine/.conda/envs/funannotate/opt/trinity-2.8.5
$EVM_HOME=/afs/crc.nd.edu/user/c/cbertine/.conda/envs/funannotate/opt/evidencemodeler-1.1.1
$AUGUSTUS_CONFIG_PATH=/afs/crc.nd.edu/user/c/cbertine/.conda/envs/funannotate/config/
ERROR: GENEMARK_PATH not set. export GENEMARK_PATH=/path/to/dir
Checking external dependencies...
ERROR: pslDnaFiler found but error running: pslCDnaFilter: error while loading shared libraries: libssl.so.1.0.0: cannot open shared object file: No such file or directory
PASA: 2.5.2
CodingQuarry: 2.0
Trinity: 2.8.5
augustus: 3.5.0
bamtools: bamtools 2.5.1
bedtools: bedtools v2.30.0
blat: BLAT v35
diamond: 2.0.15
emapper.py: 2.1.1
ete3: 3.1.2
exonerate: exonerate 2.4.0
fasta: 36.3.8g
glimmerhmm: 3.0.4
gmap: 2021-08-25
hisat2: 2.2.1
hmmscan: HMMER 3.3.2 (Nov 2020)
hmmsearch: HMMER 3.3.2 (Nov 2020)
java: 17.0.3-internal
kallisto: 0.46.1
mafft: v7.515 (2023/Jan/15)
makeblastdb: makeblastdb 2.2.31+
minimap2: 2.24-r1122
pigz: 2.6
proteinortho: 6.1.7
salmon: salmon 0.14.1
samtools: samtools 1.16.1
snap: 2006-07-28
stringtie: 2.2.1
tRNAscan-SE: 2.0.11 (Oct 2022)
tantan: tantan 40
tbl2asn: 25.8
tblastn: tblastn 2.2.31+
trimal: trimAl v1.4.rev15 build[2013-12-17]
trimmomatic: 0.39
ERROR: gmes_petap.pl not installed
ERROR: pslCDnaFilter not installed
ERROR: signalp not installed