To extract the gene name using API in PubTator:
- PubTator_read.py is reading gene name from "Export Annotations".
- PubTator_submit.py is transforme pdf into json,then submit json to API.
- PubTator_retrieve.py is retrieve what has handled json by a session number.
- pdf_file was used storage your upload pdf.
- run it in command line
python PubTator_retrieve.py
- Input your need format, type, identifiers in order
- then gene name output in the gene_name.csv
- Taking your all pdf into pdf_file" (this step is important)
- run it in command line
python PubTator_submit.py
- All successful submit was storage into SessionNumber.txt
- run it in command line
python PubTator_retrieve.py
- All gene name was taked into the gene_name.csv