Skip to content

Latest commit

 

History

History
23 lines (13 loc) · 902 Bytes

File metadata and controls

23 lines (13 loc) · 902 Bytes

Automatically annotate articles with custom dictionary

This is a R package for automatical data annotation with terms in a dictionary. The main function is 'CountTermsInStudies'. It takes 'searchingData' to search from as either data frame/matrix or a link to the data in 'tsv','csv' or 'txt' formate. It also takes another parameter 'dictionary', which could be any dictionary with one name column and one search term (regular expression accepted) column. Please refer to manual for specific header names.

Installation:

install.packages("devtools")
devtools::install_github("camaradesuk/AutoAnnotation")

Pdf to text software

Code calls 'pdftotext' to convert pdf to text.

One may download the Xpdf command line tools at http://www.xpdfreader.com/download.html And add the software path to parameter 'conversionSoftware'.

Stage

The package is at alpha testing stage.