cleanNLP-package | cleanNLP: A Tidy Data Model for Natural Language Processing |
annotate | Run the annotation pipeline on a set of documents |
cleanNLP | cleanNLP: A Tidy Data Model for Natural Language Processing |
combine_annotators | Combine a set of annotations |
dep_frequency | Universal Dependency Frequencies |
doc_id_reset | Reset document ids |
download_clean_nlp | Download java files needed for cleanNLP |
get_coreference | Access coreferences from an annotation object |
get_dependency | Access dependencies from an annotation object |
get_document | Access document meta data from an annotation object |
get_entity | Access named entities from an annotation object |
get_sentiment | Access sentiment scores from an annotation object |
get_token | Access tokens from an annotation object |
get_triple | Access triples from an annotation object |
init_clean_nlp | Initialize the cleanNLP java object |
obama | Annotation of Barack Obama's State of the Union Addresses |
pos_frequency | Universal Part of Speech Code Frequencies |
print.annotation | Print a summary of an annotation object |
read_annotation | Read annotation files from disk |
set_language | Easy interface for setting up the pipeline |
set_properties | Set properties for the coreNLP pipeline |
word_frequency | Most frequent English words |
write_annotation | Write annotation files to disk |