quanteda - Quantitative Analysis of Textual Data
A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and n-grams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.
Last updated 10 days ago
corpus natural-language-processing quanteda text-analytics
16.74 score 843 stars 48 packages 4.9k scripts 23k downloads
readtext - Import and Handling for Plain and Formatted Text Files
Functions for importing and handling plain and formatted text files with additional metadata, including '.csv', '.tab', '.json', '.xml', '.html', '.pdf', '.doc', '.docx', '.rtf', '.xls', '.xlsx', and others.
Last updated 9 months ago
encoding quanteda text
10.71 score 120 stars 3 packages 1.2k scripts 4.9k downloads
spacyr - Wrapper to the 'spaCy' 'NLP' Library
An R wrapper to the 'Python' 'spaCy' 'NLP' library, from <https://spacy.io>.
Last updated 6 months ago
extract-entities nlp spacy speech-tagging
10.63 score 251 stars 6 packages 390 scripts 2.3k downloads
stopwords - Multilingual Stopword Lists
Provides multiple sources of stopwords, for use in text analysis and natural language processing.
Last updated 3 years ago
text-analysis
10.50 score 113 stars 60 packages 1.1k scripts 14k downloads
quanteda.textstats - Textual Statistics for the Quantitative Analysis of Textual Data
Textual statistics functions, formerly in the 'quanteda' package, for characterizing and comparing textual data. Includes functions for measuring term and document frequency, the co-occurrence of words, similarity and distance between features and documents, feature entropy, keyword occurrence, readability, and lexical diversity. These functions extend the 'quanteda' package and are specially designed for sparse textual data.
Last updated 3 months ago
9.10 score 14 stars 8 packages 904 scripts 4.1k downloads
quanteda.textplots - Plots for the Quantitative Analysis of Textual Data
Plotting functions for visualising textual data. Extends 'quanteda' and related packages with plot methods designed specifically for text data, textual statistics, and models fit to textual data. Plot types include word clouds, lexical dispersion plots, scaling plots, network visualisations, and word 'keyness' plots.
Last updated 3 months ago
7.10 score 6 stars 668 scripts 3.1k downloads
nsyllable - Count Syllables in Character Vectors
Counts syllables in character vectors for English words. Imputes syllables as the number of vowel sequences for words not found.
Last updated 3 years ago
5.50 score 9 stars 9 packages 9 scripts 2.6k downloads