Packages by Jan Wijffels

BTM — 0.3.1

Biterm Topic Models for Short Text

ETLUtils — 1.4.1

Utility Functions to Execute Standard Extract/Transform/Load Operations (using Package 'ff') on Large Data

Myrrix — 1.2

Interface to Myrrix. Myrrix is a Complete, Real-Time, Scalable Clustering and Recommender System, Evolved from Apache Mahout

Myrrixjars — 1.0-2

R/Myrrix Interface Jars

RMOA — 1.0.1

Connect R with MOA for Massive Online Analysis

RMOAjars — 1.0.1

External jars Required for Package RMOA

crfsuite — 0.3.2

Conditional Random Fields for Labelling Sequential Data in Natural Language Processing

cronR — 0.4.0

Schedule R Scripts and Processes with the 'cron' Job Scheduler

dlib — 1.0.3

Allow Access to the 'Dlib' C++ Library

ruimtehol — 0.2.3

Learn Text 'Embeddings' with 'Starspace'

spark.sas7bdat — 1.2

Read in 'SAS' Data ('.sas7bdat' Files) into 'Apache Spark'

taskscheduleR — 1.4

Schedule R Scripts and Processes with the Windows Task Scheduler

textplot — 0.1.2

Text Plots

textrank — 0.3.0

Summarize Text by Ranking Sentences and Finding Keywords

tokenizers.bpe — 0.1.0

Byte Pair Encoding Text Tokenization

udpipe — 0.8.3

Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit