Packages by Jan Wijffels

BTM — 0.3.7

Biterm Topic Models for Short Text

ETLUtils — 1.5

Utility Functions to Execute Standard Extract/Transform/Load Operations (using Package 'ff') on Large Data

RMOA — 1.1.0

Connect R with MOA for Massive Online Analysis

RMOAjars — 1.2.0

External jars Required for Package RMOA

crfsuite — 0.4.2

Conditional Random Fields for Labelling Sequential Data in Natural Language Processing

cronR — 0.6.5

Schedule R Scripts and Processes with the 'cron' Job Scheduler

dlib — 1.0.3.1

Allow Access to the 'Dlib' C++ Library

doc2vec — 0.2.0

Distributed Representations of Sentences, Documents and Topics

image.CannyEdges — 0.1.1

Implementation of the Canny Edge Detector for Images

image.ContourDetector — 0.1.1

Implementation of the Unsupervised Smooth Contour Line Detection for Images

image.CornerDetectionF9 — 0.1.0

Find Corners in Digital Images with FAST-9

image.CornerDetectionHarris — 0.1.2

Implementation of the Harris Corner Detection for Images

image.LineSegmentDetector — 0.1.0

Detect Line Segments in Images

image.Otsu — 0.1

Otsu's Image Segmentation Method

image.binarization — 0.1.3

Binarize Images for Enhancing Optical Character Recognition

image.dlib — 0.1.1

Image Processing Functionality using the 'dlib' Package

image.libfacedetection — 0.1

Convolutional Neural Network for Face Detection

image.textlinedetector — 0.2.3

Segment Images in Text Lines and Words

nametagger — 0.1.3

Named Entity Recognition in Texts using 'NameTag'

recogito — 0.2.1

Interactive Annotation of Text and Images

ruimtehol — 0.3.2

Learn Text 'Embeddings' with 'Starspace'

sentencepiece — 0.2.3

Text Tokenization using Byte Pair Encoding and Unigram Modelling

spark.sas7bdat — 1.4

Read in 'SAS' Data ('.sas7bdat' Files) into 'Apache Spark'

taskscheduleR — 1.8

Schedule R Scripts and Processes with the Windows Task Scheduler

text.alignment — 0.1.4

Text Alignment with Smith-Waterman

textplot — 0.2.2

Text Plots

textrank — 0.3.1

Summarize Text by Ranking Sentences and Finding Keywords

tokenizers.bpe — 0.1.3

Byte Pair Encoding Text Tokenization

topicmodels.etm — 0.1.0

Topic Modelling in Embedding Spaces

udpipe — 0.8.11

Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit

word2vec — 0.4.0

Distributed Representations of Words