Text Mining for Bahasa Malaysia

It is built to handle Bahasa Malaysia text. We provide functions and data sets that will help handling Bahasa Malaysia to be much easier. For word stemming in particular, we will find the Malay words in a dictionary and then proceed to remove "extra suffix" as explained in Khan, Rehman Ullah, Fitri Suraya Mohamad, Muh Inam UlHaq, Shahren Ahmad Zadi Adruce, Philip Nuli Anding, Sajjad Nawaz Khan, and Abdulrazak Yahya Saleh Al-Hababi (2017) < https://ijrest.net/vol-4-issue-12.html> . A dictionary of Malay words provided in this package can be used as a dictionary to perform word stemming.


Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


0.1.1 by Zahier Nasrudin, a month ago


Report a bug at https://github.com/zahiernasrudin/malaytextr/issues

Browse source code at https://github.com/cran/malaytextr

Authors: Zahier Nasrudin [aut, cre]

Documentation:   PDF Manual  

MIT + file LICENSE license

Imports dplyr, magrittr, rlang, stringr

Suggests rmarkdown, knitr, testthat

See at CRAN