Text Mining for Bahasa Malaysia

malaytextr is designed to work with text written in Bahasa Malaysia. It provides functions and data sets that make working with Bahasa Malaysia text easier. For word stemming in particular, the package first looks up each Malay word in a dictionary and then removes any "extra suffix", as explained in Khan, Rehman Ullah, Fitri Suraya Mohamad, Muh Inam UlHaq, Shahren Ahmad Zadi Adruce, Philip Nuli Anding, Sajjad Nawaz Khan, and Abdulrazak Yahya Saleh Al-Hababi (2017) <https://ijrest.net/vol-4-issue-12.html>. The package includes a dictionary of Malay words that can be used to perform this stemming.
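
The two-step idea described above (dictionary lookup first, suffix removal as a fallback) can be illustrated with a minimal sketch. This is not the package's implementation or API: the function name `stem_word`, the tiny dictionary, the suffix list, and the minimum root length are all hypothetical and chosen only for illustration. They are not taken from the cited paper.

```python
# Hypothetical mini dictionary mapping inflected Malay words to root words.
# The real package ships a much larger dictionary; these entries are illustrative.
ROOT_WORDS = {
    "banyaknya": "banyak",
    "makanan": "makan",
    "berlari": "lari",
}

# Assumed suffix list for the fallback step, longest first.
# This is NOT the rule set from the cited paper.
SUFFIXES = ("nya", "kan", "lah", "an")


def stem_word(word, dictionary=ROOT_WORDS, suffixes=SUFFIXES, min_root_len=4):
    """Return a root form: dictionary lookup first, then naive suffix removal."""
    token = word.lower().strip()

    # Step 1: a dictionary hit wins outright.
    if token in dictionary:
        return dictionary[token]

    # Step 2: strip the first matching suffix, but only if enough of the
    # word remains; this guards short words such as "dan" or "makan".
    for suffix in suffixes:
        if token.endswith(suffix) and len(token) - len(suffix) >= min_root_len:
            return token[: -len(suffix)]

    # No rule applied: return the normalized word unchanged.
    return token


if __name__ == "__main__":
    for w in ["Banyaknya", "rumahnya", "bukukan", "dan"]:
        print(w, "->", stem_word(w))
```

Checking the dictionary before applying any suffix rule matters because naive stripping over-stems easily; the length guard here is only a crude safeguard, and a larger dictionary reduces how often the fallback is needed.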




Version 0.1.2 by Zahier Nasrudin, 3 months ago


Report a bug at https://github.com/zahiernasrudin/malaytextr/issues

Browse source code at https://github.com/cran/malaytextr

Authors: Zahier Nasrudin [aut, cre]

Documentation: PDF Manual

License: MIT + file LICENSE

Imports: dplyr, magrittr, rlang, stringr

Suggests: rmarkdown, knitr, testthat

See at CRAN