Tools for Cleaning Rectangular Data
A dependency-free collection of simple functions for cleaning rectangular data. This package lets you detect, count, and replace values, or discard rows/columns, using a predicate function. In addition, it provides tools to check conditions and return informative error messages.
Survey Data Cleaning, Weighting and Analysis
Provides utilities for cleaning survey data, computing weights, and performing descriptive statistical analysis. Methods follow Lohr (2019, ISBN:978-0367272454) "Sampling: Design and Analysis" and Lumley (2010).
Clean Data Frames
Provides a friendly interface for modifying data frames with a sequence of piped commands built upon the 'tidyverse' (Wickham et al., 2019).
'LLM'-Assisted Data Cleaning with Multi-Provider Support
Detects and suggests fixes for semantic inconsistencies in data
frames by calling large language models (LLMs) through a unified,
provider-agnostic interface. Supported providers include 'OpenAI'
('GPT-4o', 'GPT-4o-mini'), 'Anthropic' ('Claude'), 'Google' ('Gemini'),
'Groq' (free-tier 'LLaMA' and 'Mixtral'), and local 'Ollama' models.
The package identifies issues that rule-based tools cannot detect:
abbreviation variants, typographic errors, case inconsistencies, and
malformed values. Results are returned as tidy data frames with column,
row index, detected value, issue type, suggested fix, and confidence
score. An offline fallback using statistical and fuzzy-matching methods
is provided for use without any API key. Interactive fix application
with human review is supported via 'apply_fixes()'. Methods follow
de Jonge and van der Loo (2013)
<https://cran.r-project.org/doc/contrib/de_Jonge+van_der_Loo-Introduction_to_data_cleaning_with_R.pdf>
and Chaudhuri et al. (2003).
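A minimal R sketch of the workflow this entry describes. Only `apply_fixes()` and the tidy result columns come from the description above; the detection function name, its arguments, and the sample data are assumptions for illustration, not the package's actual API:

```r
# Hypothetical sketch: every name except apply_fixes() is assumed,
# not taken from the package's documented interface.
df <- data.frame(
  country = c("USA", "U.S.A.", "Untied States", "usa"),
  stringsAsFactors = FALSE
)

# Detect semantic inconsistencies; detect_issues() is a placeholder name.
# Per the description, an offline fuzzy-matching fallback runs when no
# API key is configured.
issues <- detect_issues(df, provider = "ollama")

# The description says results arrive as a tidy data frame with columns:
# column, row index, detected value, issue type, suggested fix, confidence.
print(issues)

# Interactively review and apply the suggested fixes (named in the entry).
df_clean <- apply_fixes(df, issues)
```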
A User-Friendly Biodiversity Data Cleaning App for the Inexperienced R User
Provides features to manage the complete workflow for biodiversity data cleaning: uploading data, gathering input from users (in order to adjust cleaning procedures), cleaning data, and finally generating various reports and several versions of the data. Facilitates user-level data cleaning, designed for the inexperienced R user. T. Gueta et al. (2018).
Weighted Correlation Network Analysis
Functions necessary to perform Weighted Correlation Network Analysis on high-dimensional data, as originally described in Zhang and Horvath (2005).
Clean and Standardize Epidemiological Data
A package for cleaning and standardizing tabular data, tailored specifically for curating epidemiological data. It streamlines the data cleaning tasks typically expected when working with epidemiological datasets, returns the processed data in the same format, and generates a comprehensive report detailing the outcomes of each cleaning task.
Cleaning Text Data with an AI Assistant
Provides functions to clean and standardize messy data, including textual categories and free-text addresses, using Large Language Models. The package corrects typos, expands abbreviations, and maps inconsistent entries to standardized values. Ideal for bioinformatics, business, and general data cleaning tasks.
Turn Clean Data into Messy Data
Take real or simulated data and salt it with errors commonly found in the wild, such as pseudo-OCR errors, Unicode problems, numeric fields with nonsensical punctuation, bad dates, etc.
Wrapper Functions Collection Used in Data Pipelines
The goal of this package is to provide wrapper functions for data cleaning and cleansing processes. These functions help with messaging and user interaction, keep track of information in pipelines, and assist in the wrangling, munging, assessment, and visualization of data-frame-like material.