METACRAN search results

hacksaw — by David Ranzolin, 4 years ago

Additional Tools for Splitting and Cleaning Data

Move between data frames and lists more efficiently with precision splitting via 'dplyr' verbs. Easily cast variables to different data types. Keep rows with NAs. Shift row values.

fossilbrush — by Joe Flannery-Sutherland, 9 days ago

Automated Cleaning of Fossil Occurrence Data

Functions to automate the detection and resolution of taxonomic and stratigraphic errors in fossil occurrence datasets. Functions were developed using data from the Paleobiology Database.

https://cran.r-project.org/package=fossilbrush

messy — by Nicola Rennie, 6 months ago

Create Messy Data from Clean Data Frames

For the purposes of teaching, it is often desirable to show examples of working with messy data and how to clean it. This R package creates messy data from clean, tidy data frames so that students have a clean example to work towards.

https://nrennie.rbind.io/messy/, https://github.com/nrennie/messy

DataCombine — by Christopher Gandrud, 9 years ago

Tools for Easily Combining and Cleaning Data Sets

Tools for combining and cleaning data sets, particularly with grouped and time series data.

http://CRAN.R-project.org/package=DataCombine

SemNetCleaner — by Alexander P. Christensen, a month ago

An Automated Cleaning Tool for Semantic and Linguistic Data

Implements several functions that automate the cleaning and spell-checking of text data. Also converges, finalizes, removes plurals and continuous strings, and puts text data in binary format for semantic network analysis. Uses the 'SemNetDictionaries' package to make the cleaning process more accurate, efficient, and reproducible.

https://github.com/AlexChristensen/SemNetCleaner

cgmanalysis — by Tim Vigers, 4 days ago

Clean and Analyze Continuous Glucose Monitor Data

This code provides several different functions for cleaning and analyzing continuous glucose monitor data. Currently it works with 'Dexcom', 'iPro 2', 'Diasend', 'Libre', or 'Carelink' data. The cleandata() function takes a directory of CGM data files and prepares them for analysis. cgmvariables() iterates through a directory of cleaned CGM data files and produces a single spreadsheet with data for each file in either rows or columns. The column format of this spreadsheet is compatible with REDCap data upload. cgmreport() also iterates through a directory of cleaned data, and produces PDFs of individual and aggregate AGP plots. Please visit <https://github.com/childhealthbiostatscore/R-Packages/> to download the new-user guide.

lab2clean — by Ahmed Zayed, 9 months ago

Automation and Standardization of Cleaning Clinical Lab Data

Navigating the shift of clinical laboratory data from primary everyday clinical use to secondary research purposes presents a significant challenge. Given the substantial time and expertise required for lab data pre-processing and cleaning and the lack of all-in-one tools tailored for this need, we developed our algorithm 'lab2clean' as an open-source R package. The 'lab2clean' package automates and standardizes the intricate process of cleaning clinical laboratory results. With a keen focus on improving the data quality of laboratory result values, our goal is to equip researchers with a straightforward, plug-and-play tool, making it smoother for them to unlock the true potential of clinical laboratory data in clinical research and clinical machine learning (ML) model development. Version 1.0 of the algorithm is described in detail in 'Zayed et al. (2024)'.

worldfootballR — by Jason Zivkovic, 3 years ago

Extract and Clean World Football (Soccer) Data

Allows users to obtain clean and tidy football (soccer) game, team and player data. Data is collected from a number of popular sites, including 'FBref', transfer and valuations data from 'Transfermarkt' <https://www.transfermarkt.com/> and shooting location and other match stats data from 'Understat' <https://understat.com/> and 'fotmob' <https://www.fotmob.com/>. It gives users the ability to access data more efficiently, rather than having to export data tables to files before being able to complete their analysis.

https://github.com/JaseZiv/worldfootballR

SwimmeR — by Greg Pilgrim, 2 years ago

Data Import, Cleaning, and Conversions for Swimming Results

The goal of the 'SwimmeR' package is to provide means of acquiring, and then analyzing, data from swimming (and diving) competitions. To that end 'SwimmeR' allows results to be read in from '.html' sources, like 'Hy-Tek' real time results pages, '.pdf' files, 'ISL' results, 'Omega' results, and (on a development basis) '.hy3' files. Once read in, 'SwimmeR' can convert swimming times (performances) between the computationally useful format of seconds reported to the '100ths' place (e.g. 95.37), and the conventional reporting format (1:35.37) used in the swimming community. 'SwimmeR' can also score meets in a variety of formats with user defined point values, convert times between courses ('LCM', 'SCM', 'SCY') and draw single elimination brackets, as well as providing a suite of tools for cleaning swimming data. This is a developmental package, not yet mature.
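
The conversion between the two time formats mentioned above (95.37 and 1:35.37) is plain arithmetic, and can be sketched as follows. This is an illustrative Python sketch only, not SwimmeR's own R code or API; the function names `seconds_to_clock` and `clock_to_seconds` are hypothetical, and the sketch assumes times of the form `m:ss.hh` or `ss.hh` with no hours field.

```python
def seconds_to_clock(seconds: float) -> str:
    """Format a time in seconds as m:ss.hh (or ss.hh when under a minute)."""
    # Work in whole hundredths to avoid floating-point drift.
    hundredths = round(seconds * 100)
    minutes, rem = divmod(hundredths, 6000)
    secs, hund = divmod(rem, 100)
    if minutes:
        return f"{minutes}:{secs:02d}.{hund:02d}"
    return f"{secs}.{hund:02d}"


def clock_to_seconds(clock: str) -> float:
    """Parse m:ss.hh (or ss.hh) into seconds, to the hundredths place."""
    # rpartition yields an empty separator when there is no minutes field.
    minutes, sep, rest = clock.rpartition(":")
    hundredths = (int(minutes) * 6000 if sep else 0) + round(float(rest) * 100)
    return hundredths / 100


print(seconds_to_clock(95.37))      # 1:35.37
print(clock_to_seconds("1:35.37"))  # 95.37
```

Doing the arithmetic in integer hundredths keeps round trips exact for times reported to the hundredths place.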

exceldata — by Lisa Avery, 2 years ago

Streamline Data Import, Cleaning and Recoding from 'Excel'

A small group of functions to read in a data dictionary and the corresponding data table from 'Excel' and to automate the cleaning, re-coding and creation of simple calculated variables. This package was designed to be a companion to the macro-enabled 'Excel' template available on the GitHub site, but works with any similarly-formatted 'Excel' data.
