Securely Wrangle Dataset According to Data Usage Agreement

Create shareable data sets from raw data files that contain protected elements. Relying on master crosswalk files that list restricted variables, package functions warn users about possible violations of data usage agreement and prevent writing protected elements.


duawranglr 0.6.2

Bug fix

  • Appending crosswalk, deid_dua() no longer rewrites column names

duawranglr 0.6.0


  • When reading in existing crosswalk, only append crosswalk file if new IDs rather than rewriting the entire file

duawranglr 0.5.1

Bug fix

  • fixed bug that dropped existing IDs from crosswalk that weren't also in current data frame being deidentified

duawranglr 0.5.0


  • turn off readr::read_delim() progress bar from when reading files
  • converted default path in functions to tempdir() rather than '.'
  • update dua_env when reading in new file so that it has to be rechecked with every new file

Bug fixes

  • deid_dua() only makes IDs for unique old IDs, which was import to set since salt is added to the old ID before making new hashed IDs

duawranglr 0.3.0


  • added read_dua_file() that reads in admin files and sets check_pass to FALSE whenever a new file is read in as added level of security

duawranglr 0.2.0


  • deid_dua() now can use existing crosswalk file, which is good for building panel data sets. (fixes #6)
  • update to page link in gh-pages information

Bug fixes

  • fixed error where write_dua_df() would still write the file even if not passing data set check

duawranglr 0.1.0

  • initial release

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


0.6.7 by Benjamin Skinner, 7 months ago

Report a bug at

Browse source code at

Authors: Benjamin Skinner [aut, cre]

Documentation:   PDF Manual  

MIT + file LICENSE license

Imports haven, readxl, readr, digest, dplyr

Suggests testthat, knitr, rmarkdown

See at CRAN