Access USPTO Bulk Data in Tidy Rectangular Format

Converts TXT and XML data curated by the United States Patent and Trademark Office (USPTO). Allows conversion of bulk data after downloading directly from the USPTO bulk data website, eliminating need for users to wrangle multiple data formats to get large patent databases in tidy, rectangular format. Data details can be found on the USPTO website <>. Currently, only TXT data (1976-2001) conversion is implemented; XML formats are in the process of being added. Relevant literature that uses data from USPTO includes Wada (2020) and Plaza & Albert (2008) .


Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


0.1.0 by Raoul Wadhwa, 4 months ago

Report a bug at

Browse source code at

Authors: Raoul Wadhwa [aut, cre] , James Yu [aut] , Peter Erdi [aut]

Documentation:   PDF Manual  

MIT + file LICENSE license

Imports Rcpp, data.table, utils, lubridate, magrittr, dplyr, rlang

Suggests testthat

Linking to Rcpp

See at CRAN