Access USPTO Bulk Data in Tidy Rectangular Format

Converts TXT and XML data curated by the United States Patent and Trademark Office (USPTO). Allows conversion of bulk data after downloading directly from the USPTO bulk data website, eliminating need for users to wrangle multiple data formats to get large patent databases in tidy, rectangular format. Data details can be found on the USPTO website <>. Currently, all 3 formats: 1. TXT data (1976-2001); 2. XML format 1 data (2002-2004); and 3. XML format 2 data (2005-current) can be converted to rectangular, CSV format. Relevant literature that uses data from USPTO includes Wada (2020) and Plaza & Albert (2008) .


Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


0.1.4 by Raoul Wadhwa, 4 months ago

Report a bug at

Browse source code at

Authors: Raoul Wadhwa [aut, cre] , James Yu [aut] , Hayley Beltz [aut] , Milind Desai [aut] , Jacob Scott [aut] , Peter Erdi [aut]

Documentation:   PDF Manual  

MIT + file LICENSE license

Imports Rcpp, utils, lubridate, magrittr, dplyr, rlang, xml2, progress

Suggests testthat, covr, knitr, readr, rmarkdown, tibble

Linking to Rcpp

See at CRAN