Tools to download data from the Eurostat database <http://ec.europa.eu/eurostat> together with search and manipulation utilities.
R tools to access open data from Eurostat. Data search, download, manipulation and visualization.
See the package homepage for more.
Contributions are very welcome.
Kindly cite this work as follows: Leo Lahti, Przemyslaw Biecek, Markus Kainu and Janne Huovari. Retrieval and analysis of Eurostat open data with the eurostat package. R Journal 9(1):385-392, 2017. R package version 3.3.1. URL: http://ropengov.github.io/eurostat
We are grateful to all contributors, including Joona Lehtomäki, Francois Briatte, and Oliver Reiter, and to the Eurostat open data portal! This project is part of rOpenGov.
This package is in no way officially related to or endorsed by Eurostat.
label_eurostat() has new countrycode and countrycode_nomatch arguments to label with the countrycode package, and a custom_dic argument to add a custom dictionary.
Vignette updated
Added a new example data set to reduce repeated downloads from the Eurostat service
label_eurostat() now always gives an error by default if labelling introduces duplicated labels. A new fix_duplicated argument has been added to fix duplicated labels automatically. (#79, #90)
Shrank the package tarball size
Modified tutorial to accommodate the CRAN error
Fixed cut_to_classes to generate unique breaks
search_eurostat() accepts a new argument fixed: if TRUE (default), the pattern provided will be used as is; if FALSE, pattern will be interpreted as a true regex pattern.
get_eurostat() returns tibbles (#52)
get_eurostat_dic() and get_eurostat_toc() return tibbles
read_tsv() is used instead of read.csv() (#29)
harmonize_country_code() converts the raw data values from EL to GR and from UK to GB.
get_eurostat() can now also get data from the Eurostat JSON API via get_eurostat_json(). It also has a new argument type to select labels for variable values instead of codes.
tidyr 0.4.0 (#47).
select_time argument for get_eurostat() to select a time frequency in case of multi-frequency datasets. get_eurostat() now also gives an error if you try to get multi-frequency data with time formats other than time_format = "raw". (#30) The time column is also now in ascending order.
get_eurostat() gets a new argument compress_file to control compression of the cache file. Cache filenames now also include all relevant arguments. (#28)
search_eurostat() has a new type option type = "all" to search all types.
label_eurostat() has new arguments: code, to also retain codes for specified columns, and eu_order, to order factor levels in Eurostat order, which uses the new function dic_order().
label_eurostat_vars(x) gives labels for names if x is anything other than a character or a factor, and label_eurostat_tables(x) accepts only a character or a factor.
get_eurostat() has a new argument stringsAsFactors to control the factor conversion of variables.
eurotime2date (and get_eurostat) now also converts daily data.
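
As a rough illustration of the fixed argument of search_eurostat() noted in this changelog, the sketch below contrasts literal and regex matching using base R's grepl() rather than the package itself. The titles vector and the match_titles() helper are hypothetical and made up for illustration; the package's own implementation may differ.

```r
# Hypothetical dataset titles, made up for illustration only.
titles <- c(
  "Population on 1 January by age and sex",
  "GDP and main components (output, expenditure and income)",
  "Employment (thousand persons) by sex"
)

# Hypothetical helper that mimics the documented meaning of `fixed`.
# It is an assumption for this sketch, not the package's code.
match_titles <- function(pattern, x, fixed = TRUE) {
  x[grepl(pattern, x, fixed = fixed)]
}

# fixed = TRUE: the pattern is taken as is, so "(" is an ordinary character.
literal_hits <- match_titles("(output", titles)

# fixed = FALSE: the pattern is interpreted as a regular expression.
regex_hits <- match_titles("^(GDP|Employment)", titles, fixed = FALSE)

print(literal_hits)
print(regex_hits)
```

With the default fixed = TRUE, special characters such as parentheses need no escaping, which suits plain keyword searches; fixed = FALSE is the option to reach for when anchors or alternation are needed.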