- hard-deprecate functions with old naming scheme (those not starting with
sp_
); they will be removed in the next release - add
isektor
andnace
as available codelists - minor fix to messages
sp_get_table()
can now handle CSV files with non-standard header rows in the format '"Výkaz"ZC_VYKAZ:ZC_VYKAZ"', fixes #94- updated documentation to avoid CRAN notes
- switched messaging to {cli} package, dropping the {usethis} dependency
- improve naming of output columns (additional column code translated into readable scheme)
- fixed a bug in
sp_load_table()
which caused thevykaz_date
column in its output to be allNA
- loading data on state funds now works, IDs and details are newly listed in
sp_tables
sp_get_dataset()
no longer unzips the downloaded archive. It therefore returns the path to the downloaded zip file, not to the unzipped files. This is needed to support a more modular workflow (see below andvignette("workflow", package = "statnipokladna")
).- The
year
param insp_get_table()
andsp_get_dataset()
now has no default. This is more sensible when there is no easy way to determine the latest available table/dataset and in any case better for reproducibility. The newsp_get_dataset_url()
also has this updated behaviour. - the columns signifying the time period of the result of
sp_get_table()
have been renamed for more clarity and consistency:period_vykaz
tovykaz_date
per_m
tovykaz_month
per_yr
tovykaz_year
sp_add_codelist()
no longer creates messy column names in the form of[codelist name]_nazev_nazev
- the core functions have been rewritten into a more modular architecture and their constituent modules exported. This allows more fine-grained control over workflows using lower-level functions to accommodate caching and reproducibility e.g. via {targets} or {drake}.
- sp_get_[dataset|table|codelist] are now effectively wrappers around several lower-level functions
- those previously using these core functions should see no change except for one breaking change in
sp_get_dataset()
(see above). - the lower-level functions enable step-by-step workflows with transparency of intermediate steps (URLs, downloaded ZIP archives, pointers to specific CSV files, etc.) See
vignette("workflow", package = "statnipokladna")
- better error messages around mismatches between table ID and file in archive
- more informative errors when online files are not available
- improve date parsing in
sp_get_codelist()
to handle inconsistent date formats in some codelists - faster loading of tables from previously downloaded and unzipped datasets
- empty strings in codelists are now returned as NA
- more columns are now properly named in the output so they can be linked to codelists, incl. columns related to programme spend and rifngfenced ('purpose'-marked) spend
rozprog
(programme spend ID) as well asnastroj
andnastrojanal
codelists are now recognised
- when setting
dest_dir
insp_get_*()
functions, files are now put into the right directory even ifdest_dir
does not contain a trailing slash sp_get_dataset_doc()
now createsdest_dir
if it does not exist, as advertised in the documentation- upgrade dplyr dependency to aviod mysterious "unused argument" error in codelist functions
- balance sheets of city districts can now be loaded for all years where they are available
- examples are now safe to run on CRAN
- directory for downloading raw data and documentation in all
sp_get_*()
functions can now be set in the optionstatnipokladna.dest_dir
. Set this at the beginning of your script or once per session, or in your.Rprofile
(e.g.setOption(statnipokladna.dest_dir = "~/sp_data")
for cross-session storage orsetOption(statnipokladna.dest_dir = "sp_data")
for storing all data downloaded by statnipokladna in ansp_data
subfolder of the current project.) - downloaded data is no longer nested in a
statnipokladna
directory insidedest_dir
. - minor improvements and updates in vignettes.
- properly included the tibble package to ensure correct print methods for output tibbles
sp_get_dataset()
andsp_get_dataset_doc()
now return their outputs invisibly and provide better messages
- adapt
sp_get_table()
to a new structure of some data dumps on the part of the data provider sp_get_codelist()
no longer issues a tibble-related warningsp_get_codelist()
now parses all dates correctly
Functions with names not starting with sp_
are deprecated and emit a warning. Use equivalent functions named sp_*
.
- adapt to new URL scheme for data exports at statnipokladna.cz, fixes #74
- simple codelists with no
*_date
columns now work insp_add_codelist()
, fixes #66
sp_add_codelist()
returns a message if joining by multiple columnssp_add_codelist()
gains aby
parameter
- fixed all outdated links to statnipokladna.cz
- nicer vignette index entries
- vignette and functions now document working with multiple time periods, which differs between datasets/tables
- several clarifications on how to use codelists
- CRAN fix: replace \dontrun{} with \donttest{} and T/F with TRUE/FALSE
- updated examples so they do not use deprecated functions
- fixed bug in sp_get_table() - missing
usethis::
- changed default in
sp_get_dataset()
- update examples so nothing breaks
- fixed bug in sp_add_codelist introduced when fixing CRAN note
- fix last URL in README by making it absolute
- added URL into Description field in DESCRIPTION
- fixed URLs in README
- There is now a basic Get Started vignette, which is complementary to the README and contains basic background to the data
- the original Czech data vignette was renamed to "how-the-data-works-cz.Rmd/html" and the data gotchas part was hived off into "data-issues-cz.Rmd/html"
- deprecation warnings now show correct package name
- added informative error messages when internet is unavailable
sp_get_dataset_doc()
now checks that dataset exists before pointing/downloading.- updated README with a bit more detail
- fixed dplyr-related NOTE on unavailable objects
- added CRAN notes
- updated LICENSE to work for CRAN
- added cran-comments.md
- all exported functions renamed to
sp_*
to avoid conflicts with other packages and for better discoverability via autocomplete; original functions are soft-deprecated.
get_table()
now provides access to changes-in-equity data (přehled o změnách vlastního kapitálu, use"changes-in-equity"
table id) and cash flow statements (přehled o peněžních tocích, use"cash-flow"
id)
- all functions downloading data now have
dest_dir
parameter which allows you to store the downloaded files anywhere, not just in temp dir; this in effect enables cross-session storage and avoids unnecessary redownloads
- adapt month parameter check to upgrade in the published data which now contains monthly releases for some reports
- argument
force_redownload
inget_[table|dataset|codelist]()
renamed toredownload
- improved UI: messaging and guidance
- added examples to documentation
get_table()
now fully documents the columns in its output, seehelp("get_table")
- fixed bug where
get_codelist()
did not create the right temp directory
- value columns (previously starting with ZU_) now renamed to human-readable English names
- all key tables, except state funds (FINSF) now included in sp_tables and thus available in
get_table()
- improved documentation - added details to explain what each function assumes and does
- updated README
- fixed issue so package now also handles pre-2014 data
- minor improvement to messages
- speed improvement to data processing by
get_table()
whenico
is set - codelists downloaded by
*_codelist()
are now stored in tempdir
- the
ico
parameter inget_table()
now directly follows the key parameters; this will break cases where the function was called with unnamed parameters but will make use more straighforward. - the
table_id
parameter inget_table()
is now a human-readable alphanumeric code, not a meaningless number. This will break all uses of get_table but make future use more sensible.
- new
add_codelist()
function for joining codelists to core financial data get_codelist()
gainsn
parameter, allowing only a limited number of rows of a codelist to be retrieved (to save time)get_codelist()
has a new paramater which opens the online codelist viewer- filtering using
ico
parameter inget_table()
now works
- updated table of tables (now covers all key tables, includes notes)
get_codelist()
now returns a codelist df ready to join to core data
- handle transformation of consolidation columns in
get_codelist()
- bug fix in get_table and get_codelist: corrected logic around druhove trideni (polozka codelist)
- clearer README
- correction in vignette regarding consolidation
- improved documentation
- updated README
- new
get_dataset()
function for retrieving files - new
get_table()
function for loading data - only experimental - new
get_dataset_doc()
function for accessing/downloading official documentation for each dataset - exported data frames with descriptions of available tables, datasets and codelists
- minor updates to data vignette
- added feedback info to data vignette
- fixed typos and testing infrastructure
- exported
get_codelist()
- added vignette explaining the underlying open data
- Added a
NEWS.md
file to track changes to the package.