ausmacrodata R package for Australian Macroeconomic Data #1

Open
robjhyndman opened this issue Mar 2, 2017 · 3 comments

@robjhyndman

Recently, I helped set up a new website for scraping Australian macroeconomic data from the ABS and RBA: ausmacrodata.org.

It would be great to have an R package to pull in data from it. Something like the dataseries package which pulls in data from dataseries.org (an analogous site for Swiss data). Another similar package is BETS for Brazilian economic time series.

So I propose we build the ausmacrodata R package. That should be easily achievable in a couple of days, including a vignette and a CRAN submission.

@jonocarroll

+1

Cool. Does that site have an API, or can the one it uses be leveraged?

I proposed something slightly more broad last year (ropensci/auunconf#16) and would be keen to work on something like this. data.gov.au has a working API and presumably some of these datasets can be extracted that way. A vetted list of datasets (as per ausmacrodata.org) could be a tractable goal.

By the looks of it, the ABS links go to their own ABS.Stat pages, which isn't the most helpful. They seem to have a framework set up though, so a project could wrap that up. There's an update schedule for the data at http://www.abs.gov.au/websitedbs/D3310114.nsf/home/absstat+Release+calendar so we could try to keep things up to date.

@robjhyndman
Author

The ausmacrodata series all have a unique ID, and each has a csv file in a consistent format (first column date, second column value). So it is easy to write a small function to return a ts object given an ID. The site only holds time series, and only quarterly, monthly or annual ones, so it is a fairly trivial exercise.
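
A minimal sketch of what such a function might look like, assuming ISO-formatted dates (`YYYY-MM-DD`), ascending order and no missing periods. The names `infer_frequency`, `series_to_ts` and `get_series` are hypothetical, and the URL layout in `get_series` is a placeholder rather than the site's real one, so the caller supplies `base_url`. The values in the example are made up.

```r
# Infer the number of observations per year from the gaps between dates.
# Only annual (1), quarterly (4) and monthly (12) series are handled.
infer_frequency <- function(dates) {
  if (length(dates) < 2) stop("need at least two observations")
  gap_days <- median(as.numeric(diff(dates)))
  if (gap_days > 300) {
    1
  } else if (gap_days > 80) {
    4
  } else if (gap_days > 25) {
    12
  } else {
    stop("unsupported frequency")
  }
}

# Convert a two-column data frame (date, value) into a ts object.
# Assumes ISO dates, sorted ascending, with no gaps in the series.
series_to_ts <- function(df) {
  dates <- as.Date(df[[1]])
  values <- as.numeric(df[[2]])
  freq <- infer_frequency(dates)
  year <- as.integer(format(dates[1], "%Y"))
  month <- as.integer(format(dates[1], "%m"))
  # Position of the first observation within its year (1 for annual data).
  period <- if (freq == 1) 1 else (month - 1) %/% (12 / freq) + 1
  ts(values, start = c(year, period), frequency = freq)
}

# Hypothetical downloader: "<base_url>/<id>.csv" is an assumed layout,
# not the documented one.
get_series <- function(id, base_url) {
  df <- read.csv(paste0(base_url, "/", id, ".csv"), stringsAsFactors = FALSE)
  series_to_ts(df)
}

# Made-up quarterly data to exercise the conversion without a download.
example <- data.frame(
  date = c("2016-03-01", "2016-06-01", "2016-09-01", "2016-12-01"),
  value = c(1.2, 1.4, 1.3, 1.5)
)
x <- series_to_ts(example)
frequency(x)  # 4
```

Inferring the frequency from the median gap between dates keeps the function independent of any metadata; if the site exposes the frequency alongside the ID, reading it directly would be more robust.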

The main work is done in the back end of ausmacrodata, which converts all the ABS/RBA data into a consistent format. There is a daily cron job which scans for updates, so ausmacrodata is always within a day of being up-to-date. As you probably know, the ABS and RBA use xls files (ugh) which are hopelessly inconsistent, sometimes changing between releases for the same series. So working with their data directly is pretty difficult -- hence ausmacrodata.

We could expand the scope somewhat by including (some of?) data.gov.au as well.

@jonocarroll

Emphasis on the "some of" -- I've seen some trainwreck data sets on there. Hence the notion of selected, curated sets is great.
