Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bank accounts data from PMJDY website #10

Open
answerquest opened this issue Feb 19, 2018 · 8 comments
Open

Bank accounts data from PMJDY website #10

answerquest opened this issue Feb 19, 2018 · 8 comments
Assignees

Comments

@answerquest
Copy link
Collaborator

Datameet group thread: https://groups.google.com/forum/#!searchin/datameet/pdfs%7Csort:date/datameet/ErNY82gA7dw/mmBUxH5DAgAJ

Site: https://www.pmjdy.gov.in/archive

@dhaneshsabane
Copy link
Collaborator

The website gives you an option to export the data in PDFs. Do we want to export it in that format or use the data on the webpage to create a CSV or JSON file?

@answerquest
Copy link
Collaborator Author

@DHANESH95 we want to get the numbers out in a way that they can be combined across time. I'd pitch for scraping into JSON as the data will likely get hierarchical when we combine it across different dates.

Note: the data seems to be available for each wednesday only. And possibly some dates data may not be available. So the scraper will need to be able to handle that.

@dhaneshsabane
Copy link
Collaborator

@answerquest I was thinking on the same lines. Generating a JSON file becomes highly convenient as it can be converted into any other data format we want. I also noted that the data is available for each Wednesday and I'm confident I can build a scraper for this use case. Do you mind if I get started on this right away? Maybe we can finish it off at the hackathon.

@answerquest
Copy link
Collaborator Author

@DHANESH95 sorry just seeing this now, on the day of the Hackathon 😆

@dhaneshsabane
Copy link
Collaborator

A basic scraper for the website is ready which can generate a CSV file of all the data from the website. You can find the code on this link - https://git.fosscommunity.in/Dhanesh95/pmjdyScraper

@dhaneshsabane
Copy link
Collaborator

@answerquest The work here is not 100% complete and I have a few ideas in mind that I'd like to implement. Please assign this issue to me.

@answerquest
Copy link
Collaborator Author

@DHANESH95 ok done

@answerquest
Copy link
Collaborator Author

Suggestion: can visualize using Highcharts: https://www.highcharts.com/stock/demo

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants