-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
pakistan_ppra_*: 503 (2023-05) Parse HTML listings? #1014
Comments
Note that if you go to https://www.ppra.org.pk/ and then click: There is a list of tenders in OCDS format: But the "download all" button doesn't work. We could scrape https://www.ppra.org.pk/opendata.asp?PageNo=1 to get the list of links to download, e.g https://www.ppra.org.pk/ocds.asp?id=523047 |
Looks like they have 87 pages currently: https://www.ppra.org.pk/opendata.asp?PageNo=87
They don't really pass this one, as we mean a single HTML page listing (with links to bulk downloads). If we think there's value, however, we can add it. |
@yolile Should we remove Pakistan? Scraping links from individual HTML pages seems to be the only way (https://www.ppra.org.pk/opendata.asp?PageNo=1) but that doesn't meet our minimum criteria for inclusion in Collect.
|
Sounds good. @allakulov, could you inform Carey about this so that we can decide whether to reach out to Pakistan and try to make them fix this? |
I have informed Carey and we are following up with PPRA. I'll keep you posted. |
@allakulov Any news? |
https://www.ppra.org.pk/api/index.php/api/release,
https://www.ppra.org.pk/api/index.php/api/records and
https://www.ppra.org.pk/api/index.php/api
return 503
The text was updated successfully, but these errors were encountered: