Short tech challenge for candidates applying to Open Cities Lab
- All code and your response should be on a public GitHub repository.
- Share the GitHub repository link with [email protected].
Write a Python script to:
- Scrape data from the following options on Valuation 2022 Durban:
- Full Title Property
- Sectional Title Property
- Clean the scraped data to ensure consistency and accuracy.
- Store the scraped data in a PostgreSQL database.
Part 2: API Development [https://swagger.io/docs/specification/about/]
Create a GET API to query the data stored in Part 1 using the OpenAPI specification. Ensure the API includes:
- The schema for both the request and response objects.
- The OpenAPI specification attached to the repository.
Write a short solution outlining how you would deploy this application to production. Your solution should address the following:
- How to schedule the scrapers to run daily and save the data to the database.
- Which Python web application framework you would use to deploy the API from Part 2 and what the deployment would look like on your chosen cloud provider. Include details such as the architecture, services used, and any relevant configurations.
- How you would handle errors, downtime & alerts.
Please ensure all parts of the assessment are completed and included in this repository. Once done, share the link to the repository with [email protected].
Should Valuation 2022 Durban not be working use Valuation 2012 Durban or Valuation 2017 Durban