ARAVALI makes predictions better and alerts more relevant. An Open Source System Prototype to predict, prevent, or manage the impact of natural disasters. This project is developed for hackathon codefundo++ 2018.
ARAVALI is based on (A)gg(R)egated (A)rri(V)al of (L)atent (I)nformation to:
- Improve predictions (in time and accuracy) by combining predictions from curated data (offline) and aggregated real-time public social media and sensor systems from IMD, CWC etc.
- Monitor keywords from social media posts – update database (online)
- If emergency, disseminate curated alerts via SMS in regional languages to concerned demographics
ARAVALI now has a webapp front-end. It allows updating database and retraining models at regular intervals. It is written In Python and Flask and deployed using Docker Image or simply on a VM Server. To login to the WebApp in Azure VM browse to http://104.211.227.124:5000 Once you sign up and login, you can upload datasets in desired format and retrain the AI model. The backend uses a Random Forest classifier.
Synopsis :
CWC, a national body, manages a flood forecasting network of 175 stations across India. IMD, another national body, handles cyclone and rainfall warnings, currently involved with “Titli”. As per data from 2009, roughly 5000 forecasts classified into 4 categories are issued every year during flood season. Even before IMD was established, cyclone warning systems are in place since 1865 that has a 4 stage system in place today. The alerts are issued 1-2 days in advance currently. When there are a lot of false warnings, it creates unnecessary panic and widespread chaos for preparedness like moving army, evacuation, arranging shelters. People on the move, in the seas need to take precautions, cancel travel, return to coast, move to high grounds and more. During an emergency, generic alerts in non-regional languages via websites, news and radio is not adequate today. The SMS system for registered mobiles with IMD is not great. We can assist them better.
We have data from IMD and news (e.g. The-Hindu) archives for the past 100 years. We have huge amount of data generated by social media today. If we can combine the available historical records of actual events (cyclones, landslides, loss of life, infrastructure, trajectory etc.) from news APIs with historical sensor data collected by IMD & other depts, and fuse the two together, poor prediction resulting from false positives can be greatly reduced and latent patterns can be learned. If we can aggregate data from Tweets (social media with permissions) real-time updates can make alerts more relevant.
Secondly, despite the penetration of internet, traditional SMS (no internet) alerts with on-demand regular relevant status updates, predictions, and safety precautions in regional languages is necessary. If we can do this, a larger audience can be reached.
The real-time measurements from satellite, sensors and gauges always offer greater weightage. They are complementary.
Finally, using feedback from the community via SMS in real-time can be used as surrogate sensors to gather additional data. For ex: Pings on rising water levels in streets or mud flow
The front-end is a web application, while back-end does computing. The end-user interacts offline.
- Avaiable datasets from IMD and Unisys is stored
- Newsapi & NyTimes API collects data like casualties, inches of rain, days of rain, people evacuated, shelters, expenses etc.
- Database is periodically fused, updated
- Tweets, feedbacks are monitored
- Messages are composed with updates, forecast, damage etc
- If classified emergency, alerts are send via SMS to selected phone numbers
We make use of Azure Functions for serverless architecture, Azure Storage solutions, newsapi's and NYTimes API for datamining, tweepy API for updates, way2sms API for SMS services, translator and transliteration API for alerts, analysis, and DSVM for machine learning. Docker and Azure Container Registry for hosting application. C# and Python.
- IMD (https://data.gov.in/catalogs/ministry_department/ministry-earth-sciences)
- IMD (https://www.kaggle.com/rajanand/rainfall-in-india)
- Unisys http://weather.unisys.com/hurricanes/1972/north-indian
- http://digitalgateway.in/Mobile-Database (must be curated, currently using sample data)
To contribute clone the repo, prepare credentials for APIs, and submit updates.
Note: API policies are subject to change
Future Works:
- Learn trajectory of cyclones and floods from historical data
- No SIM coverage reachability
- NLP for IVR
- On demand Status Update by SMS and IVR