❗ We are sunsetting Orion. You can watch a video of the platform on YouTube.
Orion is a research measurement and knowledge discovery tool that enables you to monitor progress in science, visually explore the scientific landscape and search for relevant publications.
This repo contains Orion's data collection, enrichment and analysis pipeline for scientific documents from Microsoft Academic Graph. You can find the rest of our work in the following repositories:
- Search engine
- Web-interface
- Talks, demos, papers and tutorials on Orion. Most of the content was made for presentations at venues such as the SciNLP, NetSci, IC2S2 and WOOC.
- Micro-service deployment [WIP]
To learn more about Orion, check out the documentation website.
Orion is open-source. If you want to use our work or parts of it, be a good citizen of the Internet and drop us an acknowledgement. We would also love to know what you are developing so get in touch!
- Clone Orion's ETL
git clone https://github.com/orion-search/orion
- Modify Orion using the
model_config.yaml
and the.env
files as shown in this tutorial. - Run Orion's ETL in docker
docker-compose up
- Access and run Orion's DAG at
http://localhost:8080/admin/
- Update data schema.
- Change Airflow operators to kubernetes.