This repository contains the code and data for scraping and analyzing data from SINTA (Science and Technology Index) for the Biology Faculty at UGM (Universitas Gadjah Mada) and also other life science researchers at 4 other Indonesian Universities
These instructions will guide you on how to run the analysis on your local machine.
You need to have Mamba installed on your machine.
- Clone this repository to your local machine:
git clone https://github.com/yourusername/sinta_bio_ugm.git
- Navigate to the cloned repository:
cd sinta_bio_ugm
- Install the required Python packages using the provided env.yaml file:
mamba env create -f env.yaml
- Activate the conda environment:
conda activate your_env_name
- Navigate to the notebooks_UGM directory (or the other notebooks):
cd notebooks_UGM
- Run the Jupyter notebooks in order:
jupyter notebook 00_data_scraping.ipynb
jupyter notebook 01_EDA.ipynb
jupyter notebook 02_co-author-network.ipynb
The 00_data_scraping.ipynb
notebook scrapes the data, 01_EDA.ipynb
performs exploratory data analysis, and 02_co-author-network.ipynb
creates a co-author network.
The results of the analysis are stored in the figures directory. The scraped data is stored in the data directory.