Skip to content

fedepacher/MOOCProject

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Massive Open Online Course Analysis Project

Introduction

Analysis to present data on course revenue, in order to know which Course has the highest revenue, Also to understand and track where opportunities may lie to increase revenue generation, and track the performance of courses.

About this project

This is a project for the SoyHenry academy.

Repository content

  • eda.ipynb: Contain the EDA process for the CSV files.
  • web_scrap.py: Contain the web scraping to the udemy web page.
  • query.sql: Contain all the query to generate the dimension and fact tables based on the CSV files created on te EDA process.
  • Datasets: Folder that contains original datasets.
  • Dashboard: Folder that contains new generated dataset that were used to create the Power Bi dashboard.
  • udemy_dashboard.pbix: Interactive dashboard in Power Bi.

Workflow

For the project is used Github Action Project, you can access this project in the following link.
This project will follow the conventional commits of the following link.
Each task will be divided into an issue. Here you can find all the issues available at the moment.
The scheduled task timetable can be access in the following link.

Issues

1. Web scraping

It has been done a web scraping precess to the Udemy platform in order to get missing information such as rating and language.
For this task it has been created the following issue where you can find the task description to solve.

2. Exploratory Data Analysis (EDA)

For this task it has been created the following issue where you can find the task description to solve.

3. Dashboard

For this task it has been created the following issue where you can find the task description to solve.
The following image shows an interactive dashboard:

Contains the KPIs values and different graphs that shows:

  • Number of Courses per Year
  • Average subscriber per subject
  • Courses Subjects
  • Average cost per subject and level
  • Average of content duration per subject
  • Average of rating per Subject and level

4. SQL Table

For this task it has been created the following issue where you can find the task description to solve.
It has created the following star model to work in the Power Bi environment:

License

This project is licensed under the GPL-2.0 license.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published