This project aims to provide guide for decision making for Microsoft Movie. It analyzes three datasets using python and the findings are that animated genre produce highest net profit, the more the production budget there's a tendency that the net profit would increase too.
Microsoft has a new movie studio and would like to produce movies that would bring in profits and enjoyed by the audience. The objective is to gain comprehensive insights into the characteristics and genres that resonate most with audiences.
- which genre is more profitable?
- What is the ideal budget production?
- How long movies should be?
Three datasets are used in this project:
- Internet Movie Database (IMDB)
It contains multiple tables but the tables of interest are movie_basics and movie_ratings where the averagerating and runtime_minutes will be used to determine the ideal length of movie.
- The Numbers (TN)
TN dataset contains 5,782 records and 6 columns. The column of interest are production budget and worldwide gross. This will enable us to calculate the profit and investigate on Return on Investment.
- The MovieDB (TMDB)
TMDB dataset contains 26,517 records and 9 columns. Genre_id column is used to map genre of the movies, to find which genre is more profitable.
The TN and TMDB datasets are merged after dropping unnecesaary columns and missing values. Analysis like Net profit, Return on Investment and finding profitable genre is performed using this dataset.Scatter plots and bar graphs were used to visualize the findings. In calculating ROI median is used due to presence of outliers.
- Animation was the most profitable genre.
- High buget movies generate huge profits.
- Ideal length of movie liked by audience ranges from 80 to 90 minutes.
-
- Produce movies within the Animation genres.
-
- Invest in High Budget movies.
-
- Make movies length near 90 minutes.