Analyze the Wine Reviews data and find insights using Python libraries

gautam2000/Wine-Reviews-Data-EDA-Project

Exploratory Data Analysis (EDA) of the Wine Reviews Data with Python

Project Data:

This project performs exploratory data analysis (EDA) on the Wine Reviews dataset from Kaggle. Kaggle dataset link: https://www.kaggle.com/zynicide/wine-reviews

Content:

This dataset contains three files:

winemag-data-130k-v2.csv contains 10 columns and 130k rows of wine reviews.

winemag-data_first150k.csv contains 10 columns and 150k rows of wine reviews.

winemag-data-130k-v2.json contains 6919 nodes of wine reviews.

Getting Useful Insights from the Dataset:

The exploratory analysis produced the following insights:

1. Top 10 highest-rated wines in the Wine Reviews data.

2. Top 10 most expensive wines in the Wine Reviews data, listed with their ratings.

3. Top 10 cheapest wines in the Wine Reviews data, listed with their ratings.
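
The three rankings above can be sketched with pandas as follows. This is a minimal illustration, not the notebook's actual code: the tiny DataFrame is made-up sample data standing in for the CSV, the column names `title`, `points`, and `price` are assumptions about the file layout, and `top_n` is a hypothetical helper.

```python
import pandas as pd

# Made-up sample rows standing in for winemag-data-130k-v2.csv.
# Column names (title, points, price) are assumptions about the CSV layout.
reviews = pd.DataFrame(
    {
        "title": ["Wine A", "Wine B", "Wine C", "Wine D", "Wine E"],
        "points": [88, 95, 91, 100, 84],
        "price": [15.0, 120.0, None, 350.0, 9.0],
    }
)


def top_n(df, column, n=10, largest=True):
    """Return up to n rows with the largest (or smallest) values in `column`."""
    # Drop rows with no value in the ranking column so they cannot appear.
    ranked = df.dropna(subset=[column])
    if largest:
        return ranked.nlargest(n, column)
    return ranked.nsmallest(n, column)


top_rated = top_n(reviews, "points")                # highest ratings first
most_expensive = top_n(reviews, "price")            # highest prices first
cheapest = top_n(reviews, "price", largest=False)   # lowest prices first

print(top_rated[["title", "points"]])
print(most_expensive[["title", "price", "points"]])
print(cheapest[["title", "price", "points"]])
```

To run this against the real data, the sample DataFrame could be replaced with `pd.read_csv("winemag-data-130k-v2.csv")`, assuming the file has been downloaded from Kaggle into the working directory.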

Data visualisation with the Seaborn and Matplotlib libraries produced the following insights:

1. Distribution of category, shown with a pair plot.

2. Relationship between alcohol and rating, shown with a regression plot.

3. Relationship between rating and price, shown with a scatter plot.
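
The regression and scatter plots listed above might be drawn along these lines. This is a sketch under stated assumptions rather than the project's own code: the values are made up, and the column names `alcohol`, `rating`, and `price` are placeholders that may not match the columns in the notebook.

```python
import matplotlib

matplotlib.use("Agg")  # render off-screen so the script runs without a display

import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Made-up values for illustration only; column names are assumptions.
data = pd.DataFrame(
    {
        "alcohol": [12.0, 12.5, 13.0, 13.5, 14.0, 14.5, 15.0],
        "rating": [85, 86, 88, 87, 90, 91, 93],
        "price": [10.0, 14.0, 22.0, 18.0, 35.0, 48.0, 80.0],
    }
)

fig, (ax_reg, ax_scatter) = plt.subplots(1, 2, figsize=(10, 4))

# Regression plot: scatter of alcohol vs. rating with a fitted linear trend.
sns.regplot(data=data, x="alcohol", y="rating", ax=ax_reg)
ax_reg.set_title("Alcohol vs. rating")

# Scatter plot: rating vs. price, with no fitted line.
sns.scatterplot(data=data, x="rating", y="price", ax=ax_scatter)
ax_scatter.set_title("Rating vs. price")

fig.tight_layout()
```

Calling `fig.savefig("plots.png")` (a hypothetical file name) afterwards writes the figure to disk. In a notebook, the `matplotlib.use("Agg")` line can be dropped so the plots render inline.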