The Data Analysis Program is a desktop application built with PyQt5, pandas, Seaborn, and Matplotlib for data exploration, analysis, and visualization. It provides a user-friendly interface for loading datasets, inspecting their structure, generating descriptive statistics, and running correlation analysis, along with the features listed below.
- Load Datasets:
- Inspect Dataset:
- Prepare Data:
  - Fill null values and encode non-numeric data types to make the dataset machine learning-ready.
- Basic Machine Learning Model:
- Categorize Columns:
  - Identify categorical and numerical columns, distinguishing between high-cardinality categorical columns and numerical columns treated as categorical.
- Target Variable Summary:
- Correlation Analysis:
- Column Summary:
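
As a rough illustration of the Categorize Columns and Prepare Data features, the sketch below shows one way these steps could be done with pandas. The function names `categorize_columns` and `prepare_data`, both thresholds, the median/mode fill strategy, and the integer category codes are all assumptions made for this example; they are not taken from the program's source.

```python
import pandas as pd
from pandas.api.types import is_numeric_dtype


def categorize_columns(df, cat_threshold=10, card_threshold=20):
    """Group columns by type and cardinality.

    Both thresholds are illustrative assumptions, not the program's values.
    """
    # Non-numeric columns are treated as categorical.
    cat_cols = [c for c in df.columns if not is_numeric_dtype(df[c])]
    # Numeric columns with few distinct values behave like categories.
    num_but_cat = [
        c for c in df.columns
        if is_numeric_dtype(df[c]) and df[c].nunique() < cat_threshold
    ]
    # Categorical columns with many distinct values are high-cardinality.
    cat_but_car = [c for c in cat_cols if df[c].nunique() > card_threshold]
    # Remaining numeric columns are treated as genuinely numerical.
    num_cols = [
        c for c in df.columns
        if is_numeric_dtype(df[c]) and c not in num_but_cat
    ]
    return cat_cols, num_but_cat, cat_but_car, num_cols


def prepare_data(df):
    """Fill nulls and encode non-numeric columns (one possible strategy)."""
    out = df.copy()
    for col in out.columns:
        if is_numeric_dtype(out[col]):
            # Assumed strategy: replace missing numbers with the median.
            out[col] = out[col].fillna(out[col].median())
        else:
            # Assumed strategy: replace missing labels with the most
            # frequent value, falling back to a placeholder label.
            mode = out[col].mode(dropna=True)
            fill = mode.iloc[0] if not mode.empty else "missing"
            out[col] = out[col].fillna(fill)
            # Encode each label as an integer code (sorted label order).
            out[col] = out[col].astype("category").cat.codes
    return out


# Small made-up dataset, for demonstration only.
df = pd.DataFrame({
    "age": [25.0, None, 40.0, 31.0],
    "city": ["Paris", "Rome", None, "Paris"],
    "rating": [1, 2, 1, 2],
})

cat_cols, num_but_cat, cat_but_car, num_cols = categorize_columns(
    df, cat_threshold=3
)
ready = prepare_data(df)
print(cat_cols, num_but_cat, cat_but_car, num_cols)
print(ready)
```

Integer codes imply an ordering that the original labels may not have, so a one-hot encoding such as `pd.get_dummies` may be a better fit for some models.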