OpenRefine is a free, open source power tool for working with messy data and improving it
Updated Dec 4, 2024 - Java
A Scalable Data Cleaning Library for PySpark.
Data visualisations in Power BI
Table Enforcer is my attempt to apply a sort of "test driven development" workflow to data cleaning and validation. It is a Python package that facilitates the iterative process of developing and using schema-like representations of pandas DataFrames for recoding and validating instances of these data.
Examples for Optimus, a data cleansing library for big data.
GitHub Repo of our Tidyverse workshop organized on Sep 8, 2022
This project targets the textual analysis of Egyptian movie plot summaries that were curated from online sources, covering the four golden decades of Egyptian cinema.
An advanced guide to data cleaning, with 20+ ways of cleaning data with Python
"Telewire Analytics" is a project aimed at optimizing resource utilization within the telecom industry.
A curated collection of notebooks and small projects containing linear and non-linear regression models.
This project extracts data from Azure Data Lake Storage Gen2, transforms it, and then loads it into a SQL database.
This course by the University of Michigan introduces the basics of the Python programming environment, including fundamental Python programming techniques such as lambdas, reading and manipulating CSV files, and the NumPy library. The course also introduces data manipulation and cleaning techniques using the pandas data science library.
This is an internal project with Intel in which a framework was developed for monitoring data quality from disparate sources and automating that monitoring using Python.
Analyzed survey responses using Power BI to draw useful insights.
This project is based on an online retail store that wants to analyse the major factors contributing to its revenue so that it can plan strategically for next year.
Data cleansing and validation for a Data Science Master's degree
Cleaned a movies dataset to present specific visuals to answer research questions