MarcusChong123/Text-Clustering-with-Python

Text-Clustering-with-Python

I am going to show you, step by step, how to perform text clustering with Python. For the full article, feel free to visit https://learndatascienceskill.com/index.php/2020/08/06/text-clustering-with-python/

In this tutorial, I will show you how to perform unsupervised machine learning with Python using text clustering. We will look at how to turn text into numbers using the TF-IDF vectorizer (`TfidfVectorizer`) from sklearn. We will also check the centroid of each cluster. Once we know the centroids, we can see which movies are closest to each one, and that helps us understand the similarities between these movies.

I will show you, step by step:

  1. How to load the data into a Google Colab notebook
  2. How to explore the data
  3. How to pre-process the data with the TF-IDF vectorizer from sklearn
  4. How to perform K-Means clustering using the scikit-learn library
  5. How to evaluate the results of the clustering
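The five steps above can be sketched end to end as follows. This is only a minimal outline under stated assumptions: the in-memory CSV, the column names `title` and `overview`, and the range of cluster counts are hypothetical, and using inertia and the silhouette score for evaluation is one common choice rather than necessarily the method used in the article.

```python
import io

import pandas as pd
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import silhouette_score

# Step 1: load the data. In Colab this would typically be an uploaded CSV;
# here a small in-memory CSV with hypothetical columns stands in for it.
csv_text = """title,overview
Movie A,space crew explores a distant planet and fights aliens
Movie B,astronauts travel through space to save a dying planet
Movie C,alien invasion forces space pilots into battle
Movie D,a young couple falls in love during a summer in paris
Movie E,romance blooms between two strangers on a train to paris
Movie F,a wedding planner finds love with the brother of the groom
Movie G,a detective hunts a killer through rainy city streets
Movie H,a retired detective reopens an old murder case in the city
"""
df = pd.read_csv(io.StringIO(csv_text))

# Step 2: explore the data (size, and word counts per description).
print(df.shape)
print(df["overview"].str.split().str.len().describe())

# Step 3: pre-process the text into TF-IDF vectors.
X = TfidfVectorizer(stop_words="english").fit_transform(df["overview"])

# Steps 4 and 5: fit K-Means for several cluster counts and record
# inertia (lower is tighter) and silhouette score (closer to 1 is better).
scores = {}
for k in range(2, 5):
    km = KMeans(n_clusters=k, n_init=10, random_state=42).fit(X)
    scores[k] = (km.inertia_, silhouette_score(X, km.labels_))

for k, (inertia, silhouette) in scores.items():
    print(f"k={k}: inertia={inertia:.3f}, silhouette={silhouette:.3f}")
```

Comparing these numbers across values of `k` gives a rough basis for choosing the number of clusters before inspecting the centroids.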

YouTube video: https://www.youtube.com/watch?v=ORpDAUQUnkU
