Date: 2005-03-22
Language: C++
Early in 2005, I stumbled across Damashek's 1995 Science article "Gauging Similarity with n-Grams: Language-Independent Categorization of Text". This was my early attempt at implementing the algorithm and investigating how it worked with different languages.