This repository has been archived by the owner on Mar 1, 2021. It is now read-only.

Detects languages out of any text (50+ languages supported).

Norconex/language-detector

Norconex Language Detector

This project is no longer maintained.

This project was created to provide language-detection features to the Norconex Importer project. The Importer project now uses Apache Tika built-in language-detection capabilities instead.

Detects the language of any text (50+ languages supported).

At the moment, it is mainly a wrapper around the great "language-detection" library from Nakatani Shuyo, with some additions:

  • It allows several detectors, each initialized with different language profiles, to run concurrently in the same JVM.
  • It offers several ways to initialize the language profiles (from input streams, from the classpath, etc.).
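
The two additions above can be illustrated with a minimal sketch. This is not the Norconex or Shuyo API: `ToyDetector`, `addProfile`, `detect`, and the word-list profile format are all hypothetical names and assumptions made up for this example. It only shows the general idea of keeping profiles in per-instance state (instead of JVM-wide static state) and of loading them from any `InputStream`.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical toy detector, for illustration only.
// Each instance owns its profiles, so two instances never share state.
final class ToyDetector {
    // Language code -> words assumed to be typical of that language.
    private final Map<String, Set<String>> profiles = new HashMap<>();

    // Loads one profile from any stream: a file, a classpath resource
    // (e.g. obtained with Class.getResourceAsStream), or memory.
    // Assumed profile format: whitespace-separated words, UTF-8.
    void addProfile(String lang, InputStream in) {
        Set<String> words = new HashSet<>();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(in, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                for (String w : line.trim().toLowerCase().split("\\s+")) {
                    if (!w.isEmpty()) {
                        words.add(w);
                    }
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        profiles.put(lang, words);
    }

    // Returns the language whose profile matches the most words,
    // or "unknown" when no profile matches any word.
    String detect(String text) {
        String best = "unknown";
        int bestScore = 0;
        for (Map.Entry<String, Set<String>> e : profiles.entrySet()) {
            int score = 0;
            for (String w : text.toLowerCase().split("\\s+")) {
                if (e.getValue().contains(w)) {
                    score++;
                }
            }
            if (score > bestScore) {
                bestScore = score;
                best = e.getKey();
            }
        }
        return best;
    }
}

class ToyDetectorDemo {
    static InputStream mem(String s) {
        return new ByteArrayInputStream(s.getBytes(StandardCharsets.UTF_8));
    }

    public static void main(String[] args) throws InterruptedException {
        // Two detectors with different profile sets in the same JVM.
        ToyDetector enFr = new ToyDetector();
        enFr.addProfile("en", mem("the and is of"));
        enFr.addProfile("fr", mem("le la et est"));

        ToyDetector esOnly = new ToyDetector();
        esOnly.addProfile("es", mem("el la y es"));

        // Profiles are fully loaded before the threads start and are only
        // read afterwards, so the detectors can run concurrently.
        Thread t1 = new Thread(() ->
                System.out.println(enFr.detect("le chat est noir")));
        Thread t2 = new Thread(() ->
                System.out.println(esOnly.detect("el gato es negro")));
        t1.start();
        t2.start();
        t1.join();
        t2.join();
    }
}
```

Because the profiles live in instance fields, the word "la" can appear in both the French and the Spanish profile without the two detectors interfering with each other.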

The original language-detection project by Nakatani Shuyo is hosted at: https://code.google.com/p/language-detection. A fork of that project is available on GitHub at: https://github.com/Norconex/language-detection
