Add Rosetta - Python text processing library

Rosetta is a Python library that makes data science on textual data easier.
The library contains a number of high-performance utilities for common tasks
encountered during text mining (e.g. tokenization).
The library also contains a number of wrappers to make working with libraries
such as Vowpal Wabbit and Gensim simpler.

Introductory video: http://youtu.be/Xv6exLVbGPk
Clemens Wolff 2014-07-16 08:24:42 +01:00
parent 5c0090b459
commit f3356784c5

@@ -207,6 +207,7 @@ on MNIST digits[DEEP LEARNING]
* [loso](https://github.com/victorlin/loso) - Another Chinese segmentation library.
* [genius](https://github.com/duanhongyi/genius) - A Chinese segment base on Conditional Random Field.
* [nut](https://github.com/pprett/nut) - Natural language Understanding Toolkit
* [Rosetta](https://github.com/columbia-applied-data-science/rosetta) - Text processing tools and wrappers (e.g. Vowpal Wabbit)
#### General-Purpose Machine Learning