From f3356784c5882b74a3e7d9ace30b3c45dc13a170 Mon Sep 17 00:00:00 2001 From: Clemens Wolff Date: Wed, 16 Jul 2014 08:24:42 +0100 Subject: [PATCH] Add Rosetta - Python text processing library Rosetta is a Python library to make data-science on textual data easier. The library contains a number of high-performance utilities for common tasks encountered during text-mining (e.g. tokenization). The library also contains a number of wrappers to make working with libraries such as Vowpal Wabbit and Gensim simpler. Introductory video: http://youtu.be/Xv6exLVbGPk --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index a74caf6..f99cd79 100644 --- a/README.md +++ b/README.md @@ -207,6 +207,7 @@ on MNIST digits[DEEP LEARNING] * [loso](https://github.com/victorlin/loso) - Another Chinese segmentation library. * [genius](https://github.com/duanhongyi/genius) - A Chinese segment base on Conditional Random Field. * [nut](https://github.com/pprett/nut) - Natural language Understanding Toolkit +* [Rosetta](https://github.com/columbia-applied-data-science/rosetta) - Text processing tools and wrappers (e.g. Vowpal Wabbit) #### General-Purpose Machine Learning