From 0c7cdc3ac07bdbcd5902c364da6e7799fd2f7f74 Mon Sep 17 00:00:00 2001 From: Eric Schles Date: Tue, 21 Jun 2016 09:37:16 -0400 Subject: [PATCH 1/2] Update README.md --- README.md | 6 ++++++ 1 file changed, 6 insertions(+) diff --git a/README.md b/README.md index f0e24f3..db0a52b 100644 --- a/README.md +++ b/README.md @@ -691,6 +691,7 @@ on MNIST digits[DEEP LEARNING] * [SimpleCV](http://simplecv.org/) - An open source computer vision framework that gives access to several high-powered computer vision libraries, such as OpenCV. Written on Python and runs on Mac, Windows, and Ubuntu Linux. * [Vigranumpy](https://github.com/ukoethe/vigra) - Python bindings for the VIGRA C++ computer vision library. * [OpenFace](https://cmusatyalab.github.io/openface/) - Free and open source face recognition with deep neural networks. +* [PCV](https://github.com/jesolem/PCV) - Open source Python module for computer vision #### Natural Language Processing @@ -716,6 +717,11 @@ on MNIST digits[DEEP LEARNING] * [colibri-core](https://github.com/proycon/colibri-core) - Python binding to C++ library for extracting and working with with basic linguistic constructions such as n-grams and skipgrams in a quick and memory-efficient way. * [spaCy](https://github.com/honnibal/spaCy/) - Industrial strength NLP with Python and Cython. * [PyStanfordDependencies](https://github.com/dmcc/PyStanfordDependencies) - Python interface for converting Penn Treebank trees to Stanford Dependencies. +* [Distance](https://github.com/doukremt/distance) - Levenshtein and Hamming distance computation +* [Fuzzy Wuzzy](https://github.com/seatgeek/fuzzywuzzy) - Fuzzy String Matching in Python +* [jellyfish](https://github.com/jamesturk/jellyfishå) - a python library for doing approximate and phonetic matching of strings. +* [editdistance](https://pypi.python.org/pypi/editdistance) - fast implementation of edit distance +* [textacy](https://github.com/chartbeat-labs/textacy) - higher-level NLP built on Spacy #### General-Purpose Machine Learning From ce2f7ced0a8331917d3d42241a575598b505bdab Mon Sep 17 00:00:00 2001 From: Eric Schles Date: Tue, 21 Jun 2016 09:41:45 -0400 Subject: [PATCH 2/2] Update README.md --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index db0a52b..fd32c05 100644 --- a/README.md +++ b/README.md @@ -825,6 +825,8 @@ on MNIST digits[DEEP LEARNING] * [caravel](https://github.com/airbnb/caravel) - A data exploration platform designed to be visual, intuitive, and interactive. * [Dora](https://github.com/nathanepstein/dora) - Tools for exploratory data analysis in Python. * [Ruffus](http://www.ruffus.org.uk) - Computation Pipeline library for python. +* [SOMPY](https://github.com/sevamoo/SOMPY) - Self Organizing Map written in Python (Uses neural networks for data analysis). +* [HDBScan](https://github.com/lmcinnes/hdbscan) - implementation of the hdbscan algorithm in Python - used for clustering #### Misc Scripts / iPython Notebooks / Codebases