Accompanying Source Code for the Haskell Data Analysis Cookbook
Go to file
2014-08-09 08:55:42 +08:00
Ch01 Adapted code to current version of Database.MongoDB 2014-08-09 08:53:23 +08:00
Ch02 Fixed wording in README 2014-05-31 14:38:24 -04:00
Ch03 Corrected expected output 2014-08-09 08:55:42 +08:00
Ch04 Added Ch 5, fixed Ch 4 2014-06-13 00:33:05 -04:00
Ch05 Added comments to Ch05 2014-06-21 11:10:20 -04:00
Ch06 Added chapter 6 2014-06-21 11:31:36 -04:00
Ch07 Added Chapter 7 2014-06-21 11:53:35 -04:00
Ch08 Added Chapter 8 2014-06-21 12:03:14 -04:00
Ch09 Added Chapter 9 2014-06-24 09:12:11 -04:00
Ch10 Added Chapter 10 2014-06-24 19:53:01 -04:00
Ch11 Added cabal file 2014-07-06 11:01:12 -04:00
Ch12 Added Chapter 12 2014-06-24 20:28:06 -04:00
README.md Updated README 2014-06-24 20:31:37 -04:00

Haskell-Data-Analysis-Cookbook

This is the accompanying source code for Haskell Data Analysis Cookbook.

The latest source code is available on GitHub: https://github.com/BinRoot/Haskell-Data-Analysis-Cookbook

Summary

Chapter 1

The Hunt for Data, identifies core approaches in reading data from various external sources such as CSV, JSON, XML, HTML, MongoDB, and SQLite.

Chapter 2

Integrity and Inspection, explains the importance of cleaning data through recipes about trimming whitespace, lexing, and regular expression matching.

Chapter 3

The Science of Words, introduces common string manipulation algorithms including base conversions, substring matching, and computing the edit distance.

Chapter 4

Data Hashing, covers essential hashing functions such as MD5, SHA256, GeoHashing, and perceptual hashing.

Chapter 5

A Dance with Trees, establishes an understanding of the tree data structure through examples including tree traversals, balancing trees, and Huffman coding.

Chapter 6

Graph Fundamentals, manifests rudimentary algorithms for graphical networks such as graph traversals, visualization, and maximal clique detection.

Chapter 7

Statistics and Analysis, begins the investigation of important data analysis techniques encompassing regression algorithms, Bayesian networks, and neural networks.

Chapter 8

Clustering and Classification, involves quintessential analysis methods involving k-means clustering, hierarchical clustering, constructing decision trees, and implementing the k-Nearest Neighbors classifier.

Chapter 9

Parallel and Concurrent Design, introduces advance topics in Haskell such as forking IO actions, mapping over lists in parallel, and benchmarking performance.

Chapter 10

Real-time Fugue, incorporates streamed data interactions from Twitter, Internet Relay Chat (IRC), and sockets.

Chapter 11

Stunning Visuals, comprises of sundry approaches to plotting graphs including line charts, bar graphs, scatter plots, and D3.js visualizations.

Chapter 12

The Final Exporting, concludes the book with an enumeration of algorithms for exporting data to CSV, JSON, HTML, MongoDB, and SQLite.

Artwork Attribution

Illustrations by Lonku.