4 years ago1,000+ Views
"To me, the best programmers are empty cups, who constantly explore new technology (noSQL, node-js, whatever) with open minds." - Ron Zacharski With companies such as Amazon, Netflix, and Pandora, data is being organized and analyzed more and more each day. As a programmer, it is important to understand the basics of data mining in order to be up to date with machine learning in the 21st century. I found this helpful e-book by Ron Zacharski that has many exercises in Python as well as data sets. I've been reading through it and it's a great and quick guide to data mining that is very applicable. Here are the pdfs: A Programmer's Guide to Data Mining: The Ancient Art of Numerati by Ron Zacharski Website: CHAPTERS 1. Intro 2. Get Started With Recommended Systems Code & Data: 3. Implicit Ratings and Item-based Filtering Code & Data: 4. Classification Code & Data: 5. Further Explorations in Classification Code & Data: 6. Naïve Bayes Code & Data: 7. Naïve Bayes and unstructured text Code & Data: 8. Clustering ( 1st half of chapter, still in "Beta") Code & Data: