Category Archives: Big Data

Slides of 12 tutorials at ACM SIGKDD 2014

Slides of the 12 tutorials taught by data science experts and thought leaders at ACM SIGKDD 2014 are available at http://www.kdd.org/kdd2014/tutorials.html. They are listed below. 1. Scaling Up Deep Learning Yoshua Bengio 2. Constructing and mining web-scale knowledge graphs Antoine … Continue reading

Posted in Big Data, Data Mining

Step-by-Step Guide to Setting Up an R-Hadoop System

by Yanchang Zhao, RDataMining.com. Following my first R-Hadoop system setup guide, written in Sept 2013, I have further tested setting up a Hadoop system for running R code, as well as using HBase. I have tested it both on a … Continue reading

Posted in Big Data, R | 2 Comments

Step by step to build my first R Hadoop System

by Yanchang Zhao, RDataMining.com. After reading documents and tutorials on MapReduce and Hadoop and playing with RHadoop for about two weeks, I have finally built my first R Hadoop system and successfully run some R examples on it. My experience … Continue reading

Posted in Big Data, R | 2 Comments

An excellent introduction to MapReduce and Hadoop

by Yanchang Zhao, RDataMining.com. The lectures in week 3 of the free online course Introduction to Data Science give an excellent introduction to MapReduce and Hadoop, and demonstrate with examples how to use MapReduce for various tasks, such as … Continue reading

Posted in Big Data, Data Mining | 14 Comments