A Sequence of 9 Courses on Data Science Starts on Coursera on 2 June and 7 July 2014

A sequence of 9 courses on Data Science will start on Coursera on 2 June and 7 July 2014, to be lectured by(Associate/Assistant) Professors of Johns Hopkins University. The courses are designed for students to learn to become Data Scientists and apply their skills in a capstone project.

You can take the courses for free. However, if you want to get a Verified Certificate in the course, the Specialization Certificate or taking the Capstone Project, you will have to pay for it. The cost is
$49 each × 9 courses + $49 Capstone project = $490 Specialization Certificate.

Below is course information picked up from the courses homepage on Coursera website, and more details can be found at https://www.coursera.org/specialization/jhudatascience/1.

Course 1: The Data Scientist’s Toolbox
Upcoming Session: 2 June, 7 July
Duration: 4 weeks
Estimated Workload: 3-4 hours/week
URL: https://www.coursera.org/course/datascitoolbox
Description: Upon completion of this course you will be able to identify and classify data science problems. You will also have created your Github account, created your first repository, and pushed your first markdown file to your account.

Course 2: R Programming
Upcoming Session: 2 June, 7 July
Duration: 4 weeks
Estimated Workload: 3-5 hours/week
URL: https://www.coursera.org/course/rprog
Description: The course will cover the following material each week:
Week 1: Overview of R, R data types and objects, reading and writing data
Week 2: Control structures, functions, scoping rules, dates and times
Week 3: Loop functions, debugging tools
Week 4: Simulation, code profiling

Course 3: Getting and Cleaning Data
Upcoming Session: 2 June, 7 July
Duration: 4 weeks
Estimated Workload: 3-5 hours/week
URL: https://www.coursera.org/course/getdata
Description: Upon completion of this course you will be able to obtain data from a variety of sources. You will know the principles of tidy data and data sharing. Finally, you will understand and be able to apply the basic tools for data cleaning and manipulation.

Course 4: Exploratory Data Analysis
Upcoming Session: 2 June, 7 July
Duration: 4 weeks
Estimated Workload: 3-5 hours/week
URL: https://www.coursera.org/course/exdata
Description: After successfully completing this course you will be able to make visual representations of data using the base, lattice, and ggplot2 plotting systems in R, apply basic principles of data graphics to create rich analytic graphics from different types of datasets, construct exploratory summaries of data in support of a specific question, and create visualizations of multidimensional data using exploratory multivariate statistical techniques.

Course 5: Reproducible Research
Upcoming Session: 2 June, 7 July
Duration: 4 weeks
Estimated Workload: 3-5 hours/week
URL: https://www.coursera.org/course/repdata
Description: In this course you will learn to write a document using R markdown, integrate live R code into a literate statistical program, compile R markdown documents using knitr and related tools, and organize a data analysis so that it is reproducible and accessible to others.

Course 6: Statistical Inference
Upcoming Session: 2 June, 7 July
Duration: 4 weeks
Estimated Workload: 3-5 hours/week
URL: https://www.coursera.org/course/statinference
Description: In this class students will learn the fundamentals of statistical inference. Students will receive a broad overview of the goals, assumptions and modes of performing statistical inference. Students will be able to perform inferential tasks in highly targeted settings and will be able to use  the skills developed as a roadmap for more complex inferential challenges.

Course 7: Regression Models
Upcoming Session: 2 June, 7 July, 4 August
Duration: 4 weeks
Estimated Workload: 3-5 hours/week
URL: https://www.coursera.org/course/regmods
Description: In this course students will learn how to fit regression models, how to interpret coefficients, how to investigate residuals and variability.  Students will further learn special cases of regression models including use of dummy variables and multivariable adjustment. Extensions to generalized linear models, especially considering Poisson and logistic regression will be reviewed.

Course 8: Practical Machine Learning
Upcoming Session: 2 June, 7 July, 4 August
Duration: 4 weeks
URL: https://www.coursera.org/course/predmachlearn
Description: Upon completion of this course you will understand the components of a machine learning algorithm. You will also know how to apply multiple basic machine learning tools. You will also learn to apply these tools to build and evaluate predictors on real data.

Course 9: Developing Data Products
Upcoming Session: 2 June, 7 July, 4 August
Duration: 4 weeks
Estimated Workload: 3-5 hours/week
URL: https://www.coursera.org/course/devdataprod
Description: Students will learn how communicate using statistics and statistical products. Emphasis will be paid to communicating uncertainty in statistical results. Students will learn how to create simple Shiny web applications and R packages for their data products.

Capstone Project
Duration: 4 weeks
Description: The capstone project class will allow students to create a usable/public data product that can be used to show your skills to potential employers. Projects will be drawn from real-world problems and will be conducted with industry, government, and academic partners. The capstone project will be four weeks long, offered in conjunction with the series. The capstone class will be offered thrice yearly. The Capstone Project is available after you’ve completed all courses in the Specialization.

About Yanchang Zhao

I am a data scientist, using R for data mining applications. My work on R and data mining: RDataMining.com; Twitter; Group on Linkedin; and Group on Google.
This entry was posted in Data Mining, R and tagged , . Bookmark the permalink.

6 Responses to A Sequence of 9 Courses on Data Science Starts on Coursera on 2 June and 7 July 2014

  1. Pingback: A Sequence of 9 Courses on Data Science Starts on Coursera on 2 June and 7 July 2014 ← Patient 2 Earn

  2. Lauri says:

    Sadly the courses are of low quality. A bunch of these have been offered before and absolutely no changes have been made, despite the requests by both users and TAs. So take the free version first and then take them again the month after for money if you are still sure you’re willing to pay for it.

  3. Jason Byrne says:

    What do you mean by low quality? I am just partway into it and I am finding the instruction to be very clear (within the confines of boring slide-presentation format at least) and the examples and assignments are useful for obtaining practice at what is being taught. It is unfortunate that there is not a more challenging projects for example (so far at least), but I imagine that is hard to accomplish in a free online course. So I am looking towards building my own projects; though there is a lot to be said for having deadlines in place to keep the focus and motivation. As an introduction to data science I am very excited about completing it.
    Also good advice about completing it for free first, which I’m doing and recommend at least initially too. I’m not sure how useful an official certification is, once you can prove you have the skills on top of whatever Bachelors/PhD qualifications might be required by an employer.

  4. LightRiver says:

    Reblogged this on Deep thought and commented:
    9 courses về Data Science trên Coursera!

  5. I would agree with Lauri when she mentioned that the courses do lack quality but considering they are free, one cannot expect them to be the best. I have personally completed the first one i.e. ” Data Scientist Tollbox” and would now be moving on to paid online course either by Jigsaw Academy (Data Scientist Certification) or Data Science Central (Certification Course). Anyone here who has taken any of these courses?

  6. andrewb47 says:

    I’ve done the first three courses and am now on the fourth in the series of nine. I wrote a blog here evaluating my experience so far. The series of courses makes few assumptions about students’ capability at the start, so the initial courses are at an elementary level.

Leave a comment