Course Description

This course is an introduction to the interdisciplinary and emerging field of data science, which lies at the intersection of statistics, computer science, data visualization and the social sciences. The course will be organized around three central threads: (1) statistical modeling and machine learning, (2) data pipelines, programming languages and “big data” tools, and (3) real world topics and case studies. Correspondingly there will be (1) core lectures, (2) labs and (3) guest lectures from researchers and scientists who are experts in their fields. Topics and tools will include logistic regression, predictive modeling, clustering algorithms, decision trees, Hadoop, data pipelines, visualization, data journalism, R, python.

The class will be held Wednesday evenings in 503 Hamilton, 6:10-8:55pm (with a break)



  1. I found this free online course by Coursera to be relevant and I thought people might like to take a look at it.

    1. yep! I mentioned this course on the first day as a different approach/view to a data science course. Plus it’s free and online.

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: