Category Ethics and Humanity

Data & Hubris

This is a guest post by Professor Matthew Jones, from Columbia’s History department, who has been attending the course. Data & Hubris In the wake of the recent election, data people, those that love, and especially those that idealize them exploded in Schadenfreude about the many errors of the traditional punditocracy. Computational statistics and data […]

Week 6: Kaggle, crowdsourcing, decision trees, random forests, social networks, and Google’s hybrid research environment

Each week Cathy O’Neil blogs about the class. Cross-posted from Yesterday we had two guest lecturers, who took up approximately half the time each. First we welcomed William Cukierski from Kaggle, a data science competition platform. Will went to Cornell for a B.A. in physics and to Rutgers to get his Ph.D. in biomedical […]

Week 5: GetGlue, time series, financial modeling, advanced regression, and ethics

Each week Cathy O’Neil blogs about the class. Cross-posted from But what makes this week unique is that Cathy was our guest lecturer. So first I need to introduce her, and then what follows is her blog post. Students in the class already know Cathy because she comes each week, asks good questions and […]

Next-Gen Data Scientists

The following is a prologue to a discussion of what makes for a good data scientist. Data is information and is extremely powerful. Models and algorithms that use data can literally change the world. Quantitatively-minded people have always been able to solve important problems, so this is nothing new, and there’s always been data, so […]

On Inspiring Students and Being Human

Dear Students, Lest you think (yes, I know I used that turn of phrase in posts before. I like it.) that I am bragging about my character traits (in which case you don’t know me well enough yet– I never brag, and who isn’t human?), wipe that thought from your mind, and read on. On […]

Human Ingenuity

Read the full paper here: Presented, in part, as inspiration: observe the elegance and simplicity of the model; the deep insight that solved a problem as massive as ranking sites on the web with a solution involving eigenvectors. Presented, also, for the student discussion on Anderson’s article.