Data-driven modeling

↧

Lecture 03

February 13, 2012, 9:09 am

With document classification (e.g., spam filtering) as a motivating example, we reviewed issues with kNN for high-dimensional classification problems—namely the curse of dimensionality—and explored...

View Article

Lecture 04

February 27, 2012, 10:16 am

Our previous discussion of naive Bayes led us to the problem of overfitting, specifically in dealing with rare words for text classification. We investigated this problem a bit more formally in the...

View Article

Lecture 05

March 5, 2012, 9:11 am

In this lecture we studied maximum likelihood inference for linear classifiers. We saw that ordinary least squares regression can be phrased as maximum likelihood inference under the assumption of...

View Article

Homework 02

March 10, 2012, 4:11 pm

The second homework is posted. The first problem is an exercise in using cross-validation to select the best-fit polynomial for some synthetic data where both the degree and coefficients are unknown....

View Article

Lecture 06

March 19, 2012, 8:20 am

In this lecture we looked at non-linear feature transformations to accomodate more complex decision boundaries, introduced regularization to avoid overfitting, and covered the kernel trick for learning...

View Article

Image may be NSFW.
Clik here to view.

Lecture 07

March 26, 2012, 12:11 pm

In this lecture we extended our toolbox of linear classification methods to include support vector machines (SVMs). We began with a unifying view of loss functions for classification, including...

View Article

Lecture 08

April 2, 2012, 4:36 am

We had our first guest lecture this week. John Myles White presented recent work on modeling data from functional MRI experiments to understand the relationship between various mental states and...

View Article

Final Project

April 4, 2012, 10:56 am

Here are details for the final project. The main objective is to apply techniques we’ve discussed in class to real-world data. You may either use a pre-compiled data set or build your own data set...

View Article

Lecture 09

April 9, 2012, 11:38 am

In this lecture we discussed recommendation systems in general, and collaborative filtering in particular. With the Netflix Prize as a motivating example, we saw that simple memory-based methods (e.g.,...

View Article

Lecture 10

April 16, 2012, 12:24 pm

This week we had our second guest lecture. Max Shron presented a live demo of using Google Transit data to analyze the effects of budget cuts on passenger wait times, adapting his original analysis for...

View Article