inicio mail me! sindicaci;ón

The lesson of the Netflix Prize

by Rob Paterson

For those that do not know – Netflix held a multi year competition to find a better search and ratings system – many teams competed.

In the final stretch the breakthrough came when many of the teams joined forces – the big difference was made by adding teams that up to then had “got it wrong”. A great story of this competition is on Wired.

The secret sauce for both BellKor’s Pragmatic Chaos and The Ensemble was collaboration between diverse ideas, and not in some touchy-feely, unquantifiable, “when people work together things are better” sort of way. The top two teams beat the challenge by combining teams and their algorithms into more complex algorithms incorporating everybody’s work. The more people joined, the more the resulting team’s score would increase.

//

“It’s been quite a drama,” said Netflix chief product officer Neil Hunt at Monday’s awards ceremony. “At first, a whole lot of teams got in — and they got 6-percent improvement, 7-percent improvement, 8-percent improvement, and then it started slowing down, and we got into year two. There was this long period where they were barely making progress, and we were thinking, ‘maybe this will never be won.’

“Then there was a great insight among some of the teams — that if they combined their approaches, they actually got better. It was fairly unintuitive to many people [because you generally take the smartest two people and say 'come up with a solution']… when you get this combining of these algorithms in certain ways, it started out this ’second frenzy.’ In combination, the teams could get better and better and better.”

Ironically, the most outlying approaches — the ones farthest away from the mainstream way to solve a given problem — proved most helpful towards the end of the contest, as the teams neared the summit.

For instance, BellKor’s Pragmatic Chaos (methodology here) credits some of its success to slicing the data by what they called “frequency.” As it turns out, people who rate a whole slew of movies at one time tend to be rating movies they saw a long time ago. The data showed that people employ different criteria to rate movies they saw a long time ago, as opposed to ones they saw recently — and that in addition, some movies age better than others, skewing either up or down over time. (Finally, someone has explained why Snakes On A Plane seemed more fun at the time than it does now.)

By tracking the number of movies rated on a given day as an indicator of how long it had been since a given viewer had seen a movie, and by tracking how memory affected particular movie ratings, Pragmatic Theory (later part of the winning team) was able to gain a slight edge, even though this particular algorithm isn’t particularly good at predicting which movies people will like when run on its own.

Another example: According to Joe Sill of The Ensemble, Big Chaos (the Austrians who also became part of the winning team) discovered that viewers in general tend to rate movies differently on Fridays versus Mondays, and certain users are in good moods on Sundays, and so on. The team essentially devised a three-dimensional model that incorporated time into the relationship between people and movies.

Taken on its own, the fact that a viewer rated a given movie on a Monday is a horrible indicator of what other movies they’ll want to rent — a crucial part of Netflix’ business (it says its recommendations are better indicators of what people will rent than their “most popular” lists). But combined with hundreds of other algorithms from other minds, each weighted with precision, and combined and recombined, that otherwise inconsequential fact takes on huge importance.

“One of the big lessons was developing diverse models that captured distinct effects,” said Sill, “even if they’re very small effects.”

This approach is the opposite of how we have been taught to solve problems. There has to be a plan and a few smart folks working to the plan.

What I see here is the power of setting in place the conditions that allow for “emergence”.

Science and Research is going to explode by going down this path.

What will be needed are great supporting tools – watch this space!

Share and Enjoy:
  • E-mail this story to a friend!
  • Print this article!
  • TwitThis
  • del.icio.us
  • Facebook
  • Reddit
  • Digg
  • Google
  • StumbleUpon
  • SphereIt


7 Tweets

7 Comments »

ffblogSeptember 30th, 2009 at 11:35 am

New Post “The lesson of the Netflix Prize” http://bit.ly/IIo9d

This comment was originally posted on Twitter

dkasrelSeptember 30th, 2009 at 12:02 pm

Interesting insight: RT @ffblog: New Post “The lesson of the Netflix Prize” http://bit.ly/IIo9d

This comment was originally posted on Twitter

mdglissSeptember 30th, 2009 at 4:34 pm

Lessons from the Netflix prize http://bit.ly/1GxnBu

This comment was originally posted on Twitter

hkotadiaSeptember 30th, 2009 at 11:35 pm

The lesson of the Netflix Prize http://bit.ly/2FvO8r #innovation #crowdsourcing

This comment was originally posted on Twitter

Intranets20October 1st, 2009 at 1:46 am

The lesson of the Netflix Prize http://ow.ly/15Sd2u

This comment was originally posted on Twitter

medicieffectOctober 1st, 2009 at 7:24 pm

Winning groundbreaking Netflix ratings solution is textbook case of the Medici Effect: http://bit.ly/1GxnBu (thanks @chrisyeh)

This comment was originally posted on Twitter

bradbigOctober 6th, 2009 at 12:12 am

RT @medicieffect Winning groundbreaking Netflix ratings solution is textbook case of the Medici Effect: http://bit.ly/1GxnBu (thx @chrisyeh)

This comment was originally posted on Twitter

» Subscribe to the RSS feed for these comments

Your comment

Want an image to appear near your comment? Go to gravatar.com

HTML-Tags:
<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Additional comments powered by BackType