"Those who cannot remember the past are condemned to repeat it." -- George Santayana
This is a compiled list of Kaggle competitions and their winning solutions for problems that don't fit well in regression, classification, sequence, or image regime.
The purpose to complie this list is for easier access and therefore learning from the best in data science.
Literature review is a crucial yet sometimes overlooked part in data science. To avoid reinventing the wheels and get inspired on how to preprocess, engineer, and model the data, it's worth spend 1/10 to 1/5 of the project time just researching how people deal with similar problems/datasets.
Time spent on literature review is time well spent.
This is only one list of the whole compilation. For other lists of competitions and solutions, please refer to:
- Kaggle - Regression
- Kaggle - Classification
- Kaggle - Sequence
- Kaggle - Image
Hope the compilation can save you efforts and offer you insights. Enjoy!
======
Wed 11 May 2016 – Wed 6 Jul 2016
The goal of this competition is to predict which place a person would like to check in to. For the purposes of this competition, Facebook created an artificial world consisting of more than 100,000 places located in a 10 km by 10 km square. For a given set of coordinates, your task is to return a ranked list of the most likely places.
======
Fri 15 Apr 2016 – Fri 10 Jun 2016
Which hotel type will an Expedia customer book?
======
Tue 1 Dec 2015 – Fri 8 Jan 2016
Given the sleigh's antiquated, weight-limited specifications, your challenge is to optimize the routes and loads Santa will take to and from the North Pole.
======
Thu 16 Jul 2015 – Wed 30 Sep 2015
Predict which coupons a customer will buy
======
Mon 20 Apr 2015 – Wed 1 Jul 2015
Predict the destination of taxi trips based on initial partial trajectories
======
Mon 24 Nov 2014 – Wed 7 Jan 2015
In this job scheduling problem, you will assign which elves work on which toys, at what time, and for how long. The goal is to complete all of the toys as early as possible, scaled by the natural log of the number of elves that work.
======
Tue 6 May 2014 – Tue 28 Oct 2014
Model friend memberships to multiple circles
======
Mon 12 May 2014 – Mon 15 Sep 2014
Use the ATLAS experiment to identify the Higgs boson
======
Mon 31 Mar 2014 – Tue 1 Apr 2014
Decode a sequence of pseudorandom numbers
======
Mon 14 Oct 2013 – Sun 2 Mar 2014
Reverse the arrow of time in the Game of Life
======
Mon 2 Dec 2013 – Sun 26 Jan 2014
He's making a list, checking it twice; to fill up his sleigh, he needs your advice
======
Tue 3 Sep 2013 – Mon 4 Nov 2013
Learning to rank hotels to maximize purchases
======
Thu 18 Apr 2013 – Wed 26 Jun 2013
Determine whether an author has written a given paper
======
Sat 13 Apr 2013 – Sun 14 Apr 2013
Predict which people are influential in a social network
======
Tue 5 Feb 2013 – Wed 27 Mar 2013
Can we objectively measure the symptoms of Parkinson’s disease with a smartphone? We have the data to find out!
======
Fri 11 Jan 2013 – Wed 20 Feb 2013
Predict what events our users will be interested in based on user actions, event metadata, and demographic information.
======
Fri 14 Dec 2012 – Fri 8 Feb 2013
Provide creative visualizations of the Kaggle leaderboard
======
Mon 10 Dec 2012 – Sun 20 Jan 2013
Using 3 years of school grading data supplied by the Colorado Department of Education and R-Squared Research, visually uncover trends in the Colorado public school system.
======
Fri 14 Dec 2012 – Sat 19 Jan 2013
Solve ye olde traveling salesman problem to help Santa Claus deliver his presents
======
Wed 24 Oct 2012 – Wed 21 Nov 2012
The Task: you will be given a path which, at one point in the training time period, was an optimal path from node A to B. The question is then to make a probalistic prediction, for each of the 5 test graphs, whether the given path is STILL an optimal path.
======
Fri 14 Sep 2012 – Mon 15 Oct 2012
Find hidden patterns, connections, and ultimately compelling stories in a treasure trove of data about US federal campaign contributions
======
Fri 3 Aug 2012 – Sun 7 Oct 2012
Predict which jobs users will apply to
======
Thu 7 Jun 2012 – Mon 10 Sep 2012
Start digging into electronic health records and submit your creative, insightful, and visually striking analyses.
======
Sat 18 Aug 2012 – Mon 27 Aug 2012
Your Analysis and/or Visualization featured in the Harvard Business Review
======
Thu 26 Apr 2012 – Thu 9 Aug 2012
Predict which songs a user will listen to.
======
Sat 21 Jul 2012 – Sun 22 Jul 2012
Can you predict if a listener will love a new song?
======
Tue 5 Jun 2012 – Tue 10 Jul 2012
The challenge is to recommend missing links in a social network. Participants will be presented with an external anonymized, directed social graph (no, not Facebook, keep guessing) from which some edges have been deleted, and asked to make ranked predictions for each user in the test set of which other users they would want to follow.
======
Thu 7 Jun 2012 – Sat 30 Jun 2012
Start digging into electronic health records and submit your ideas for the most promising, impactful or interesting predictive modeling competitions
======
Sat 24 Sep 2011 – Mon 17 Oct 2011
There's been a lot of recent work done in unsupervised feature learning for classification and there are a ton of older methods that also work well. The purpose of this competition is to find out which of these methods work best on relatively large-scale high dimensional learning tasks.
======