Popular and easy-to-use data sets
- Wine Quality Dataset
- Haberman’s Survival Data Set
- Italian Football Data: Serie A & B
- 2012 Presidential Campaign Finance Data
Other Data Sources
If none of the datasets above interest you, try looking for data through one of the links below. Be warned: you may need to do significant data processing before you can begin your analysis.
- 2012 Presidential Campaign Finance Data
- New Residential Housing Sale Data
- Sean Lahman Baseball Statistics
- Links to six data lists curated by data scientists
- Inside-R blog post: Finding Data on the Internet
- UC Irvine Machine Learning Repository
- University of Missouri Libraries: Find Datasets on the Internet
- Datahub
- Freebase – open, community-curated database of information
The UN maintains hundreds of open data sets on its constituent websites (WHO, UNICEF, and others) that you might be interested in. It's a gold mine of excellent, cleaned-up factual data for analysis.
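
As a rough illustration of the kind of data processing the warning above refers to, here is a minimal Python sketch that tidies a small CSV export using only the standard library. The sample data, the column names (`name`, `price`, `sold`), the set of missing-value markers, and the `clean_rows` helper are all hypothetical and are not taken from any of the sources listed; real files will need their own rules.

```python
import csv
import io

# Hypothetical raw export: padded headers, "NA" markers, thousands separators.
RAW = """name , price ,sold
House A," 250,000 ",2012-01-05
House B,NA,2012-02-11
House C," 310,500 ",
"""

# Assumed markers for missing values; adjust for the file at hand.
MISSING = {"", "NA", "N/A", "null"}


def clean_rows(text):
    """Return dicts with trimmed keys, None for missing values, numeric prices."""
    reader = csv.DictReader(io.StringIO(text))
    cleaned = []
    for row in reader:
        record = {}
        for key, value in row.items():
            key = key.strip()
            value = (value or "").strip()
            record[key] = None if value in MISSING else value
        # Convert "250,000" style strings to floats when a price is present.
        if record["price"] is not None:
            record["price"] = float(record["price"].replace(",", ""))
        cleaned.append(record)
    return cleaned


rows = clean_rows(RAW)
# Keep only rows with no missing fields for the analysis step.
complete = [r for r in rows if all(v is not None for v in r.values())]
```

Whether to drop incomplete rows, as the last line does, or to keep them and handle the gaps later depends on the analysis you have in mind.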