Table of Contents
General #
- research-quality data sets by Hilary Mason
- Guardian datablog all datasets
- Yahoo! Webscope - datasets from Yahoo! research
- http://www.indexmundi.com/
- http://www.freebase.com/
- http://www.google.com/publicdata/home - google public data explorer
- AWS: Public Data Sets
- Biological database
-
- https://quarry.wmflabs.org - "Run SQL queries against Wikipedia & other databases"
Bibliography #
Brain #
Governments #
- UN: http://data.un.org
- US: http://data.gov
- UK: http://data.gov.uk
Society #
Law enforcement #
- The Proceedings of the Old Bailey, 1674-1913
- http://www.fatalencounters.org/ - a database of people killed during interactions with law enforcement.
Economy #
Environment #
Food #
Nutrient #
- http://nutritiondata.self.com/ - nutrition data.
- USDA National Nutrient Database
Ingredients #
Restaurants #
Recipes #
Menu #
shopping #
- Instacart dataset
Geography #
Population #
Weather #
- You can use Mathematica.
- http://www.infochimps.com/tags/weather
-
http://aws.amazon.com/datasets/2759 - Daily Global Weather Measurements, 1929-2009 (NCDC, GSOD)
- http://www7.ncdc.noaa.gov/CDO/cdoselect.cmd?datasetabbv=GSOD
- http://code.google.com/p/flyontime/source/browse/trunk/analysis/README.txt
- http://ckan.net/package/search?q=temperature
Languages #
- SQuAD: The Stanford Question Answering Dataset
- http://googleresearch.blogspot.com/2013/12/free-language-lessons-for-computers.html
Movies #
- [[https://github.com/hadley/data-movies]] - "download data from IMDB movies and parse into useful form"
- IMDB
- The Internet Movie Script Database
- Rotten tomatoes
Music #
See also Music
- http://labrosa.ee.columbia.edu/millionsong/
- http://musicdatascience.com/emi-million-interview-dataset/
- The Whitburn Project
- http://www.discogs.com - music database
- http://developer.echonest.com - song characteristics (bpm, danceability, etc.)
- http://www.whosampled.com
- Billboard Top 100 Songs, 1950-2015
- MusicNet
Networks #
Mobility #
U.S. Congress #
Web #
- http://www.commoncrawl.org/
- http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset
- Wikipedia clickstream dataset
World Bank #
Incoming Links #
Related Articles (Article 0) #
Suggested Pages #
- 0.099 Human song
- 0.093 Restaurants
- 0.086 Music informatics
- 0.075 Bird song
- 0.060 Musicology
- 0.054 Olfaction
- 0.030 Food choice
- 0.025 Protein-protein interaction network
- 0.025 Sheet music
- 0.025 Brain structural variation
- More suggestions...