I’m currently compiling a list of social tagging datasets for our current / future research, but since it might be of interest to others as well I’m sharing it. Here’s the link:
A non-exhaustive list of Social Tagging Datasets that are available for research
If you are aware of other social tagging datasets available for research, please let me know by leaving a comment to this post.
http://www.datawrangling.com/some-datasets-available-on-the-web
Ritesh, I’m looking for _tagging_ datasets specifically (i.e. datasets of tagged resources such as delicious, last.fm, flickr, etc), and it seems that the list contains mainly non-tagging datasets. Are you aware of additional _tagging_ datasets? thanks for your comment. M.
Here is a collection of folksonomy data sets: http://www.tagora-project.eu/data/ (not all mentioned sets are available though)
Part of bibsonomy data – for KSDC challenge 08: http://www.kde.cs.uni-kassel.de/ws/rsdc08/dataset.html
Three more datasets are available here:
http://nlp.uned.es/social-tagging/
Thanks everyone for your replies. I’d like to point you to the URL [1] again, as it now contains an updated version of the list.
[1] http://kmi.tugraz.at/staff/markus/datasets/
Markus, great job. Some other datasets:
* http://www.yr-bcn.es/dokuwiki/doku.php?id=semantically_annotated_snapshot_of_wikipedia
* http://blogs.sun.com/plamere/entry/open_research_the_data_lastfm
* http://musicbrainz.org/doc/Database