A List of Social Tagging Datasets Made Available for Research
This list is not exhaustive - help expand it!
Social Tagging Systems | Research Group | Source | Year Obtained | Availability | Contact | References |
CiteULike | Oversity Ltd. | Primary | Daily Snapshots | Via Download after Email (link) | Richard Cameron | |
Bibsonomy | KDE | Primary | Periodical Snapshots every half year | Available after signed license agreement | Andreas Hotho | [Hotho 2006] |
MovieLens | GroupLens | Primary | 2009 | Via Download (link) | GroupLens Info | [Sen 2006] |
GiveALink | NaN Group | Primary | Current information via API | Via API | Filippo Menczer | [Markines 2009] |
ESP Game | Luis von Ahn | Primary | 2006 | Via Download (link) | Luis von Ahn | [VonAhn 2004] |
Delicious | DAI Labor | Secondary | 2007/2008 | Via Email Request | Alan Said | [Wetzker 2006] |
Delicious, Stumble Upon & Wikipedia | NLP and Information Retrieval Group | Secondary | 2008/2009 | Via Download (link) | Arkaitz Zubiaga | [Zubiaga 2009a] [Zubiaga 2009b] [Zubiaga 2009c] |
Delicious, Flickr, Last.fm, zexe.net | TAGora | Secondary | 2006, 2007, 2008 | Via Download (link) | Vittorio Loreto | |
Delicious, Flickr, Diigo, Bibsonomy and others | Agents and Social Computation | Secondary | 2009 | Via Email Request | Markus Strohmaier | [Grahsl 2010] |
In case you are aware of other available datasets, please let me know by leaving a comment on a corresponding blog post.
Page updated and maintained by Markus Strohmaier.
References
[Grahsl 2010] H.P. Grahsl, C. Körner, M. Strohmaier. A Collection of Tagging Datasets Containing Complete Personomies From Heterogeneous Sources. Technical Report, Knowledge Management Institute, Graz University of Technology. To be published in 2010
[Hotho 2006] A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. BibSonomy: A Social Bookmark and Publication Sharing System. In Aldo de Moor, Simon Polovina, and Harry Delugach, editors, Proceedings of the Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures, Aalborg, Denmark
[Markines 2009] B. Markines and F. Menczer. A Scalable, Collaborative Similarity Measure for Social Annotation Systems. Proc. 20th ACM Conf. on Hypertext and Hypermedia (HT).
[Sen 2006] S. Sen, S. K. Lam, A. M. Rashid, D. Cosley, D. Frankowski, J. Osterhouse, F. M. Harper, and J. Riedl. tagging, communities, vocabulary, evolution. In CSCW '06: Proceedings of the 2006 20th Anniversary Conference on Computer Supported Cooperative Work, pages 181-190, New York, NY, USA, 2006. ACM.
[VonAhn 2004] L. von Ahn and L. Dabbish. Labeling Images with a Computer Game. ACM Conference on Human Factors in Computing Systems, CHI 2004. pp 319-326.
[Wetzker 2008] R. Wetzker, C. Zimmermann, and C. Bauckhage. Analyzing Social Bookmarking Systems: A Delicious cookbook. In Mining Social Data (MSoDa) Workshop Proceedings, pp. 26-30. ECAI 2008, (July 2008).
[Zubiaga 2009a] A. Zubiaga, R. Mart穩nez, and V. Fresno. Getting the Most Out of Social Annotations for Web Page Classification. Proceedings of DocEng 2009, the 9th ACM Symposium on Document Engineering, pp. 74-83, Munich, Germany. 2009.
[Zubiaga 2009b] A. Zubiaga, A. P. Garc穩a-Plaza, V. Fresno, and R. Mart穩nez. Content-based Clustering for Tag Cloud Visualization. Proceedings of ASONAM 2009, International Conference on Advances in Social Networks Analysis and Mining. 2009.
[Zubiaga 2009c] A. Zubiaga. Enhancing Navigation on Wikipedia with Social Tags. Wikimania 2009. Buenos Aires, Argentina. 2009.
Last edited on December 7, 2009 (Christian Körner, Markus Strohmaier)
http://www.markusstrohmaier.info/datasets/
另外:http://www.tagora-project.eu/data/
A List of Social Tagging Datasets Made Available for Research的更多相关文章
- 近年Recsys论文
2015年~2017年SIGIR,SIGKDD,ICML三大会议的Recsys论文: [转载请注明出处:https://www.cnblogs.com/shenxiaolin/p/8321722.ht ...
- Install SharePoint 2013 on Windows Server 2012 without a domain
Any setup of Team Foundation Server is not complete until you have at least tried t work with ShareP ...
- Use of Deep Learning in Modern Recommendation System: A Summary of Recent Works(笔记)
注意:论文中,很多的地方出现baseline,可以理解为参照物的意思,但是在论文中,我们还是直接将它称之为基线,也 就是对照物,参照物. 这片论文中,作者没有去做实际的实验,但是却做了一件很有意义的事 ...
- 关于LDA的文章
转:http://www.zhizhihu.com/html/y2011/3228.html l Theory n Introduction u Unsupervised learning by ...
- Link-based Classification相关数据集
Link-based Classification相关数据集 Datasets Document Classification Datasets: CiteSeer: The CiteSeer dat ...
- Open Data for Deep Learning
Open Data for Deep Learning Here you’ll find an organized list of interesting, high-quality datasets ...
- SharePoint 2010 搜索结果没有显示部分文件
Why SharePoint 2010 search does not show some results? SharePoint 2010 search is better than ever ...
- 论文翻译——Character-level Convolutional Networks for Text Classification
论文地址 Abstract Open-text semantic parsers are designed to interpret any statement in natural language ...
- paper 118:计算机视觉、模式识别、机器学习常用牛人主页链接
牛人主页(主页有很多论文代码) Serge Belongie at UC San Diego Antonio Torralba at MIT Alexei Ffros at CMU Ce Liu at ...
随机推荐
- Spring整合jdbc
首先web.xml文件跟往常一样,加载spring容器和加载org.springframework.web.context.ContextLoaderListener读取applicationCont ...
- 【学习笔记】Y2-1-1 Oracle数据库基础
Oracle 简介关系型(二维表)数据库 用来存储海量数据在大数据量的并发检索的情况下,性能要高于其他同类数据库产品一般运行环境是Linux和UnixOracle版本中的I(Internet) G(G ...
- (引用)Python 生成随机数小结
转载:http://blog.csdn.net/shuaijiasanshao/article/details/51339438
- 启动Tomcat时报 Expected stackmap frame at this location.(JDK1.7编译)
从svn上下的项目,部署到tomcat 7.0.19 上, 并且配置的是jdk7. 启动时出现以下问题. Location: com/genlot/loms/service/SysPermissio ...
- Xamarin Mono 环境搭建(使用Visual Studio 2013 开发android 和 ios )
本文主要介绍Xamarin结合VS2013来开发Android应用程序,主要会介绍Mono和Xamarin的关系,以及整个搭建环境的过程. 一.Mono和Xamarin介绍 1.Mono简介 Mono ...
- css之absolute绝对定位(绝对定位特性)
学习了绝对定位以后,对此进行一个总结,啦啦啦啦~ 绝对定位特性 1.破坏性 破坏了原有的位置,从文档流里脱离出来 2.包裹性 如果下面这种情况,父级元素将不会有高度和宽度,失去原有的大小
- perl 引用
数组的数组 $a = [ [1, 2, 3], [4, 5, 6], [7, 8, 9] ] 哈希的哈希 my $student_properties_of = { 'zdd' => { 'ag ...
- ExtJs知识点概述
1.前言 ExtJS的前身是YUI(Yahoo User Interface).经过不断的发展与改进,ExtJS现在已经成功发布到了ExtJS 6版本,是一套目前最完整和最成熟的javascript基 ...
- 如何优化sql语句
1. 首先要搞明白什么叫执行计划? 执行计划是数据库根据SQL语句和相关表的统计信息作出的一个查询方案,这个方案是由查询优化器自动分析产生的,比如一条SQL语句如果用来从一个 10万条记录的表中查1条 ...
- NDB Cluster 存储引擎物理备份
NDB Cluster 存储引擎物理备份NDB Cluster 存储引擎也是一款事务性存储引擎,和Innodb 一样也有redo 日志.NDBCluter 存储引擎自己提供了备份功能,可以通过相关的命 ...