A List of Social Tagging Datasets Made Available for Research
This list is not exhaustive - help expand it!
| Social Tagging Systems | Research Group | Source | Year Obtained | Availability | Contact | References |
| --- | --- | --- | --- | --- | --- | --- |
| CiteULike | Oversity Ltd. | Primary | Daily Snapshots | Via Download after Email (link) | Richard Cameron | |
| BibSonomy | KDE | Primary | Periodic snapshots every six months | Available after signing a license agreement | Andreas Hotho | [Hotho 2006] |
| MovieLens | GroupLens | Primary | 2009 | Via Download (link) | GroupLens Info | [Sen 2006] |
| GiveALink | NaN Group | Primary | Current information via API | Via API | Filippo Menczer | [Markines 2009] |
| ESP Game | Luis von Ahn | Primary | 2006 | Via Download (link) | Luis von Ahn | [VonAhn 2004] |
| Delicious | DAI Labor | Secondary | 2007/2008 | Via Email Request | Alan Said | [Wetzker 2008] |
| Delicious, StumbleUpon & Wikipedia | NLP and Information Retrieval Group | Secondary | 2008/2009 | Via Download (link) | Arkaitz Zubiaga | [Zubiaga 2009a] [Zubiaga 2009b] [Zubiaga 2009c] |
| Delicious, Flickr, Last.fm, zexe.net | TAGora | Secondary | 2006, 2007, 2008 | Via Download (link) | Vittorio Loreto | |
| Delicious, Flickr, Diigo, BibSonomy and others | Agents and Social Computation | Secondary | 2009 | Via Email Request | Markus Strohmaier | [Grahsl 2010] |
If you know of other available datasets, please let me know by leaving a comment on the corresponding blog post.
Page updated and maintained by Markus Strohmaier.
References
[Grahsl 2010] H.P. Grahsl, C. Körner, M. Strohmaier. A Collection of Tagging Datasets Containing Complete Personomies From Heterogeneous Sources. Technical Report, Knowledge Management Institute, Graz University of Technology. To be published in 2010.
[Hotho 2006] A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. BibSonomy: A Social Bookmark and Publication Sharing System. In Aldo de Moor, Simon Polovina, and Harry Delugach, editors, Proceedings of the Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures, Aalborg, Denmark. 2006.
[Markines 2009] B. Markines and F. Menczer. A Scalable, Collaborative Similarity Measure for Social Annotation Systems. Proc. 20th ACM Conf. on Hypertext and Hypermedia (HT). 2009.
[Sen 2006] S. Sen, S. K. Lam, A. M. Rashid, D. Cosley, D. Frankowski, J. Osterhouse, F. M. Harper, and J. Riedl. tagging, communities, vocabulary, evolution. In CSCW '06: Proceedings of the 2006 20th Anniversary Conference on Computer Supported Cooperative Work, pages 181-190, New York, NY, USA, 2006. ACM.
[VonAhn 2004] L. von Ahn and L. Dabbish. Labeling Images with a Computer Game. ACM Conference on Human Factors in Computing Systems, CHI 2004. pp 319-326.
[Wetzker 2008] R. Wetzker, C. Zimmermann, and C. Bauckhage. Analyzing Social Bookmarking Systems: A Delicious cookbook. In Mining Social Data (MSoDa) Workshop Proceedings, pp. 26-30. ECAI 2008, (July 2008).
[Zubiaga 2009a] A. Zubiaga, R. Martínez, and V. Fresno. Getting the Most Out of Social Annotations for Web Page Classification. Proceedings of DocEng 2009, the 9th ACM Symposium on Document Engineering, pp. 74-83, Munich, Germany. 2009.
[Zubiaga 2009b] A. Zubiaga, A. P. García-Plaza, V. Fresno, and R. Martínez. Content-based Clustering for Tag Cloud Visualization. Proceedings of ASONAM 2009, International Conference on Advances in Social Networks Analysis and Mining. 2009.
[Zubiaga 2009c] A. Zubiaga. Enhancing Navigation on Wikipedia with Social Tags. Wikimania 2009. Buenos Aires, Argentina. 2009.
Last edited on December 7, 2009 (Christian Körner, Markus Strohmaier)
http://www.markusstrohmaier.info/datasets/
See also: http://www.tagora-project.eu/data/