More articles related to "hadoop python and Twitter"

http://www.wubiaoblog.com/archives/1159
http://blog.csdn.net/anbo724
http://f.dataguru.cn/forum.php?mod=viewthread&tid=36405&fromuid=4771
http://tianweidut.github.io/2012/12/dumbo/
http://wenku.it168.com/d_000096479.shtml
http://blog.csdn.net/v_ju…
Hadoop, Python, and NoSQL lead the pack for big data jobs. Rise in cloud-based analytics could increase demand for employees with more diversified skill sets. The demand for job skills related to data processing -- NoSQL, Apache Hadoop, Python, and a…
CODE:
#!/usr/bin/python
# -*- coding: utf-8 -*-
'''
Created on 2014-7-10
@author: guaguastd
@name: retweet_frequency_map.py
'''
if __name__ == '__main__':
    # import visualize
    from visualize import visualize_frequency_map
    # pip install prettytable
    # fr…

CODE:
#!/usr/bin/python
# -*- coding: utf-8 -*-
'''
Created on 2014-8-5
@author: guaguastd
@name: analyze_favorite_tweet.py
'''
if __name__ == '__main__':
    # import json
    #import json
    # import search
    from search import search_for_tweet
    # import get_fri…

CODE:
#!/usr/bin/python
# -*- coding: utf-8 -*-
'''
Created on 2014-7-18
@author: guaguastd
@name: collect_time_series.py
'''
if __name__ == '__main__':
    # import json
    import json
    # import partial
    from functools import partial
    # import trend
    from tren…

CODE:
#!/usr/bin/python
# -*- coding: utf-8 -*-
'''
Created on 2014-7-30
@author: guaguastd
@name: friends_followers_analyzer.py
'''
if __name__ == '__main__':
    # import json
    #import json
    # import search
    from search import search_for_tweet
    # import ge…

CODE:
#!/usr/bin/python
# -*- coding: utf-8 -*-
'''
Created on 2014-7-24
@author: guaguastd
@name: extract_retweet_attributions.py
'''
if __name__ == '__main__':
    # import login, see http://blog.csdn.net/guaguastd/article/details/31706155
    from login i…

CODE:
#!/usr/bin/python
# -*- coding: utf-8 -*-
'''
Created on 2014-7-9
@author: guaguastd
@name: entities_frequency_map.py
'''
if __name__ == '__main__':
    # import Counter
    from collections import Counter
    # import visualize
    from visualize import visua…

CODE:
#!/usr/bin/python
# -*- coding: utf-8 -*-
'''
Created on 2014-7-29
@author: guaguastd
@name: friends_followers_fetch.py
'''
if __name__ == '__main__':
    # import json
    import json
    # import search
    from search import search_for_tweet
    # import get_fr…
Contents: 1. Collecting data 1.1 Register Your App 1.2 Accessing the Data 1.3 Streaming 2. Text Pre-processing 2.1 The Anatomy of a Tweet 2.2 How to Tokenise a Tweet Text 3. Term Frequencies 3.1 Counting Terms 3.2 Removing stop-words 3.3 More term filters 4.Ru…