Hadoop, Python, and NoSQL lead the pack for big data jobs

Rise in cloud-based analytics could increase demand for employees with more diversified skill sets

The demand for job skills related to data processing -- NoSQL, Apache Hadoop, Python, and a smattering of other such skills -- has hit all-time highs, according to statistics collected by tech job site Dice.com. The biggest gains, though, are for all things NoSQL.

Dice claims the number of job postings for "NoSQL experts" -- those with experience in unstructured data systems like MongoDB -- has risen 54 percent since last year. Other, related skills, such as Apache Hadoop and Python, have also posted significant year-over-year gains (43 percent and 16 percent, respectively). Python has become one of the big go-to languages for data processing, thanks to its simplicity and its wide selection of data-processing libraries.

Indeed.com and its Job Trends graph provide more details about which big data skills are most in demand. Indeed.com's stats show MongoDB is the most commonly mentioned of the NoSQL variants in job listings, with 4,979 entries as of this writing. Couchbase, Redis, and CouchDB are the three next most common NoSQL variants, with Riak, HBase, Neo4j, and Elasticsearch all trailing far behind.

When comparing MongoDB, Python, and Hadoop, Python is by far the most in-demand of the three, with some 27,000 jobs. However, Python developer jobs cover a great deal more than just big data, as expertise in Python can be applied to a broader range of jobs than MongoDB and Hadoop.

That said, the more analytics-related skills appear to command slightly higher pay. Indeed.com estimates that the majority of MongoDB jobs start somewhere north of $60,000, while the majority of Python and Hadoop jobs pay in the $50,000-and-up range.

Other, more generic job requests related to big data are also up, with the term "big data" showing a major surge in appearances -- up 46 percent year-over-year. Generic requests for expertise in SaaS and cloud are also up, by 20 percent and 27 percent, respectively. Dice claims one side effect of a rise in cloud-based analytics is a growing demand for employees with multiple skills in this category -- for example, both Hadoop and cloud storage.

Michael Rappa, creator of the first academic program devoted to data analytics, made a similar observation when InfoWorld spoke to him about big data jobs in 2012. Rappa's take at the time was that big data wasn't "a new specialty or suite of tools we have to train people into," but rather a "new organizational reality that everyone will need to adjust to occupationally," where multiple occupations across an organization would require new awareness of how to work with big data.

This story, "Hadoop, Python, and NoSQL lead the pack for big data jobs," was originally published by InfoWorld.
