Hadoop, Python, and NoSQL lead the pack for big data jobs

 

Rise in cloud-based analytics could increase demand for employees with more diversified skill sets

The demand for job skills related to data processing -- NoSQL, Apache Hadoop, Python, and a smattering of other such skills -- has hit all-time highs, according to statistics collected by tech job site Dice.com. The biggest gains, though, are for all things NoSQL.

Dice claims the number of job postings for "NoSQL experts" -- those with experience in unstructured data systems like MongoDB -- has risen 54 percent since last year. Other, related skills, such as Apache Hadoop and Python, have also posted significant year-over-year gains (43 percent and 16 percent, respectively). Python has become one of the big go-to languages for data processing, thanks to its simplicity and its wide selection of data-processing libraries.

Indeed.com and its Job Trends graph provide more detail about which big data skills are most in demand. Indeed.com's stats show MongoDB is the most commonly mentioned of the NoSQL variants in job listings, with 4,979 entries as of this writing. Couchbase, Redis, and CouchDB are the next three most common NoSQL variants, with Riak, HBase, Neo4j, and Elasticsearch all trailing far behind.

Of MongoDB, Python, and Hadoop, Python is by far the most in demand, with some 27,000 jobs. However, Python developer jobs cover a great deal more than just big data, as expertise in Python can be applied to a broader range of jobs than expertise in MongoDB or Hadoop.

That said, the more analytics-related skills appear to command slightly higher pay. Indeed.com estimates that the majority of MongoDB jobs start somewhere north of $60,000, while the majority of Python and Hadoop jobs pay in the $50,000-and-up range.

Other, more generic job requests related to big data are also up, with the term "big data" showing a major surge in appearances -- up 46 percent year-over-year. Generic requests for expertise in SaaS and cloud are also up, by 20 percent and 27 percent, respectively. Dice claims one side effect of a rise in cloud-based analytics is a growing demand for employees with multiple skills in this category -- for example, both Hadoop and cloud storage.

Michael Rappa, creator of the first academic program devoted to data analytics, made a similar observation when InfoWorld spoke to him about big data jobs in 2012. Rappa's take at the time was that big data wasn't "a new specialty or suite of tools we have to train people into," but rather a "new organizational reality that everyone will need to adjust to occupationally," where multiple occupations across an organization would require new awareness of how to work with big data.

This story, "Hadoop, Python, and NoSQL lead the pack for big data jobs," was originally published by InfoWorld.
