deep api integration makes getting value from your big data easy

深度api集成使你大数据訪问更加easy

Elasticsearch is quickly becoming the de facto search and analytics solution that organizations are using to provide real-time insights into their Hadoop data. Elasticsearch for Hadoop—affectionately known as es-hadoop—is a two-way connector that lets you
index data into Elasticsearch and query it in real time. With a native API implementation, fast indexing, and a rich query language, es-hadoop is optimized for performance and efficiency, making it an elegant solution for your big data projects. With support
for a wide range of libraries, Elasticsearch helps you to make better use of your data across the entire Hadoop ecosystem.

data can seamlessly move between Elasticsearch and Hadoop

  • Index directly into Elasticsearch from Hadoop 直接对hadoop上的数据建立索引

    The native integration allows you to efficiently push data into Elasticsearch using the existing Hadoop tools you know and love ,原生态的集成同意你通过你喜欢的hadoop工具将数据推送到ElasticSearch中
  • Query Elasticsearch from Hadoop从hadoop查询Elasticsearch

    The rich query API of Elasticsearch allows you to ask complex questions and use the real-time results in Hadoop.Elasticsearch丰富的查询api支持你迅速取得对hadoop的复杂查询结果。

  • Use HDFS as a long-term archive for Elasticsearch使用HDFS对Elasticsearch索引长期存档

    es-hadoop allows Elasticsearch to push backup data to HDFS using the built-in snapshot and restore capability.es-hadoop插件同意es推送备份数据到HDFS通过使用快照的方式和恢复这些数据到es

how people are using Elasticsearch and Hadoop

      • Klout Queries Over 400M Users’ Data To Build Marketing Campaigns

        Using HDFS to store user data and index it into Elasticsearch, Klout builds real-time targeted marketing campaigns that are generated in seconds rather than minutes.
      • MutualMind Replaces 15-Minute Batch Process with Real-Time Analysis

        With customers like AT&T, Kraft, Nestle, and Starbucks interested in keeping a pulse on their brands, MutualMind uses Elasticsearch to get quick insight and Hadoop for batch-based statistical analysis.
      • International Financial Services Firm Quickly Analyzes Access Logs

        Instead of waiting hours to run MapReduce jobs to analyze access logs, a global financial institution gets value from its data with Elasticsearch in minutes—and even increased the quantity of log data it processed from one hour to a full week.

works with any flavor of Hadoop distribution

We are official partners with a number of organizations within the Hadoop ecosystem, including Cloudera, MapR, Hortonworks, Databricks, and Concurrent. Whether you’re using vanilla Hadoop, or other distributions like CDH,
HDP, and MapR, Elasticsearch has got you covered. As an added bonus, we are also certified on Cloudera Enterprise 5 and are Certified Technology Partners with Hortonworks.

take a look under the hood

visualize your big data

Elasticsearch works with the visualization tool Kibana to help you explore your big data with in real time. With beautifully designed graphs, charts, and maps, Kibana transforms your data into real-time, customizable dashboards that let you visualize the value
of your data.

leave the real-time analytics to us

Gone are the days of waiting hours or more for a batch process to run in order to get insight into your Hadoop data. Elasticsearch provides responses in milliseconds, which can significantly reduce a Hadoop job’s execution time and the cost associated with
it, especially on “rented resources” such as Amazon EMR or EC2.

ask more sophisticated questions

Elasticsearch provides a robust query DSL that lets users to ask sophisticated questions that result in more complete answers, faster.

prepared for when things go awry

Elasticsearch is designed to tolerate hardware failures. Es-hadoop continues communicating with the cluster, even when failures occur.

added efficiency with our native integration

Elasticsearch is natively integrated with Hadoop so there is no gap for the user to bridge. We provide a dedicated Input and Output format for vanilla MapReduce, taps for reading and writing data in Cascading, storages for Pig and Hive, a native Spark Resilient
Distributed Dataset (RDD) for both Java and Scala, and support for Storm’s bolt and spout abstractions so you can access Elasticsearch just as if the data were in HDFS.

enhance your workflow to get the best of both worlds

Get maximum flexibility with the es-hadoop connector by leveraging everything that Hadoop has to offer (via MapReduce, Hive, Pig, Cascading, Spark, and Storm) and combining it with a real-time search and analytics capability of Elasticsearch.

need to grow? just add more nodes.

Elasticsearch can be scaled in the same way as your Hadoop cluster – add more Elasticsearch nodes and the data will be automatically re-balanced.

原文网址:http://www.elasticsearch.com/products/hadoop/

explore your hadoop data and get real-time results的更多相关文章

  1. hadoop data 相关开源项目(近期学习计划)

    计划学习几个hadoop相关的开源项目: 1.spring hadoop 2.spring batch 3.spring redis 4.spring mongo 相关项目样例:https://git ...

  2. 【Repost】A Practical Intro to Data Science

    Are you a interested in taking a course with us? Learn about our programs or contact us at hello@zip ...

  3. Choosing Between ElasticSearch, MongoDB & Hadoop

    An interesting trend has been developing in the IT landscape over the past few years.  Many new tech ...

  4. Awesome Big Data List

    https://github.com/onurakpolat/awesome-bigdata A curated list of awesome big data frameworks, resour ...

  5. zookeeper集群的搭建以及hadoop ha的相关配置

    1.环境 centos7 hadoop2.6.5 zookeeper3.4.9 jdk1.8 master作为active主机,data1作为standby备用机,三台机器均作为数据节点,yarn资源 ...

  6. Hadoop伪分布式集群环境搭建

    本教程讲述在单机环境下搭建Hadoop伪分布式集群环境,帮助初学者方便学习Hadoop相关知识. 首先安装Hadoop之前需要准备安装环境. 安装Centos6.5(64位).(操作系统再次不做过多描 ...

  7. 初识Hadoop

    第一部分:              初识Hadoop 一.             谁说大象不能跳舞 业务数据越来越多,用关系型数据库来存储和处理数据越来越感觉吃力,一个查询或者一个导出,要执行很长 ...

  8. hadoop分布式存储(2)-hadoop的安装(毕业设计)

    总共分三步:1.准备linux环境 租用"云主机",阿里云,unitedStack等,云主机不受本机性能影响(或者直接安转linux操作系统或者虚拟机也行): PuTTy Conf ...

  9. Hadoop入门之安装配置(hadoop-0.20.2)

    Hadoop,简单理解为HDFS(分布式存储)+Mapreduce(分布式处理),专为离线和大规模数据分析而设计. Hadoop可以把很多linux的廉价PC组成分布式结点,然后编程人员也不需要知道分 ...

随机推荐

  1. HTML5, CSS3, ES5新的web标准和浏览器支持一览 转

    本文整理了一些最重要(或者说人气比较高罢)的新标准,虽然它们多数还只是w3c的草案,离Recommendation级别还早,却已经成为新一轮浏览器大战中备受追捧的明星,开发者社区里也涌现出大量相关的d ...

  2. HDU OJ Digital Roots 题目1013

     /*Digital Roots Time Limit: 2000/1000 MS (Java/Others)    Memory Limit: 65536/32768 K (Java/Other ...

  3. DateNavigator

    <Border BorderThickness="1,1,1,1" BorderBrush="Black" Grid.Column="1&quo ...

  4. (转)[unity3d]easytouch的使用

    对于移动平台上的RPG类的游戏,我们常用虚拟摇杆来控制人物角色的行走和一些行为,相信我们对它并不陌生,之前尝试了EasyTouch2.5,发现并没有最新版的3.1好用,2.5版本的对于自适应没有做的很 ...

  5. Java Web -- Servlet(5) 开发Servlet的三种方法、配置Servlet具体解释、Servlet的生命周期(2)

    三.Servlet的生命周期 一个Java servlet具有一个生命周期,这个生命周期定义了一个Servlet怎样被加载并被初始化,怎样接收请求并作出对请求的响应,怎样被从服务中清除.Servlet ...

  6. Nginx启用ssl以及免费证书申请

    主要是这个东西,折腾了我两天,所以记录下来. 最开始是在meteor下面调用一个webservice,但是发现meteor项目的发布环境时https,所以请求的webservice也必须时webser ...

  7. 第一节,学习cocos2d-x的前期准备

    1,我用的mac系统,在mac系统上装上cocos2d-x的模板 2,用doxygen工具装上API,这个非常重要,没有API的开发不叫开发,因此我们要习惯看API 3,知道怎么查看cocos2d-x ...

  8. 微信小程序 - 上拉加载下拉刷新

    点击下载示例:小程序 - 上拉刷新下拉加载

  9. php之快速入门学习-6(字符串变量)

    PHP 字符串变量 字符串变量用于存储并处理文本. PHP 中的字符串变量 字符串变量用于包含有字符的值. 在创建字符串之后,我们就可以对它进行操作了.您可以直接在函数中使用字符串,或者把它存储在变量 ...

  10. 用Navicat Premium 操作MySQL数据库

    1. 用Navicat来查看MySQL数据库        打开Navicat Premium–>[连接]–>[MySQL]–>[连接名:新建数据库的名字,此处为“本地”]:[主机: ...