Spark编译安装和运行
一、环境说明
Mac OSX Java 1.7.0_71 Spark
二、编译安装
tar -zxvf spark-.tgz cd spark- ./sbt/sbt assembly
ps:如果之前执行过编译,需要执行 ./sbt/sbt clean
清理后才能重新编译。
三、运行
adeMacBook-Pro:spark- apple$ ./bin/spark-shell log4j:WARN No appenders could be found for logger (org.apache.hadoop.metrics2.lib.MutableMetricsFactory). log4j:WARN Please initialize the log4j system properly. log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info. Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties // :: INFO SecurityManager: Changing view acls to: apple // :: INFO SecurityManager: Changing modify acls to: apple // :: INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(apple); users with modify permissions: Set(apple) // :: INFO HttpServer: Starting HTTP Server // :: INFO Server: jetty-.y.z-SNAPSHOT // :: INFO AbstractConnector: Started SocketConnector@ // :: INFO Utils: Successfully started service . Welcome to ____ __ / __/__ ___ _____/ /__ _\ \/ _ \/ _ `/ __/ '_/ /___/ .__/\_,_/_/ /_/\_\ version /_/ Using Scala version (Java HotSpot(TM) -Bit Server VM, Java 1.7.0_71) Type in expressions to have them evaluated. Type :help for more information. // :: INFO SparkContext: Running Spark version // :: INFO SecurityManager: Changing view acls to: apple // :: INFO SecurityManager: Changing modify acls to: apple // :: INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(apple); users with modify permissions: Set(apple) // :: INFO Slf4jLogger: Slf4jLogger started // :: INFO Remoting: Starting remoting // :: INFO Remoting: Remoting started; listening on addresses :[akka.tcp://sparkDriver@192.168.1.106:61567] // :: INFO Utils: Successfully started service . // :: INFO SparkEnv: Registering MapOutputTracker // :: INFO SparkEnv: Registering BlockManagerMaster // :: INFO DiskBlockManager: Created local directory at /-4d54-89f3-8d97bf15205f/blockmgr-b8410cda-aa29---d6155512cd53 // :: INFO MemoryStore: MemoryStore started with capacity 265.4 MB // :: INFO HttpFileServer: HTTP File server directory -4d54-89f3-8d97bf15205f/httpd-a1838f08-2ccd-42d2--6e91cb6fdfad // :: INFO HttpServer: Starting HTTP Server // :: INFO Server: jetty-.y.z-SNAPSHOT // :: INFO AbstractConnector: Started SocketConnector@ // :: INFO Utils: Successfully started service . // :: INFO SparkEnv: Registering OutputCommitCoordinator // :: INFO Server: jetty-.y.z-SNAPSHOT // :: INFO AbstractConnector: Started SelectChannelConnector@ // :: INFO Utils: Successfully started service . // :: INFO SparkUI: Started SparkUI at http://192.168.1.106:4040 // :: INFO Executor: Starting executor ID driver on host localhost // :: INFO Executor: Using REPL class URI: http://192.168.1.106:61566 // :: INFO Utils: Successfully started service . // :: INFO NettyBlockTransferService: Server created on // :: INFO BlockManagerMaster: Trying to register BlockManager // :: INFO BlockManagerMasterEndpoint: Registering block manager localhost: with ) // :: INFO BlockManagerMaster: Registered BlockManager // :: INFO SparkILoop: Created spark context.. Spark context available as sc. // :: INFO SparkILoop: Created sql context.. SQL context available as sqlContext. scala>
参考:
https://spark.apache.org/docs/latest/
三、使用spark交互模式
. 运行./spark-shell.sh . scala> val data = Array(, , , , ) //产生data data: Array[Int] = Array(, , , , ) . scala> val distData = sc.parallelize(data) //将data处理成RDD distData: spark.RDD[Int] = spark.ParallelCollection@7a0ec850 (显示出的类型为RDD) . scala> distData.reduce(_+_) //在RDD上进行运算,对data里面元素进行加和 // :: INFO spark.SparkContext: Starting job... . 最后运行得到 // :: INFO spark.SparkContext: Job finished in 0.076729174 s res2: Int =
Spark编译安装和运行的更多相关文章
- Heka 编译安装后 运行报错 panic: runtime error: cgo argument has Go pointer to Go pointer
Heka 编译安装后 运行报错 panic: runtime error: cgo argument has Go pointer to Go pointer 解决办法: 1. Start heka ...
- Spark入门实战系列--2.Spark编译与部署(下)--Spark编译安装
[注]该系列文章以及使用到安装包/测试数据 可以在<倾情大奉送--Spark入门实战系列>获取 .编译Spark .时间不一样,SBT是白天编译,Maven是深夜进行的,获取依赖包速度不同 ...
- spark编译安装 spark 2.1.0 hadoop2.6.0-cdh5.7.0
1.准备: centos 6.5 jdk 1.7 Java SE安装包下载地址:http://www.oracle.com/technetwork/java/javase/downloads/java ...
- Ubuntu16.04下编译安装及运行单目ORBSLAM2
官网有源代码和配置教程,地址是 https://github.com/raulmur/ORB_SLAM2 1 安装必要工具 首先,有两个工具是需要提前安装的.即cmake和Git. sudo apt- ...
- spark下载安装,运行examples(spark一)
1.官方网址 http://spark.apache.org/ image.png 2.点击下载 下载最新版本目前是(2.4.3)此spark预设为hadoop2.7或者更高版本,我前面安装的是had ...
- 基于cdh5.10.x hadoop版本的apache源码编译安装spark
参考文档:http://spark.apache.org/docs/1.6.0/building-spark.html spark安装需要选择源码编译方式进行安装部署,cdh5.10.0提供默认的二进 ...
- Spark入门实战系列--2.Spark编译与部署(上)--基础环境搭建
[注] 1.该系列文章以及使用到安装包/测试数据 可以在<倾情大奉送--Spark入门实战系列>获取: 2.Spark编译与部署将以CentOS 64位操作系统为基础,主要是考虑到实际应用 ...
- Spark编译与部署
Spark入门实战系列--2.Spark编译与部署(上)--基础环境搭建 [注] 1.该系列文章以及使用到安装包/测试数据 可以在<倾情大奉送--Spark入门实战系列>获取: 2.S ...
- MySQL编译安装
1.准备工作 其官方站点为http://www.mysql.com/ 为了避免发生端口冲突.程序冲突现象.建议先查询MySQL软件的安装情况,确认没有使用以RPM方式安装的mysql-server.m ...
随机推荐
- Machine Learning Methods: Decision trees and forests
Machine Learning Methods: Decision trees and forests This post contains our crib notes on the basics ...
- mysql随机获取一条或者多条数据
原文地址:http://www.im286.com/thread-7091552-1-1.html 转来备份 研究一些随机的因素,主要是讲究效率问题. 语句一: MYSQL手册里面针对RAND()的提 ...
- Struts2拦截器的应用
拦截器类 public class AdminInterceptor extends AbstractInterceptor { private static final long serialVer ...
- Data conversion error converting
词错如果出现在sql语句中,那么多半是类型转换的问题
- jar包与lib包的区别
jar包是编译时使用,假如编译出错代码没问题一定是jar包的问题,lib是运行时使用,比如程序启动后出错了但是编译没有问题,就可能是lib出错了,不会是jar包的问题.
- 关于markdown需要澄清的一些误解
关于markdown需要澄清的误解: 首先, 最大的一个误解就是 转义! markdown不支持对小于号 < 的转义, 如 \<"pre">, 这时候仍然会认为是 ...
- javascript,css延迟加载器
/** * js/css LazyLoad * * 变量hash记录已经加载过的资源,避免重复加载 * * Z.loadScript('a.js', function() { ... }) * * Z ...
- Toast工具类,Android中不用再每次都写烦人的Toast了
package com.zhanggeng.contact.tools; /** * Toasttool can make you use Toast more easy ; * * @author ...
- NSArray和NSMutableArray
//1. NSArray EOItems *eOItems = [[EOItems alloc] init]; eOItems.ID = [NSNumber numberWithInt:]; NSAr ...
- [BZOJ2423][HAOI2010]最长公共子序列
[BZOJ2423][HAOI2010]最长公共子序列 试题描述 字符序列的子序列是指从给定字符序列中随意地(不一定连续)去掉若干个字符(可能一个也不去掉)后所形成的字符序列.令给定的字符序列X=“x ...