Java性能漫谈－数组复制之System.arraycopy

当我还年幼的时候，我很任性，复制数组也是，写一个for循环，来回倒腾，后来长大了，就发现了System.arraycopy的好处。

为了测试俩者的区别我写了一个简单赋值int[100000]的程序来对比，并且中间使用了nanoTime来计算时间差：

程序如下：

        int[] a = new int[100000];

        for(int i=0;i<a.length;i++){

            a[i] = i;

        }

        int[] b = new int[100000];

        int[] c = new int[100000];

        for(int i=0;i<c.length;i++){

            c[i] = i;

        }

        int[] d = new int[100000];

        for(int k=0;k<10;k++){

            long start1 = System.nanoTime();

            for(int i=0;i<a.length;i++){

                b[i] = a[i];

            }

            long end1 = System.nanoTime();

            System.out.println("end1 - start1 = "+(end1-start1));

            long start2 = System.nanoTime();

            System.arraycopy(c, 0, d, 0, 100000);

            long end2 = System.nanoTime();

            System.out.println("end2 - start2 = "+(end2-start2));

            System.out.println();

        }

为了避免内存不稳定干扰和运行的偶然性结果，我在一开始的时候把所有空间申明完成，并且只之后循环10次执行，得到如下结果：

end1 - start1 = 366806

end2 - start2 = 109154

end1 - start1 = 380529

end2 - start2 = 79849

end1 - start1 = 421422

end2 - start2 = 68769

end1 - start1 = 344463

end2 - start2 = 72020

end1 - start1 = 333174

end2 - start2 = 77277

end1 - start1 = 377335

end2 - start2 = 82285

end1 - start1 = 370608

end2 - start2 = 66937

end1 - start1 = 349067

end2 - start2 = 86532

end1 - start1 = 389974

end2 - start2 = 83362

end1 - start1 = 347937

end2 - start2 = 63638

可以看出，System.arraycopy的性能很不错，为了看看究竟这个底层是如何处理的，我找到openJDK的一些代码留恋了一些：

System.arraycopy是一个native函数，需要看native层的代码：

    public static native void arraycopy(Object src,  int  srcPos,

                                        Object dest, int destPos,

                                        int length);

找到对应的openjdk6-src/hotspot/src/share/vm/prims/jvm.cpp，这里有JVM_ArrayCopy的入口：

JVM_ENTRY(void, JVM_ArrayCopy(JNIEnv *env, jclass ignored, jobject src, jint src_pos,

                               jobject dst, jint dst_pos, jint length))

  JVMWrapper("JVM_ArrayCopy");

  // Check if we have null pointers

  if (src == NULL || dst == NULL) {

    THROW(vmSymbols::java_lang_NullPointerException());

  }

  arrayOop s = arrayOop(JNIHandles::resolve_non_null(src));

  arrayOop d = arrayOop(JNIHandles::resolve_non_null(dst));

  assert(s->is_oop(), "JVM_ArrayCopy: src not an oop");

  assert(d->is_oop(), "JVM_ArrayCopy: dst not an oop");

  // Do copy

  Klass::cast(s->klass())->copy_array(s, src_pos, d, dst_pos, length, thread);

JVM_END

前面的语句都是判断，知道最后的copy_array(s, src_pos, d, dst_pos, length, thread)是真正的copy，进一步看这里，在openjdk6-src/hotspot/src/share/vm/oops/typeArrayKlass.cpp中：

void typeArrayKlass::copy_array(arrayOop s, int src_pos, arrayOop d, int dst_pos, int length, TRAPS) {

  assert(s->is_typeArray(), "must be type array");

  // Check destination

  if (!d->is_typeArray() || element_type() != typeArrayKlass::cast(d->klass())->element_type()) {

    THROW(vmSymbols::java_lang_ArrayStoreException());

  }

  // Check is all offsets and lengths are non negative

  if (src_pos <  || dst_pos <  || length < ) {

    THROW(vmSymbols::java_lang_ArrayIndexOutOfBoundsException());

  }

  // Check if the ranges are valid

  if  ( (((unsigned int) length + (unsigned int) src_pos) > (unsigned int) s->length())

     || (((unsigned int) length + (unsigned int) dst_pos) > (unsigned int) d->length()) ) {

    THROW(vmSymbols::java_lang_ArrayIndexOutOfBoundsException());

  }

  // Check zero copy

  if (length == )

    return;

  // This is an attempt to make the copy_array fast.

  int l2es = log2_element_size();

  int ihs = array_header_in_bytes() / wordSize;

  char* src = (char*) ((oop*)s + ihs) + ((size_t)src_pos << l2es);

  char* dst = (char*) ((oop*)d + ihs) + ((size_t)dst_pos << l2es);

  Copy::conjoint_memory_atomic(src, dst, (size_t)length << l2es);//还是在这里处理copy

}

这个函数之前的仍然是一堆判断，直到最后一句才是真实的拷贝语句。

在openjdk6-src/hotspot/src/share/vm/utilities/copy.cpp中找到对应的函数：

// Copy bytes; larger units are filled atomically if everything is aligned.

void Copy::conjoint_memory_atomic(void* from, void* to, size_t size) {

  address src = (address) from;

  address dst = (address) to;

  uintptr_t bits = (uintptr_t) src | (uintptr_t) dst | (uintptr_t) size;

  // (Note:  We could improve performance by ignoring the low bits of size,

  // and putting a short cleanup loop after each bulk copy loop.

  // There are plenty of other ways to make this faster also,

  // and it's a slippery slope.  For now, let's keep this code simple

  // since the simplicity helps clarify the atomicity semantics of

  // this operation.  There are also CPU-specific assembly versions

  // which may or may not want to include such optimizations.)

  if (bits % sizeof(jlong) == ) {

    Copy::conjoint_jlongs_atomic((jlong*) src, (jlong*) dst, size / sizeof(jlong));

  } else if (bits % sizeof(jint) == ) {

    Copy::conjoint_jints_atomic((jint*) src, (jint*) dst, size / sizeof(jint));

  } else if (bits % sizeof(jshort) == ) {

    Copy::conjoint_jshorts_atomic((jshort*) src, (jshort*) dst, size / sizeof(jshort));

  } else {

    // Not aligned, so no need to be atomic.

    Copy::conjoint_jbytes((void*) src, (void*) dst, size);

  }

}

上面的代码展示了选择哪个copy函数，我们选择conjoint_jints_atomic，在openjdk6-src/hotspot/src/share/vm/utilities/copy.hpp进一步查看：

// jints,                 conjoint, atomic on each jint

  static void conjoint_jints_atomic(jint* from, jint* to, size_t count) {

    assert_params_ok(from, to, LogBytesPerInt);

    pd_conjoint_jints_atomic(from, to, count);

  }

继续向下查看，在openjdk6-src/hotspot/src/cpu/zero/vm/copy_zero.hpp中：

static void pd_conjoint_jints_atomic(jint* from, jint* to, size_t count) {

  _Copy_conjoint_jints_atomic(from, to, count);

}

继续向下查看，在openjdk6-src/hotspot/src/os_cpu/linux_zero/vm/os_linux_zero.cpp中：

void _Copy_conjoint_jints_atomic(jint* from, jint* to, size_t count) {

    if (from > to) {

      jint *end = from + count;

      while (from < end)

        *(to++) = *(from++);

    }

    else if (from < to) {

      jint *end = from;

      from += count - ;

      to   += count - ;

      while (from >= end)

        *(to--) = *(from--);

    }

  }

可以看到，直接就是内存块赋值的逻辑了，这样避免很多引用来回倒腾的时间，必然就变快了。

Java性能漫谈－数组复制之System.arraycopy的更多相关文章

Java 数组拷贝方法 System.arraycopy
System类提供的数组拷贝方法: public static native void arraycopy(Object src, int srcPos, Object dest, int destP ...
jdk提供的数组扩容方法:System.arraycopy
package chapter7; /* * jdk提供的扩容方法 * System.arraycopy */public class TestArrayjdk { public static voi ...
[Java] arraycopy 数组复制(转)
public class ArraycopyTest { public static void main(String[] args) { //静态初始化两个长度不同的 ...
【Java基础】System.arraycopy()的使用详解
由于在Java中System.arraycopy()方法在一维数组和二维数组中的表现不同,所以做了一个测试 public static void main(String[] args) { int[] ...
java.lang.System.arraycopy() 与java.util.Arrays.copyOf()的区别
java.lang.System.arraycopy() 与java.util.Arrays.copyOf()的区别一.java.lang.System.arraycopy() 该方法的声明: /* ...
java System.arrayCopy使用说明
java System.arrayCopy使用说明 java.lang.System.arraycopy() 方法复制指定的源数组的数组,在指定的位置开始,到目标数组的指定位置. 下面是 System ...
【java】为数组全部元素赋同样的值以及数组之间的复制
为数组全部元素赋同样的值 : boolean[] resArray=new boolean[100]; Arrays.fill(resArray, true); 数组之间的复制: System.arr ...
java中的数组概念
数组的定义形式: 动态初始化方式: 1.声明并开辟数组 String str[]=new String[3];//3表示数组的长度 2.分布完成 String str[]=null; str=new ...
[Java] System.arraycopy 数组复制
函数原型: public static void arraycopy(Object src, int srcPos, Object dest, int destPos, int length) ; s ...

随机推荐

java 位运算权限管控(转载)
这里笔者介绍一种很常用,也比较专业的权限控制思路.这里用java语言描述,其实都差不多的.要换成其他的语言主,自己转一下就可以了.为了方便起见,我们这里定义a^b为:a的b次方.这里,我们为每一个操作 ...
活动指示器UIActivityIndicatorView
活动指示器UIActivityIndicatorView可以告知用户有一个操作正在进行中 1.创建 UIActivityIndicatorView *activityIndicatorView ...
Xml读取异常--Invalid byte 1 of 1-byte UTF-8 sequence
xml读取异常Invalid byte 1 of 1-byte UTF-8 sequence org.dom4j.DocumentException: Invalid byte 1 of 1-byte ...
c++中vector等容器的实现机制
stl容器区别: vector list deque set map-底层实现 stl容器区别: vector list deque set map (转) 在STL中基本容器有: vector.li ...
为GCD队列绑定NSObject类型上下文数据-利用__bridge_retained(transfer)转移内存管理权-备
下面评论的好友“@Jim”给了种新的思路,就是在清除context的函数里面,用“_bridge_transfer”转换context,把context的内存管理权限重新交给ARC,这样,就不用显式调 ...
win7系统64位"禁用驱动程序签名强制"如何设置？
换了Win7系统64位旗舰版的朋友是不是都发现了一点,以前32位Win7系统用的好好的小程序小应用,一下子就装不上了.这是由于Win7系统64位系统对于未有认证签名的驱动程序进行了限制安装,而大部分小 ...
scala 入门（2）--数组相关操作
scala 无论从数组的定义还是操作方面来说都跟java非常相似,但又有其独特的强大之处… 1. 定长数组对于长度不变的数组,可以用scala中的Array. //scala 里使用new实例化对象 ...
Android调用系统的Activity、ContentProvider、Service、Broadcast Receiver
Intent-------->Action |_____________Activity------------------------------->Intent Action |___ ...
抗忙，，建个MAVEN的私服仓库-NEXUS
公司最近需求越来越上轨道,MAVEN的私服仓库-NEXUS构架起来哟.. 参考文档URL: http://www.linuxidc.com/Linux/2011-07/39578p3.htm http ...
js深入研究之Person类案例
<script type="text/javascript"> /* 定义一个Person类 */ function Person(name, age) { this. ...

Java性能漫谈－数组复制之System.arraycopy

Java性能漫谈－数组复制之System.arraycopy的更多相关文章

随机推荐

热门专题