Deep compression code

https://github.com/songhan/SqueezeNet-Deep-Compression

import sys

import os

import numpy as np

import pickle

help_ = '''

Usage:

    decode.py <net.prototxt> <net.binary> <target.caffemodel>

    Set variable CAFFE_ROOT as root of caffe before run this demo!

'''

if len(sys.argv) != 4:

    print help_

    sys.exit()

else:

    prototxt = sys.argv[1]

    net_bin = sys.argv[2]

    target = sys.argv[3]

# os.system("cd $CAFFE_ROOT")

caffe_root = os.environ["CAFFE_ROOT"]

os.chdir(caffe_root)

print caffe_root

sys.path.insert(0, caffe_root + 'python')

import caffe

caffe.set_mode_cpu()

net = caffe.Net(prototxt, caffe.TEST)

layers = filter(lambda x:'conv' in x or 'fc' in x or 'ip' in x, net.params.keys())

fin = open(net_bin, 'rb')

def binary_to_net(weights, spm_stream, ind_stream, codebook, num_nz):

    bits = np.log2(codebook.size)

    if bits == 4:

        slots = 2

    elif bits == 8:

        slots = 1

    else:

        print "Not impemented,", bits

        sys.exit()

    code = np.zeros(weights.size, np.uint8) 

    # Recover from binary stream

    spm = np.zeros(num_nz, np.uint8)

    ind = np.zeros(num_nz, np.uint8)

    if slots == 2:

        spm[np.arange(0, num_nz, 2)] = spm_stream % (2**4)

        spm[np.arange(1, num_nz, 2)] = spm_stream / (2**4)

    else:

        spm = spm_stream

    ind[np.arange(0, num_nz, 2)] = ind_stream% (2**4)

    ind[np.arange(1, num_nz, 2)] = ind_stream/ (2**4)

    # Recover the matrix

    ind = np.cumsum(ind+1)-1

    code[ind] = spm

    data = np.reshape(codebook[code], weights.shape)

    np.copyto(weights, data)

nz_num = np.fromfile(fin, dtype = np.uint32, count = len(layers))

for idx, layer in enumerate(layers):

    print "Reconstruct layer", layer

    print "Total Non-zero number:", nz_num[idx]
    #eg . Reconstruct layer conv1
    #Total Non-zero number: 13902

    if 'conv' in layer:

        bits = 8  #卷积层使用８ｂｉｔ量化，全连接使用４ｂｉｔ

    else:

        bits = 4

    codebook_size = 2 ** bits　#所有码字的总数

    codebook = np.fromfile(fin, dtype = np.float32, count = codebook_size)

    bias = np.fromfile(fin, dtype = np.float32, count = net.params[layer][1].data.size)

    np.copyto(net.params[layer][1].data, bias)　　　#把ｆｉｎ里的值拷贝进去，原先net.params[layer][1].data全部都是０

    spm_stream = np.fromfile(fin, dtype = np.uint8, count = (nz_num[idx]-1) / (8/bits) + 1)

    ind_stream = np.fromfile(fin, dtype = np.uint8, count = (nz_num[idx]-1) / 2+1)

    binary_to_net(net.params[layer][0].data, spm_stream, ind_stream, codebook, nz_num[idx])

net.save(target)

Deep compression code的更多相关文章

[综述]Deep Compression/Acceleration深度压缩/加速/量化
Survey Recent Advances in Efficient Computation of Deep Convolutional Neural Networks, [arxiv '18] A ...
DEEP COMPRESSION小记
2016ICLR最佳论文 Deep Compression: Compression Deep Neural Networks With Pruning, Trained Quantization A ...
Deep Compression Compressing Deep Neural Networks With Pruning, Trained QuantizationAnd Huffman Coding
转载请注明出处: http://www.cnblogs.com/sysuzyq/p/6200613.html by 少侠阿朱
论文翻译：2021_Towards model compression for deep learning based speech enhancement
论文地址:面向基于深度学习的语音增强模型压缩论文代码:没开源,鼓励大家去向作者要呀,作者是中国人,在语音增强领域深耕多年引用格式:Tan K, Wang D L. Towards model c ...
A Full Hardware Guide to Deep Learning
A Full Hardware Guide to Deep Learning Deep Learning is very computationally intensive, so you will ...
网络压缩论文集(network compression)
Convolutional Neural Networks ImageNet Models Architecture Design Activation Functions Visualization ...
cs231n spring 2017 lecture15 Efficient Methods and Hardware for Deep Learning 听课笔记
1. 深度学习面临的问题: 1)模型越来越大,很难在移动端部署,也很难网络更新. 2)训练时间越来越长,限制了研究人员的产量. 3)耗能太多,硬件成本昂贵. 解决的方法:联合设计算法和硬件. 计算硬件 ...
深度学习网络压缩模型方法总结(model compression)
两派 1. 新的卷机计算方法这种是直接提出新的卷机计算方式,从而减少参数,达到压缩模型的效果,例如SqueezedNet,mobileNet SqueezeNet: AlexNet-level ac ...
(zhuan) Where can I start with Deep Learning?
Where can I start with Deep Learning? By Rotek Song, Deep Reinforcement Learning/Robotics/Computer V ...

随机推荐

SVN服务端的版本对比及创建仓库时的注意事项
SVN是一个开放源代码的版本控制系统,分为客户端和服务端.就windows系统而言,客户端通常使用 TortoiseSVN,下载地址:https://tortoisesvn.net/ ,而服务端通常 ...
HTML5学习笔记1
1.HTML5概述继html4和xhtml1.0后的超文本标记语言最新版本.最重要的三项技术:html5核心规范(标签元素),CSS3,JavaScript2008年发布,主要为了补全功能.特点:1 ...
StringBuilder和StringBuffer解析（百度面试题优化须要用到的）
StringBuilder是java5及以后提供的API,它不是线程安全的,而StringBuffer是java1.4曾经的API,它是线程安全的,所以说StringBuilder的效率更高一些,今天 ...
《学习opencv》笔记——矩阵和图像操作——cvAbs,cvAbsDiff and cvAbsDiffS
矩阵和图像的操作 (1)cvAbs,cvAbsdiff,cvAbsDiffS 它们的结构为: void cvAbs( //取src中元素的绝对值,写到dst中 const CvArr* src, co ...
C#编程（六）------------枚举
原文链接:http://blog.csdn.net/shanyongxu/article/details/46423255 枚举定义枚举用到的关键字:enum public enum TimeOfD ...
1. python 字符串简介与常用函数
1. python中的字符串简介与常用函数在python中,字符串变成了一个强大的处理工具集,他是不可变的,也就是说字符串包含字符与字符的顺序,他不可以原处修改字符串是我们后面需要学习的稍大一点的 ...
Windows 7 卸载 IE10
今天微软为Windows 7发布了IE10预览版,你是否已经安装?根据笔者的体验,IE10确实如微软所说,在速度.性能等各方面都有了明显提升. 不过,IE10发布预览版安装后会直接替代IE9,如果你想 ...
Oracle 学习（scott方案）
Oracle学习中,重点是sql语句的学习,而所有的sql语句都要在scott用户下完成. 熟悉这个用户下的四张表,是必要的. 查看所有表名: SELECT * FROM tab; 查看每张表的结 ...
Python垃圾回收机制及gc模块详解：内存泄露的例子
标记清理是用来解决循环引用的.分代回收针对所有的新创建即进入0代的对象和进入1.2代的对象..这样就解释了python“引用计数为主.标记清理+分代回收为辅”的垃圾回收原理,因为循环引用毕竟是少数情况 ...
Java网络编程技术1
1. Java网络编程常用API 1.1 InetAddress类使用示例 1.1.1根据域名查找IP地址获取用户通过命令行方式指定的域名,然后通过InetAddress对象来获取该域名对应的IP地 ...

Deep compression code

Deep compression code的更多相关文章

随机推荐

热门专题