Deep compression code

https://github.com/songhan/SqueezeNet-Deep-Compression

import sys

import os

import numpy as np

import pickle

help_ = '''

Usage:

    decode.py <net.prototxt> <net.binary> <target.caffemodel>

    Set variable CAFFE_ROOT as root of caffe before run this demo!

'''

if len(sys.argv) != 4:

    print help_

    sys.exit()

else:

    prototxt = sys.argv[1]

    net_bin = sys.argv[2]

    target = sys.argv[3]

# os.system("cd $CAFFE_ROOT")

caffe_root = os.environ["CAFFE_ROOT"]

os.chdir(caffe_root)

print caffe_root

sys.path.insert(0, caffe_root + 'python')

import caffe

caffe.set_mode_cpu()

net = caffe.Net(prototxt, caffe.TEST)

layers = filter(lambda x:'conv' in x or 'fc' in x or 'ip' in x, net.params.keys())

fin = open(net_bin, 'rb')

def binary_to_net(weights, spm_stream, ind_stream, codebook, num_nz):

    bits = np.log2(codebook.size)

    if bits == 4:

        slots = 2

    elif bits == 8:

        slots = 1

    else:

        print "Not impemented,", bits

        sys.exit()

    code = np.zeros(weights.size, np.uint8) 

    # Recover from binary stream

    spm = np.zeros(num_nz, np.uint8)

    ind = np.zeros(num_nz, np.uint8)

    if slots == 2:

        spm[np.arange(0, num_nz, 2)] = spm_stream % (2**4)

        spm[np.arange(1, num_nz, 2)] = spm_stream / (2**4)

    else:

        spm = spm_stream

    ind[np.arange(0, num_nz, 2)] = ind_stream% (2**4)

    ind[np.arange(1, num_nz, 2)] = ind_stream/ (2**4)

    # Recover the matrix

    ind = np.cumsum(ind+1)-1

    code[ind] = spm

    data = np.reshape(codebook[code], weights.shape)

    np.copyto(weights, data)

nz_num = np.fromfile(fin, dtype = np.uint32, count = len(layers))

for idx, layer in enumerate(layers):

    print "Reconstruct layer", layer

    print "Total Non-zero number:", nz_num[idx]
    #eg . Reconstruct layer conv1
    #Total Non-zero number: 13902

    if 'conv' in layer:

        bits = 8  #卷积层使用８ｂｉｔ量化，全连接使用４ｂｉｔ

    else:

        bits = 4

    codebook_size = 2 ** bits　#所有码字的总数

    codebook = np.fromfile(fin, dtype = np.float32, count = codebook_size)

    bias = np.fromfile(fin, dtype = np.float32, count = net.params[layer][1].data.size)

    np.copyto(net.params[layer][1].data, bias)　　　#把ｆｉｎ里的值拷贝进去，原先net.params[layer][1].data全部都是０

    spm_stream = np.fromfile(fin, dtype = np.uint8, count = (nz_num[idx]-1) / (8/bits) + 1)

    ind_stream = np.fromfile(fin, dtype = np.uint8, count = (nz_num[idx]-1) / 2+1)

    binary_to_net(net.params[layer][0].data, spm_stream, ind_stream, codebook, nz_num[idx])

net.save(target)

Deep compression code的更多相关文章

[综述]Deep Compression/Acceleration深度压缩/加速/量化
Survey Recent Advances in Efficient Computation of Deep Convolutional Neural Networks, [arxiv '18] A ...
DEEP COMPRESSION小记
2016ICLR最佳论文 Deep Compression: Compression Deep Neural Networks With Pruning, Trained Quantization A ...
Deep Compression Compressing Deep Neural Networks With Pruning, Trained QuantizationAnd Huffman Coding
转载请注明出处: http://www.cnblogs.com/sysuzyq/p/6200613.html by 少侠阿朱
论文翻译：2021_Towards model compression for deep learning based speech enhancement
论文地址:面向基于深度学习的语音增强模型压缩论文代码:没开源,鼓励大家去向作者要呀,作者是中国人,在语音增强领域深耕多年引用格式:Tan K, Wang D L. Towards model c ...
A Full Hardware Guide to Deep Learning
A Full Hardware Guide to Deep Learning Deep Learning is very computationally intensive, so you will ...
网络压缩论文集(network compression)
Convolutional Neural Networks ImageNet Models Architecture Design Activation Functions Visualization ...
cs231n spring 2017 lecture15 Efficient Methods and Hardware for Deep Learning 听课笔记
1. 深度学习面临的问题: 1)模型越来越大,很难在移动端部署,也很难网络更新. 2)训练时间越来越长,限制了研究人员的产量. 3)耗能太多,硬件成本昂贵. 解决的方法:联合设计算法和硬件. 计算硬件 ...
深度学习网络压缩模型方法总结(model compression)
两派 1. 新的卷机计算方法这种是直接提出新的卷机计算方式,从而减少参数,达到压缩模型的效果,例如SqueezedNet,mobileNet SqueezeNet: AlexNet-level ac ...
(zhuan) Where can I start with Deep Learning?
Where can I start with Deep Learning? By Rotek Song, Deep Reinforcement Learning/Robotics/Computer V ...

随机推荐

Spring @Value 用法小结，#与$的区别
20161016更新:这货其实是SpEL的功能,来这里看看吧: Spring 4 官方文档学习(五)核心技术之SpEL 起因一直的用法是 @Value("${jdbc.driverClas ...
老生常谈ajax
一,js中的ajax ajax(Asynchronous Javascript And XML)即为异步的JavaScript和XML,顾名思义,这个技术是和我们当前页面刷新无关的,因为它是异步的,在 ...
JAVA JDOM解析XML 带CDATA数据
import java.io.StringReader;import java.util.*; import org.jdom.Document;import org.jdom.Element;imp ...
Vi 编辑
1.vi的基本概念基本上vi可以分为三种状态,分别是命令模式(command mode).插入模式(Insert mode)和底行模式(last line mode),各模式的功能区分如下: 1) ...
Ext.state.Manager.setProvider(new Ext.state.CookieProvider())
Ext.state.Manager.setProvider(new Ext.state.CookieProvider()) 初始化Ext状态管理器,在Cookie中记录用户的操作状态,如果不启用,象刷 ...
python文章学习列表
1.https://home.cnblogs.com/u/darkpig/feed/blog/ 这篇博主的系列文章 2.
Configuring HDFS High Availability
Configuring HDFS High Availability 原文请訪问 http://blog.csdn.net/ashic/article/details/47024617,突袭新闻小灵儿 ...
Terrain tessellation &&Threaded Rendering Vk
https://github.com/NVIDIAGameWorks/GraphicsSamples/tree/master/samples/es3aep-kepler/TerrainTessella ...
Observer 观察者模式 MD
Markdown版本笔记我的GitHub首页我的博客我的微信我的邮箱 MyAndroidBlogs baiqiantao baiqiantao bqt20094 baiqiantao@sina ...
20 个具有惊艳效果的 jQuery 图像缩放插件
jQuery相对与Flash的魔力已经贯穿整个网络.尽管,Flash层被认为是用于网页设计的首选,然而随着jQuery的出现,以及他的酷似Flash的交互式特效使得网页更加的优雅——Flash开始靠边 ...

Deep compression code

Deep compression code的更多相关文章

随机推荐

热门专题