分布式机器学习框架：MxNet

MxNet官网： http://mxnet.readthedocs.io/en/latest/

前言：

caffe是很优秀的dl平台。影响了后面很多相关框架。

cxxnet借鉴了很多caffe的思想。相比之下，cxxnet在实现上更加干净，例如依赖很少，通过mshadow的模板化使得gpu和cpu代码只用写一份，分布式接口也很干净。

mxnet是cxxnet的下一代，目前实现了cxxnet所有功能，但借鉴了minerva/torch7/theano，加入更多新的功能。

ndarray编程接口，类似matlab/numpy.ndarray/torch.tensor。独有优势在于通过背后的engine可以在性能上和内存使用上更优
symbolic接口。这个可以使得快速构建一个神经网络，和自动求导。
更多binding 目前支持比较好的是python，马上会有julia和R
更加方便的多卡和多机运行
性能上更优。目前mxnet比cxxnet快40%，而且gpu内存使用少了一半。

目前mxnet还在快速发展中。这个月的主要方向有三，更多的binding，更好的文档，和更多的应用（language model、语音，机器翻译，视频）。地址在dmlc/mxnet · GitHub

官方简介：

MXNet is a deep learning framework designed for both efficiency andflexibility.It allows you tomix theflavours
of symbolicprogramming and imperative programming to maximize efficiency and productivity.In its core, a dynamic dependency scheduler that automatically parallelizes both symbolic and imperative operations on the fly.A graph optimization
layer on top of that makes symbolic execution fast and memory efficient.The library is portable and lightweight, and it scales to multiple GPUs and multiple machines.

MXNet is also more than a deep learning project. It is also a collection ofblue prints and guidelines for buildingdeep learning
system, and interesting insights of DL systems for hackers.

MxNet混合了符号式设计和命令式设计，来最大化效率和提高产出。其核心是一个动态调度器，不停的并行执行符号和命令操作。顶层的图优化层使符号执行快速且有效。这个包轻量级可移植性好，并且可以扩展到多GPU和多个机器。

MxNet不仅是一个深度学习工程，并且是一个为构建DL系统提供蓝图和指导的集合，并且为hackers 提供了一个有趣的视野。

最新发展

What's New

MXNet Memory Monger, Training Deeper Nets with Sublinear Memory Cost
Tutorial for NVidia GTC 2016
Embedding Torch layers and functions in MXNet
MXNet.js: Javascript Package for Deep Learning in Browser (without server)
Design Note: Design Efficient Deep Learning Data Loading Module
MXNet on Mobile Device
Distributed Training
Guide to Creating New Operators (Layers)
Amalgamation and Go Binding for Predictors
Training Deep Net on 14 Million Images on A Single Machine
MxNet的内存管理：子线性的内存代价
NVIDIA GTC2016上的教程
嵌入 Torch网络层和函数到MxNet
MxNet.js : 可运行到浏览器中的javascript包
设计节点：设计有效的深度学习数据载入模型
移动设备的上的 Mxnet
分布式训练方法
网络层的运算符重载
使用一个深度网络训练1400万张图片

Features

Design notes providing useful insights that can re-used by other DL projects
Flexible configuration for arbitrary computation graph
Mix and match good flavours of programming to maximize flexibility and efficiency
Lightweight, memory efficient and portable to smart devices
Scales up to multi GPUs and distributed setting with auto parallelism
Support for python, R, C++ and Julia
Cloud-friendly and directly compatible with S3, HDFS, and Azure

Ask Questions

Please use mxnet/issues for how to use mxnet and reporting bugs

License

Reference Paper

Tianqi Chen, Mu Li, Yutian Li, Min Lin, Naiyan Wang, Minjie Wang, Tianjun Xiao,Bing Xu, Chiyuan Zhang, and Zheng Zhang.MXNet: A Flexible
and Efficient Machine Learning Library for Heterogeneous Distributed Systems.In Neural Information Processing Systems, Workshop on Machine Learning Systems, 2015

History

MXNet is initiated and designed in collaboration by the authors of cxxnet, minerva andpurine2. The project reflects what we have learnt from the past projects. It combines important flavours of the existing projects for efficiency, flexibility and
memory efficiency.