Practical Machine Learning For The Uninitiated
Practical Machine Learning For The Uninitiated
Last fall when I took on ShippingEasy’s machine learning problem, I had no practical
 experience in the field. Getting such a task put on my plate was somewhat terrifying, and even more so as we started to wade into the waters of machine learning. Ultimately, we overcame those obstacles anddelivered
 a solution that allowed us to automate our customer’s actions with greater than 95% accuracy. Here are some of the challenges that we experienced when applying machine learning to the shipping & fulfilment domain, and how we broke through them.
Lost in Translation
Machine learning is a subfield of computer science stemming from research into artificial intelligence.[3] It has strong ties to statistics and mathematical optimization, which deliver methods, theory and application domains to the field.
So sayeth the Wikipedia. These roots are where the lexicon of the machine learning stems from. If you have
 not been working directly in machine learning, statistics, math or AI, or perhaps your exposure to these are long past, a discussion about machine learning will be hard to follow. Often it is taken for granted that you know what classification, regression,
 clustering, supervised, unsupervised, feature vector, sample, over-fitting, binning, banding, density and a host of other terms mean.
 As a result, you will be somewhat lost until you can get familiar with this language. Getting a
As a result, you will be somewhat lost until you can get familiar with this language. Getting a
 good book will help. I would recommend Machine Learning, a short course. It clocks in at less than 200 pages, and so is something
 that a working professional can consume. Even if you can’t follow everything in the book, reading through it will give you a foundation that will allow you to make use of all the other resources you may find online.
To give you a starting point to building your vocabulary, I will offer a few terms here that will help determine what type of machine learning problem you are dealing with.
Supervised vs Unsupervised Learning: Supervised learning is where you have a set
 of input data with known outcomes by which you wish to predict the outcome of future inputs. Our problem at ShippingEasy was
 of this type. We had past orders and shipments and needed to predict shipments given future orders. Unsupervised
 learning is where you have input data, but no known outcomes. You are searching for what features have meaning within a set of data. If graphed, the data will form clusters around the patterns of meaningful features.
Classification vs Regression: Within supervised learning, there are problems of classification and regression. Classification is
 where you wish to determine the class (output) of an input. For instance, predicting what shirt color a person may wear on a given day based on data about what shirts they have worn in the past. The different shirt colors are the classes that you are attempting
 to predict.
Regression is where you wish to determine a numeric value given other numeric inputs describing the sample.
 For instance, predicting an engineer’s salary based on age and years in the industry. Given enough past data, you could arrive at a statistically relevant salary figure given an arbitrary age and years in the industry (assuming true relationships between age,
 years in industry and salary).
Algorithmic Obsession
 Once you have a foundation of concepts and language, you can start looking into all of the amazing resources
Once you have a foundation of concepts and language, you can start looking into all of the amazing resources
 on the web for machine learning.Stanford’s machine learning videos are great, as are mathematicalmonk’s
 youtube videos.
These are fantastic resources for learning how to write machine learning algorithms. But these turned out to not be of much use to me. Not because they are not great, but because the practical application of machine learning is about solving
 a domain problem, not writing machine learning algorithms. To make the point, consider this portion of a machine learning algorithm expressed in mathematical notation (which is yet another barrier to the uninitiated):
 What does this have to do with the problem you are trying to solve? Absolutely nothing. This is a
What does this have to do with the problem you are trying to solve? Absolutely nothing. This is a
 description of one portion of an algorithm that may be fed arbitrary data to produce statistically relevant results. It has been implemented by someone smarter than you in
 an open source libraryor possibly a service offering that
 have been well exercised by a large audience. You could implement it perfectly, and it could produce great results or really bad results. It all depends on the relevancy of the data you feed to it, which brings me to my last point.
Its the Data, Stupid
While there are a tremendous number of resources for how to write machine learning algorithms, there are not many dealing with how to find relevant data within a domain that will allow an algorithm to produce accurate results. This is where you will find that
 you have spent most of your time, effort and creativity at the end of an applied machine learning project if you were smart enough to use a good machine learning library or service.
 That algorithms dominate the resources for machine learning makes a certain amount of sense. Algorithms
That algorithms dominate the resources for machine learning makes a certain amount of sense. Algorithms
 are generic and have practicability for many different scenarios. TheK-Nearest Neighbor algorithm
 may be able to predict what movies you would like to watch on Netflix, or it might be able to predict which sex offenders are at high risk for recidivism. These different applications of K-Nearest neighbor would have very different data that needs to be surfaced
 from their respective domains and fed to them, however.
There exists an area of machine learning geared towards feature detection, and I won’t dismiss its validity.
 I will say, however, that if someone understands the domains of movie consumption and purchasing dynamics or criminal behavior, justice and rehabilitation, they have a leg up in practically applying machine learning to those domains. For even if there is a
 statistical correlation between day of the week and movie choices, it does not mean that there is a causative relationship between them. Understanding the domain can help you ascertain if it does.
Some of the data will be obvious. It winds up being a value in a column of a row in the database and it screams its pertinence. Some will be much less obvious and need to be inferred. For instance, for the sex offender recidivism problem, there are probably
 a number of criminal incidents, each with a timestamp for when they occurred. For any given person, the amount of time that has passed since their last criminal event, in days, might need to be calculated and included with the data sent to the algorithm. This
 ‘freshness’ of their criminal activity needs to be inferred from your data, and it may be a key to getting the desired results in predicting future likelihood of behavior. Or it might not.
I think the moral of the story here is that to really apply machine learning in a practical way, being a mathematical or statistical wizard is not the most important element of success. What I feel is more important is having an understanding of the domain
 to know what data is relevant and an explorers curiosity to have meaningful hunches and a willingness to explore and vet them. You will need to be comfortable employing something resembling a scientific method – ensuring accuracy is measurable, quantifying
 the effects of change, and meticulously exploring isolated changes to discover what data affects a system.
In conclusion
 Employing machine learning to solve domain problems can provide huge value to a company
Employing machine learning to solve domain problems can provide huge value to a company
 or the public at large. Learning machine learning and how to properly apply it to a domain, however, can be challenging. You will need to develop a knowledge of the fundamentals of machine learning, but do not need to be a computer science, math or statistics
 guru to employ it. Leverage existing libs or services, and then focus your efforts on finding the meaningful data within the domain, both obvious and obscure, that will allow satisfactory results to be achieved.
Practical Machine Learning For The Uninitiated的更多相关文章
- 【机器学习Machine Learning】资料大全
		昨天总结了深度学习的资料,今天把机器学习的资料也总结一下(友情提示:有些网站需要"科学上网"^_^) 推荐几本好书: 1.Pattern Recognition and Machi ... 
- 机器学习(Machine Learning)&深度学习(Deep Learning)资料【转】
		转自:机器学习(Machine Learning)&深度学习(Deep Learning)资料 <Brief History of Machine Learning> 介绍:这是一 ... 
- How to use data analysis for machine learning (example, part 1)
		In my last article, I stated that for practitioners (as opposed to theorists), the real prerequisite ... 
- How do I learn machine learning?
		https://www.quora.com/How-do-I-learn-machine-learning-1?redirected_qid=6578644 How Can I Learn X? ... 
- A Gentle Guide to Machine Learning
		A Gentle Guide to Machine Learning Machine Learning is a subfield within Artificial Intelligence tha ... 
- 机器学习(Machine Learning)与深度学习(Deep Learning)资料汇总
		<Brief History of Machine Learning> 介绍:这是一篇介绍机器学习历史的文章,介绍很全面,从感知机.神经网络.决策树.SVM.Adaboost到随机森林.D ... 
- Lessons learned developing a practical large scale machine learning system
		原文:http://googleresearch.blogspot.jp/2010/04/lessons-learned-developing-practical.html Lessons learn ... 
- Practical Black-Box Attacks against Machine Learning
		目录 概 主要内容 Jacobian-based Dataset Augmentation Note Papernot N, Mcdaniel P, Goodfellow I, et al. Prac ... 
- [Machine Learning & Algorithm]CAML机器学习系列2:深入浅出ML之Entropy-Based家族
		声明:本博客整理自博友@zhouyong计算广告与机器学习-技术共享平台,尊重原创,欢迎感兴趣的博友查看原文. 写在前面 记得在<Pattern Recognition And Machine ... 
随机推荐
- 混合语言编程:启用CLR(公共语言运行时编译)让C#调用C++
			前言 关于混合C#和C++的编程方式,本人之前写过一篇博客(参见混合语言编程:C#使用原生的Directx和OpenGL),在之前的博客中,介绍了在C#的Winform和WPF下使用原生的Direct ... 
- max_allowed_packet自动恢复
			https://dev.mysql.com/doc/refman/5.5/en/packet-too-large.html http://blog.chinaunix.net/uid-20304801 ... 
- Java Platform Standard Edition 8 Documentation
			下面这个图挺有用的,收藏一下. Oracle has two products that implement Java Platform Standard Edition (Java SE) 8: J ... 
- 20135223/20135234/20135229小组——亚博 Arduino智能小车实践报告
			实验名称:Arduino智能小车组装和综合测试 实验小组成员:20135223何伟钦 20135234马启扬 20135229吕松鸿 实验日期:2015.10.27—2015.11.3 实验时长:24 ... 
- Cordova4.0 系列 -- 基本环境搭建(1)
			一. 安装Node.js基本环境 官网下载地址:https://nodejs.org/ 安装成功之后可以使用简单命令查看其版本 node -v npm相关命令 node cli.js install ... 
- node判断文件目录是否存在
			'use strict'; //这是一个简单的应用 var path = require('path'); var fs = require("fs") ; global.l = ... 
- [wikioi2069]油画(贪心)
			题目:http://www.wikioi.com/problem/2069/ 分析: 首先这个问题比较复杂,涉及到两个重要的考虑点,一个是当前拿来的颜色是否保留,一个是若保留后那么应该把当前盘子的哪个 ... 
- Socket网络编程--FTP客户端(1)(Windows)
			已经好久没有写过博客进行分享了.具体原因,在以后说. 这几天在了解FTP协议,准备任务是写一个FTP客户端程序.直接上干货了. 0.了解FTP作用 就是一个提供一个文件的共享协议. 1.了解FTP协议 ... 
- [AaronYang]C#人爱学不学[7]
			做一个决定,并不难,难的是付诸行动,并且坚持到底 --Aaronyang的博客(www.ayjs.net)-www.8mi.me 1. 委托-我的总结 1.1 委托:面试我都会说,把方法当参数.委托包 ... 
- 第二十课:js中如何操作元素的属性系统
			本章的内容有点复杂,我将用简单的方式来介绍重要的东西,不重要的东西,这里就不讲了,讲了也毛用. 通常我们把对象的非函数成员叫做属性.对元素节点来说,其属性大题分为两大类,固有属性和自定义属性.固有属性 ... 
