图像处理------K-Means算法演示

二毛子 2024-10-20 22:36:43 原文

一：数学原理

K-Means算法的作者是MacQueen，基本的数学原理很容易理解，假设有一个像素

数据集P。我们要根据值不同将它分为两个基本的数据集合Cluster1, Cluster2，使

用K-Means算法大致如下：

假设两个Cluster的RGB值分别为112,225,244和23,34,99则像素集合中的像素点

a(222,212,234), b(198,205,229), c(25,77,52),d(34,55,101)计算每个像素点与这

两个cluster中心点的欧几里德距离，则像素点a, b属于前面一个cluster， c,d属于

后面一个cluster。然后在根据(222+198)/2, (212+205)/2, (234+52)/2更新cluster

的RGB值，对后一个cluster做同样处理。然后再计算每个像素点到cluster中心点

的欧几里德距离。最终值没有变化则得到分类Cluster点集合。

二：算法基本流程

三：算法关键代码解析

初始化cluster中心点代码如下：

[java] view plain copy

Random random = new Random();
for (int i = 0; i < numOfCluster; i++)
{
int randomNumber1 = random.nextInt(width);
int randomNumber2 = random.nextInt(height);
index = randomNumber2 * width + randomNumber1;
ClusterCenter cc = new ClusterCenter(randomNumber1, randomNumber2, inPixels[index]);
cc.setcIndex(i);
clusterCenterList.add(cc);
}

初始化所有像素点代码如下：

[java] view plain copy

// create all cluster point
for (int row = 0; row < height; ++row)
{
for (int col = 0; col < width; ++col)
{
index = row * width + col;
int color = inPixels[index];
pointList.add(new ClusterPoint(row, col, color));
}
}

计算两个像素点之间欧几里德距离的代码如下:

[java] view plain copy

// int pa = (p.getPixelColor() >> 24) & 0xff;
int pr = (p.getPixelColor() >> 16) & 0xff;
int pg = (p.getPixelColor() >> 8) & 0xff;
int pb = p.getPixelColor() & 0xff;
// int ca = (c.getPixelColor() >> 24) & 0xff;
int cr = (c.getPixelColor() >> 16) & 0xff;
int cg = (c.getPixelColor() >> 8) & 0xff;
int cb = c.getPixelColor() & 0xff;
return Math.sqrt(Math.pow((pr - cr), 2.0) + Math.pow((pg - cg), 2.0) + Math.pow((pb - cb), 2.0));

; i<clusterCenterList.size(); i++)

{

clusterCenterList.get(i).setNumOfPoints(0);

}

// recalculate the sum and total of points for each cluster

double[] redSums = new double[3];

double[] greenSum = new double[3];

double[] blueSum = new double[3];

for(int i=0; i<pointList.size(); i++)

{

int cIndex = (int)pointList.get(i).getClusterIndex();

clusterCenterList.get(cIndex).addPoints();

int ta = (pointList.get(i).getPixelColor() >> 24) & 0xff;

int tr = (pointList.get(i).getPixelColor() >> 16) & 0xff;

int tg = (pointList.get(i).getPixelColor() >> 8) & 0xff;

int tb = pointList.get(i).getPixelColor() & 0xff;

ta = 255;

redSums[cIndex] += tr;

greenSum[cIndex] += tg;

blueSum[cIndex] += tb;

}

double[] oldClusterCentersColors = new double[clusterCenterList.size()];

for(int i=0; i<clusterCenterList.size(); i++)

{

double sum = clusterCenterList.get(i).getNumOfPoints();

int cIndex = clusterCenterList.get(i).getcIndex();

int red = (int)(greenSum[cIndex]/sum);

int green = (int)(greenSum[cIndex]/sum);

int blue = (int)(blueSum[cIndex]/sum);

System.out.println("red = " + red + " green = " + green + " blue = " + blue);

int clusterColor = (255 << 24) | (red << 16) | (green << 8) | blue;

clusterCenterList.get(i).setPixelColor(clusterColor);

oldClusterCentersColors[i] = clusterColor;

}

return oldClusterCentersColors;

}

四：运行效果

五：K-Means算法源代码

[java] view plain copy

package com.gloomyfish.segmentation.kmeans;
import java.awt.image.BufferedImage;
import java.util.ArrayList;
import java.util.List;
import java.util.Random;
import com.gloomyfish.filter.study.AbstractBufferedImageOp;
import com.gloomyfish.segmentation.fuzzycmeans.ClusterPoint;
public class KMeansProcessor extends AbstractBufferedImageOp {
private List<ClusterCenter> clusterCenterList;
private List<ClusterPoint> pointList;
private int numOfCluster;
public KMeansProcessor(int clusters)
{
this.numOfCluster = clusters;
pointList = new ArrayList<ClusterPoint>();
this.clusterCenterList = new ArrayList<ClusterCenter>();
}
@Override
public BufferedImage filter(BufferedImage src, BufferedImage dest) {
// initialization the pixel data
int width = src.getWidth();
int height = src.getHeight();
int[] inPixels = new int[width*height];
src.getRGB( 0, 0, width, height, inPixels, 0, width );
int index = 0;
//Create random points to use a the cluster center
Random random = new Random();
for (int i = 0; i < numOfCluster; i++)
{
int randomNumber1 = random.nextInt(width);
int randomNumber2 = random.nextInt(height);
index = randomNumber2 * width + randomNumber1;
ClusterCenter cc = new ClusterCenter(randomNumber1, randomNumber2, inPixels[index]);
cc.setcIndex(i);
clusterCenterList.add(cc);
}
// create all cluster point
for (int row = 0; row < height; ++row)
{
for (int col = 0; col < width; ++col)
{
index = row * width + col;
int color = inPixels[index];
pointList.add(new ClusterPoint(row, col, color));
}
}
// initialize the clusters for each point
double[] clusterDisValues = new double[clusterCenterList.size()];
for(int i=0; i<pointList.size(); i++)
{
for(int j=0; j<clusterCenterList.size(); j++)
{
clusterDisValues[j] = calculateEuclideanDistance(pointList.get(i), clusterCenterList.get(j));
}
pointList.get(i).setClusterIndex(getCloserCluster(clusterDisValues));
}
// calculate the old summary
// assign the points to cluster center
// calculate the new cluster center
// computation the delta value
// stop condition--
double[] oldClusterCenterColors = reCalculateClusterCenters();
while(true)
{
stepClusters();
double[] newClusterCenterColors = reCalculateClusterCenters();
if(isStop(oldClusterCenterColors, newClusterCenterColors))
{
break;
}
else
{
oldClusterCenterColors = newClusterCenterColors;
}
}
//update the result image
dest = createCompatibleDestImage(src, null );
index = 0;
int[] outPixels = new int[width*height];
for (int j = 0; j < pointList.size(); j++)
{
for (int i = 0; i < clusterCenterList.size(); i++)
{
ClusterPoint p = this.pointList.get(j);
if (clusterCenterList.get(i).getcIndex() == p.getClusterIndex())
{
int row = (int)p.getX(); // row
int col = (int)p.getY(); // column
index = row * width + col;
outPixels[index] = clusterCenterList.get(i).getPixelColor();
}
}
}
// fill the pixel data
setRGB( dest, 0, 0, width, height, outPixels );
return dest;
}
private boolean isStop(double[] oldClusterCenterColors, double[] newClusterCenterColors) {
for(int i=0; i<oldClusterCenterColors.length; i++)
{
System.out.println("cluster " + i + " old : " + oldClusterCenterColors[i] + ", new : " + newClusterCenterColors[i]);
if(oldClusterCenterColors[i] != newClusterCenterColors[i])
{
return false;
}
}
System.out.println();
return true;
}
/**
* update the cluster index by distance value
*/
private void stepClusters()
{
// initialize the clusters for each point
double[] clusterDisValues = new double[clusterCenterList.size()];
for(int i=0; i<pointList.size(); i++)
{
for(int j=0; j<clusterCenterList.size(); j++)
{
clusterDisValues[j] = calculateEuclideanDistance(pointList.get(i), clusterCenterList.get(j));
}
pointList.get(i).setClusterIndex(getCloserCluster(clusterDisValues));
}
}
/**
* using cluster color of each point to update cluster center color
*
* @return
*/
private double[] reCalculateClusterCenters() {
// clear the points now
for(int i=0; i<clusterCenterList.size(); i++)
{
clusterCenterList.get(i).setNumOfPoints(0);
}
// recalculate the sum and total of points for each cluster
double[] redSums = new double[3];
double[] greenSum = new double[3];
double[] blueSum = new double[3];
for(int i=0; i<pointList.size(); i++)
{
int cIndex = (int)pointList.get(i).getClusterIndex();
clusterCenterList.get(cIndex).addPoints();
int ta = (pointList.get(i).getPixelColor() >> 24) & 0xff;
int tr = (pointList.get(i).getPixelColor() >> 16) & 0xff;
int tg = (pointList.get(i).getPixelColor() >> 8) & 0xff;
int tb = pointList.get(i).getPixelColor() & 0xff;
ta = 255;
redSums[cIndex] += tr;
greenSum[cIndex] += tg;
blueSum[cIndex] += tb;
}
double[] oldClusterCentersColors = new double[clusterCenterList.size()];
for(int i=0; i<clusterCenterList.size(); i++)
{
double sum = clusterCenterList.get(i).getNumOfPoints();
int cIndex = clusterCenterList.get(i).getcIndex();
int red = (int)(greenSum[cIndex]/sum);
int green = (int)(greenSum[cIndex]/sum);
int blue = (int)(blueSum[cIndex]/sum);
System.out.println("red = " + red + " green = " + green + " blue = " + blue);
int clusterColor = (255 << 24) | (red << 16) | (green << 8) | blue;
clusterCenterList.get(i).setPixelColor(clusterColor);
oldClusterCentersColors[i] = clusterColor;
}
return oldClusterCentersColors;
}
/**
*
* @param clusterDisValues
* @return
*/
private double getCloserCluster(double[] clusterDisValues)
{
double min = clusterDisValues[0];
int clusterIndex = 0;
for(int i=0; i<clusterDisValues.length; i++)
{
if(min > clusterDisValues[i])
{
min = clusterDisValues[i];
clusterIndex = i;
}
}
return clusterIndex;
}
/**
*
* @param point
* @param cluster
* @return distance value
*/
private double calculateEuclideanDistance(ClusterPoint p, ClusterCenter c)
{
// int pa = (p.getPixelColor() >> 24) & 0xff;
int pr = (p.getPixelColor() >> 16) & 0xff;
int pg = (p.getPixelColor() >> 8) & 0xff;
int pb = p.getPixelColor() & 0xff;
// int ca = (c.getPixelColor() >> 24) & 0xff;
int cr = (c.getPixelColor() >> 16) & 0xff;
int cg = (c.getPixelColor() >> 8) & 0xff;
int cb = c.getPixelColor() & 0xff;
return Math.sqrt(Math.pow((pr - cr), 2.0) + Math.pow((pg - cg), 2.0) + Math.pow((pb - cb), 2.0));
}
}

图像处理------K-Means算法演示的更多相关文章

KNN 与 K - Means 算法比较
KNN K-Means 1.分类算法聚类算法 2.监督学习非监督学习 3.数据类型:喂给它的数据集是带label的数据,已经是完全正确的数据喂给它的数据集是无label的数据,是杂乱无章的,经过 ...
K－means算法
K-means算法很简单,它属于无监督学习算法中的聚类算法中的一种方法吧,利用欧式距离进行聚合啦. 解决的问题如图所示哈:有一堆没有标签的训练样本,并且它们可以潜在地分为K类,我们怎么把它们划分呢? ...
JS写的排序算法演示
看到网上有老外写的,就拿起自已之前完成的jmgraph画图组件也写了一个.想了解jmgraph的请移步:https://github.com/jiamao/jmgraph 当前演示请查看:http:/ ...
用Python从零开始实现K近邻算法
KNN算法的定义: KNN通过测量不同样本的特征值之间的距离进行分类.它的思路是:如果一个样本在特征空间中的k个最相似(即特征空间中最邻近)的样本中的大多数属于某一个类别,则该样本也属于这个类别.K通 ...
02-18 scikit-learn库之k近邻算法
目录 scikit-learn库之k近邻算法一.KNeighborsClassifier 1.1 使用场景 1.2 代码 1.3 参数详解 1.4 方法 1.4.1 kneighbors([X, n ...
使用K均值算法进行图片压缩
K均值算法上一期介绍了机器学习中的监督式学习,并用了离散回归与神经网络模型算法来解决手写数字的识别问题.今天我们介绍一种机器学习中的非监督式学习算法--K均值算法. 所谓非监督式学习,是一种 ...
AlgorithmMan，一套免费的算法演示神器
概述该文章的最新版本已迁移至个人博客[比特飞],单击链接 https://www.byteflying.com/archives/971 访问. 文章末尾附带GitHub开源下载地址. 0.概述 ...
K近邻算法：机器学习萌新必学算法
摘要:K近邻(k-NearestNeighbor,K-NN)算法是一个有监督的机器学习算法,也被称为K-NN算法,由Cover和Hart于1968年提出,可以用于解决分类问题和回归问题. 1. 为什么 ...
K 均值算法-如何让数据自动分组
公号:码农充电站pro 主页:https://codeshellme.github.io 之前介绍到的一些机器学习算法都是监督学习算法.所谓监督学习,就是既有特征数据,又有目标数据. 而本篇文章要介绍 ...
机器学习算法之K近邻算法
0x00 概述 K近邻算法是机器学习中非常重要的分类算法.可利用K近邻基于不同的特征提取方式来检测异常操作,比如使用K近邻检测Rootkit,使用K近邻检测webshell等. 0x01 原理 ...

随机推荐

mysql安装(CentOS 7.1 (64-bit system) MySQL 5.6.24)
环境:CentOS 7.1 (64-bit system) MySQL 5.6.24yum install libaio //安装依赖的包wget http://dev.mysql.com/get/m ...
if语句中同时判断多个条件的多种方法
总结一下自己经常用到的python中的if语句同时判断多个条件的不同方法,假设有: x, y, z = 0, 1, 0 方法一,多个逻辑运算符一起使用,这也是最常用的写法: if x == 1 or ...
ssh: Could not resolve hostname git.*****-inc.com : Temporary failure in name resolution fatal: The remote end hung up unexpectedly
问题出现的情景:使用git pull拉取开发的代码到测试服务器,报错: ssh: Could not resolve hostname git.****-inc.com : Temporary fai ...
Discuz的安装与使用
Discuz的安装与使用一.Discuz的安装由于本机已经安装好XAMPP集成工具,后续Discuz访问数据库以及服务器等都是基于XAMPP环境.在主机localhost根目录下新建bbs文件夹. ...
使用Nginx实现灰度发布
灰度发布是指在黑与白之间,能够平滑过渡的一种发布方式.AB test就是一种灰度发布方式,让一部分用户继续用A,一部分用户开始用B,如果用户对B没有什么反对意见,那么逐步扩大范围,把所有用户都迁移到B ...
windows FileZilla Server 开启FTP over TLS
FileZilla Server官方下载地址: https://filezilla-project.org/download.php?type=server FileZilla Server 开启FT ...
CENTOS6.6 下mysql MHA架构搭建
本文来自我的github pages博客http://galengao.github.io/ 即www.gaohuirong.cn 摘要: 本篇是自己搭建的一篇mysql MHA文章前面的安装步骤基 ...
[译]Serilog Tutorial
在过去的几年中,结构化日志已经大受欢迎.而Serilog是 .NET 中最著名的结构化日志类库 ,我们提供了这份的精简指南来帮助你快速了解并运用它. 0. 内容设定目标认识Serilog 事件和级 ...
加入GIMPS项目，寻找梅森素数！
截止到目前为止人类共找到了50个梅森素数,其中最后16个梅森素数都是通过GIMPS项目找到的. 为了激励人们寻找梅森素数和促进网格技术发展,总部设在美国旧金山的电子前沿基金会(EFF)于1999年3月 ...
架构师入门：搭建双注册中心的高可用Eureka架构（基于项目实战）
本文的案例是基于架构师入门:搭建基本的Eureka架构(从项目里抽取) 改写的. 在上文里,我们演示Eureka客户端调用服务的整个流程,在这部分里我们将在架构上有所改进.大家可以想象下,在上文里案 ...

图像处理------K-Means算法演示

图像处理------K-Means算法演示的更多相关文章

随机推荐

热门专题