Q-REG

Jin, S., Barath, D., Pollefeys, M., & Armeni, I. (2023). Q-REG: End-to-End Trainable Point Cloud Registration with Surface Curvature.

paper: [2309.16023v1] Q-REG: End-to-End Trainable Point Cloud Registration with Surface Curvature (arxiv.org)
code: waiting

Questions Raised

RANSAC-like estimation methods cope with the combinatorics of the problem by selecting random subsets of m correspondences (e.g., m=3 for rigid pose estimation). This allows them to progressively explore the \(\binom{n}{m}\) possible combinations, where n is the total number of matches.
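To get a feel for the combinatorics, here is a small sketch that counts the minimal samples for a 3-point solver versus a 1-point solver. The helper name `num_minimal_samples` and the value n = 5000 are made up for illustration.

```python
from math import comb

def num_minimal_samples(n: int, m: int) -> int:
    """Number of distinct size-m subsets among n putative correspondences."""
    return comb(n, m)

n = 5000  # illustrative number of matches, not a figure from the paper
triplets = num_minimal_samples(n, 3)  # what a 3-point RANSAC solver samples from
singles = num_minimal_samples(n, 1)   # what a single-correspondence solver enumerates
print(f"m=3: {triplets} subsets, m=1: {singles} subsets")
```

With m=1 the count is just n, which is why an exhaustive pass over all correspondences becomes feasible.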

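Put simply, RANSAC-style estimation is not differentiable, so it cannot be trained end-to-end. Other learning-based methods get end-to-end training by replacing hard correspondences with score-based soft correspondences (hard means a match is either True or False; soft means each match carries a weight, i.e. a degree of matching), but this makes the computation too expensive and introduces a lot of noise.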
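A tiny sketch of the hard/soft distinction, assuming the matcher produces a feature similarity matrix between source and target points. Taking the row-wise argmax for hard matches and a softmax for soft matches is my own simplification, not what any particular matcher in the paper does.

```python
import numpy as np

def hard_correspondences(similarity):
    """Binary assignment: each source point gets exactly one target (True/False)."""
    hard = np.zeros_like(similarity, dtype=bool)
    hard[np.arange(similarity.shape[0]), similarity.argmax(axis=1)] = True
    return hard

def soft_correspondences(similarity, temperature=1.0):
    """Weighted assignment: a row-wise softmax gives every pair a matching weight."""
    logits = similarity / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    return weights / weights.sum(axis=1, keepdims=True)

# Toy similarity between 2 source points and 3 target points.
similarity = np.array([[0.9, 0.1, 0.0],
                       [0.2, 0.7, 0.1]])
hard = hard_correspondences(similarity)
soft = soft_correspondences(similarity)
```

The soft version keeps a dense weight for every pair, which is where the extra cost and noise come from.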

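The authors want end-to-end training with hard correspondences. How? Predict the transformation from a single correspondence. Then there are no random subsets: the method iterates over the whole correspondence set and keeps the best prediction.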

Contribution

Q-REG: a point cloud registration method that estimates the pose from a single correspondence combined with its local surface patches (fitted quadrics), intended as a replacement for RANSAC. According to the paper, Q-REG is agnostic to the correspondence matching method and allows fast outlier rejection by filtering degenerate solutions and assumption-inconsistent motions (rigid poses inconsistent with motion priors, e.g. to avoid unrealistically large scaling).
Q-REG is designed to be differentiable, so it can be used for end-to-end training of both the correspondence matching method and the pose estimation method.
And it sets new SOTA results.

Description

By employing higher-order geometric information, Q-REG replaces RANSAC with an exhaustive search and improves both performance and run-time.

First Step: Correspondence Matching

Any correspondence matcher (e.g. patch-based: PPFNet, PPF-FoldNet; fully convolutional: FCGF) can be used to obtain the feature-matching-based putative correspondences \(\{P, Q\}\in C\), which Q-REG then uses to estimate the transformation matrix.

Q-REG is a single-correspondence method. Where RANSAC randomly picks three pairs of corresponding points \(\{p, q\}\) per iteration to predict the transformation matrix, Q-REG takes only one pair at a time to estimate the transform between \(P\) and \(Q\).

Second Step: Q-REG

Used directly as a tool, Q-REG works as follows:

Iterate over the correspondence set \(C\), taking one single correspondence \(\{p,\ q \}\) at a time;
Predict a transformation matrix from each single correspondence;
Select the best transformation model as the preliminary result. The pose quality metric is the cardinality of its support, i.e. the number of inliers;
Then run local optimization following [^ 1]: a local re-sampling and re-fitting of inlier correspondences based on their normals (coming from the fitted quadrics) and positions.
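The first three steps can be sketched as a plain loop. This is only an outline under assumptions: `solve_from_one` is a hypothetical callback standing in for the single-correspondence solver, the inlier threshold 0.05 is arbitrary, and the local optimization step is left out.

```python
import numpy as np

def count_inliers(R, t, src, dst, threshold):
    """Support of a pose: how many correspondences it maps within `threshold`."""
    residuals = np.linalg.norm(src @ R.T + t - dst, axis=1)
    return int(np.sum(residuals < threshold))

def exhaustive_search(src, dst, solve_from_one, threshold=0.05):
    """Try every correspondence once and keep the pose with the largest support.

    `solve_from_one(i)` is a hypothetical callback returning (R, t) for the
    i-th correspondence, or None if that correspondence is rejected.
    """
    best_R, best_t, best_support = None, None, -1
    for i in range(len(src)):
        model = solve_from_one(i)
        if model is None:
            continue
        R, t = model
        support = count_inliers(R, t, src, dst, threshold)
        if support > best_support:
            best_R, best_t, best_support = R, t, support
    return best_R, best_t, best_support

# Toy data: 15 correct matches and 5 gross outliers.
rng = np.random.default_rng(0)
angle = np.deg2rad(30.0)
R_gt = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                 [np.sin(angle),  np.cos(angle), 0.0],
                 [0.0, 0.0, 1.0]])
t_gt = np.array([5.0, -2.0, 1.0])
src = rng.normal(size=(20, 3))
dst = src @ R_gt.T + t_gt
dst[:5] += 10.0

def toy_solver(i):
    """Stand-in solver: only correspondence 7 yields the right pose."""
    if i == 7:
        return R_gt, t_gt
    return np.eye(3), np.zeros(3)

R, t, support = exhaustive_search(src, dst, toy_solver)
```

Because each hypothesis needs only one correspondence, the loop runs exactly |C| times instead of sampling randomly.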

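When Q-REG is embedded in end-to-end training, the procedure stops at the second step, and the loss \(L_{pose}\) is built from the predicted transformations.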

The rest of these notes detail how the transformation matrix is predicted from a single correspondence and how \(L_{pose}\) is composed.

1. Quadric Fitting on Local Patches

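For a single correspondence \(\{p, q\}\in C\), a local patch is built around each point (Q-REG uses KNN with K=50), which gives a pair of local patches. For each of the two patches an LRF (local reference frame) is computed, \(R_p, R_q \in SO(3)\) (i.e. the rotation matrix that takes points from the world coordinate system to the local reference frame). If the LRFs are estimated correctly, the two point clouds can be aligned with \(R=R_qR_p^T\). Q-REG therefore uses quadric surface fitting to estimate \(R_p,\ R_q\).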

As for the translation vector \(t\), the paper simply treats q and p as the centroids of the overlapping region of the two point clouds, so \(t=q-p\).
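A minimal sketch of how a pose falls out of two LRFs. Two assumptions are mine: the columns of \(R_p\) and \(R_q\) are the local axes expressed in world coordinates (with points as row vectors, x @ R_p then gives local coordinates), and the translation is read as rotating about the keypoint, x -> R(x - p) + q, which in the x -> Rx + t form is t = q - Rp rather than q - p. The helper names are illustrative.

```python
import numpy as np

def pose_from_lrfs(p, q, R_p, R_q):
    """Rigid pose from one correspondence (p, q) and its two LRFs."""
    R = R_q @ R_p.T
    # Assumption: rotate about the keypoint, x -> R (x - p) + q, i.e. t = q - R p.
    t = q - R @ p
    return R, t

def random_rotation(rng):
    """Random proper rotation from a QR decomposition (illustrative helper)."""
    basis, _ = np.linalg.qr(rng.normal(size=(3, 3)))
    if np.linalg.det(basis) < 0:
        basis[:, 0] *= -1.0  # flip one axis to get det = +1
    return basis

rng = np.random.default_rng(1)
R_gt = random_rotation(rng)
t_gt = np.array([0.3, -1.0, 2.0])

R_p = random_rotation(rng)  # LRF at p
R_q = R_gt @ R_p            # the same frame seen in the second cloud
p = rng.normal(size=3)
q = R_gt @ p + t_gt

R, t = pose_from_lrfs(p, q, R_p, R_q)
```

If both LRFs are exact, the recovered pose equals the ground truth, so all the difficulty sits in estimating the LRFs reliably.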

The paper fits a 3D quadric surface under the following constraint:

\[\hat{p}^TQ\hat{p}=0
\]

\(\hat{p}\): a 3D homogeneous point lying on the surface
\(Q\): the quadric parameters in matrix form:

\[Q = \begin{pmatrix}A&D&E&G\\D&B&F&H\\E&F&C&I\\G&H&I&J\end{pmatrix}
\]

Ideally every point of the local patch would lie exactly on the surface, but of course that does not happen, so the surface has to be fitted.

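The authors then rewrite the constraint above as a linear equation in the coefficients of \(Q\), which is easier to apply. In that equation:

\(|\mathcal{N}|\) is the number of neighbors to which the quadric is fitted (the paper sets it to 50). In other words, the quadric is not fitted to the points p, q of the single correspondence (the keypoints) but to the other points of the local patch, the neighbor points.
\(d_i\) is the squared distance of the i-th neighbor point from the origin. (So in an implementation, does the local patch first need to be normalized by taking distances relative to the keypoint?)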

Solving this linear equation gives the coefficients of \(Q\).

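A translation is then applied to the fitted quadric coefficient matrix \(Q\) so that the keypoint lies on the surface, i.e. the coefficient \(J\) is adjusted so that \(\hat{p}^TQ\hat{p}=0\) holds for the keypoint.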

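Finally, the part of \(Q\) that holds the second-order coefficients is taken as the matrix \(P\):

\[P = \begin{pmatrix}A&D&E\\D&B&F\\E&F&C\end{pmatrix}
\]

Eigen-decomposition is applied to \(P\), and the resulting eigenvector matrix \(V\) serves as the LRF \(R_p\) or \(R_q\).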

Note: to preserve the scale information, the eigenvectors are not normalized to unit length here.
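A rough sketch of the fit-then-decompose pipeline. Note that the fit below is a generic algebraic least-squares fit (smallest right singular vector of the monomial design matrix), which I swapped in because the paper's exact linear equation is not reproduced in these notes. Returning the eigenvalues next to unit eigenvectors is my way of keeping the scale information, and the toy patch is a full ellipsoid rather than a real KNN patch. All function names are illustrative.

```python
import numpy as np

def fit_quadric(neighbors):
    """Algebraic fit of p^T Q p = 0 (homogeneous p) to an (N, 3) array of points."""
    x, y, z = neighbors.T
    design = np.column_stack([x * x, y * y, z * z,
                              2 * x * y, 2 * x * z, 2 * y * z,
                              2 * x, 2 * y, 2 * z, np.ones_like(x)])
    # The coefficient vector is the right singular vector of the smallest singular value.
    _, _, vt = np.linalg.svd(design, full_matrices=False)
    A, B, C, D, E, F, G, H, I, J = vt[-1]
    return np.array([[A, D, E, G],
                     [D, B, F, H],
                     [E, F, C, I],
                     [G, H, I, J]])

def move_onto_keypoint(Q, keypoint):
    """Adjust only J so that the keypoint satisfies the quadric equation."""
    Q = Q.copy()
    P, g = Q[:3, :3], Q[:3, 3]
    Q[3, 3] = -(keypoint @ P @ keypoint + 2.0 * g @ keypoint)
    return Q

def lrf_from_quadric(Q):
    """Eigen-decompose the upper-left 3x3 block; eigenvalues carry the scale."""
    eigenvalues, V = np.linalg.eigh(Q[:3, :3])
    return eigenvalues, V

# Toy patch: 50 points on the ellipsoid x^2/9 + y^2/4 + z^2 = 1.
rng = np.random.default_rng(2)
directions = rng.normal(size=(50, 3))
directions /= np.linalg.norm(directions, axis=1, keepdims=True)
neighbors = directions * np.array([3.0, 2.0, 1.0])
keypoint = np.array([3.0, 0.0, 0.0])  # also lies on the ellipsoid

Q = move_onto_keypoint(fit_quadric(neighbors), keypoint)
eigenvalues, V = lrf_from_quadric(Q)
```

For this toy ellipsoid the eigenvalue magnitudes come out in the ratio 1/9 : 1/4 : 1, so the three axes are clearly distinguishable.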

2. Estimate rigid Transformation

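The rotation is \(R=R_qPR_p^T \in SO(3)\), where \(P\) here denotes an unknown permutation matrix (not the quadric matrix \(P\) of the previous section) that determines which axis of p's LRF corresponds to which axis of q's LRF. Three cases are considered: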

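All three LRF axes have different norms (lengths), i.e. the scale differs along x, y and z. The axes are then simply matched in order of decreasing length. This relies on the assumption that the point clouds have no (or only negligible) anisotropic scaling, so corresponding axes keep their lengths. The approach is scale-invariant and filters out unreliable matches through unrealizable scaling. In this case the rigid transformation can be solved from a single correspondence.
Two of the three LRF axes have the same norm and the third differs, i.e. the scale is the same along two of the x, y, z directions. Intuitively, two pairs of axes then cannot be matched unambiguously in a one-to-one assignment, so at least two correspondences are needed to cross-check each other and make sure \(P\) is estimated correctly.
All three LRF axes have the same norm, i.e. the scale is the same along x, y and z, and the local patch, with the keypoint as origin, is close to a sphere surface. By the same reasoning, at least three correspondences are needed.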

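So, to estimate the rigid transformation from a single correspondence, only those correspondences in \(C\) whose three LRF axes all have different norms are kept, with every pairwise difference in axis length greater than \(10^{-3}\). The rigid rotation matrix can then be computed as \(R=R_qPR_p^T\).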
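The filtering and axis matching can be sketched as below. Assumptions: each LRF is given as a matrix whose columns are the axes plus a separate array of axis lengths, and the sign ambiguity of eigenvectors is ignored. The function names are made up for this sketch.

```python
import numpy as np

def has_distinct_axes(lengths, eps=1e-3):
    """True if all three axis lengths differ pairwise by more than eps."""
    gaps = np.diff(np.sort(np.asarray(lengths, dtype=float)))
    return bool(np.all(gaps > eps))

def permutation_by_length(len_p, len_q):
    """Permutation P pairing the k-th longest axis of p with the k-th longest of q."""
    order_p = np.argsort(-np.asarray(len_p, dtype=float))
    order_q = np.argsort(-np.asarray(len_q, dtype=float))
    P = np.zeros((3, 3))
    P[order_q, order_p] = 1.0  # column order_p[k] of (R_q @ P) is column order_q[k] of R_q
    return P

def rotation_from_lrfs(R_p, len_p, R_q, len_q):
    """R = R_q P R_p^T, or None when the axis lengths are ambiguous."""
    if not (has_distinct_axes(len_p) and has_distinct_axes(len_q)):
        return None
    return R_q @ permutation_by_length(len_p, len_q) @ R_p.T

def rotation_about(axis, degrees):
    """Rotation about the x axis (0) or z axis (2); toy helper."""
    c, s = np.cos(np.deg2rad(degrees)), np.sin(np.deg2rad(degrees))
    if axis == 0:
        return np.array([[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]])
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

R_gt = rotation_about(0, 40.0)
R_p = rotation_about(2, 25.0)
len_p = np.array([0.8, 0.3, 0.5])

# In the second cloud the same axes appear rotated and in a shuffled order.
shuffle = [2, 0, 1]
R_q = (R_gt @ R_p)[:, shuffle]
len_q = len_p[shuffle]

R = rotation_from_lrfs(R_p, len_p, R_q, len_q)
```

Sorting by length is what makes the one-to-one axis assignment unambiguous; once two lengths tie, the sort order is arbitrary and the correspondence is dropped.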

3. End-to-End Training Loss

\[\epsilon (T_{p,q}) = \sqrt{\frac{1}{|C|}\sum_{(p_i,q_i) \in C}{\|T_{p,q}p_i-q_i\|_2^2}}
\]

\[L_{pose} = \sum_{(p,q)\in C}{\left(1-\frac{\min(\epsilon(T_{p,q}), \gamma)}{\gamma} -s\right)}
\]

\(\gamma\) is a threshold and \(s\) is the score of the point correspondence predicted by the matching network
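The two formulas translate almost line by line into code. This is a plain NumPy sketch for checking the arithmetic only; an actual training setup would write the same expressions in an autodiff framework. The function names and toy values are mine.

```python
import numpy as np

def pose_error(R, t, src, dst):
    """epsilon(T): RMS distance between transformed source points and targets."""
    residuals = src @ R.T + t - dst
    return float(np.sqrt(np.mean(np.sum(residuals ** 2, axis=1))))

def pose_loss(poses, scores, src, dst, gamma):
    """L_pose: sum over correspondences of 1 - min(eps, gamma) / gamma - s."""
    total = 0.0
    for (R, t), s in zip(poses, scores):
        eps = pose_error(R, t, src, dst)
        total += 1.0 - min(eps, gamma) / gamma - s
    return total

# Toy setup: the true pose is the identity.
rng = np.random.default_rng(3)
src = rng.normal(size=(10, 3))
dst = src.copy()

good_pose = (np.eye(3), np.zeros(3))                  # predicted from a good match
bad_pose = (np.eye(3), np.array([10.0, 0.0, 0.0]))    # predicted from an outlier
loss = pose_loss([good_pose, bad_pose], scores=[0.25, 0.0], src=src, dst=dst, gamma=0.5)
shift_error = pose_error(np.eye(3), np.array([3.0, 4.0, 0.0]), src, dst)
```

The threshold \(\gamma\) clips the error, so a pose that is far off contributes the same amount no matter how far off it is.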

The \(L_{pose}\) above can be combined with other widely used registration loss functions to train end-to-end from feature matching to registration.

Experiments

datasets: 3DMatch, 3DLoMatch; KITTI; ModelNet, ModelLoNet
correspondence matchers: Predator, RegTR, GeoTr
metrics: RR (Registration Recall), RRE (Relative Rotation Error), RTE (Relative Translation Error)

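Nothing much to say: with the matcher held fixed, Q-REG is SOTA across the board, and it also beats other estimators (ICP, PointDSC, ...). The ablation study further shows that every Q-REG component improves the metrics to some degree: the quadric-fitting single-correspondence solver, local optimization, and use in end-to-end training.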

Run-time

[^ 1]: Karel Lebeda, Jiří Matas, and Ondřej Chum. Fixing the locally optimized RANSAC: full experimental evaluation. In British Machine Vision Conference. Citeseer, 2012.
