基于字典学习的多视图低维相关表示方法研究

摘要：近年来，基于图像集的识别分类引起了极大的关注，由于两个主要原因：基于单幅图像的表示方法对图像质量要求较高且效果不稳定；数字时代可以很容易获得一个物体的多幅图像。基于图像集识别分类的关键在于如何给图像集建模且如何衡量两个图像集间的相似性。在本文中，我们先用典型相关分析（CCA）对所有图像集进行特征提取，以减少计算复杂度；然后将训练集用费希尔鉴别字典学习来训练，每类图像集训练一个子字典，令系数矩阵稀疏，从而使字典更加具有鉴别力且稳定；再将测试集建模成一个正则化包，所有训练集（即所有子字典）协同表示一个正则化包，以此保持训练集间的关系；最后用距离来衡量图像集间的相似性，并进行分类。用Honda/UCSD和CMU Mobo图像集来测试，且获得不错的结果。

关键词图像集分类典型相关分析字典学习协同表示

毕业设计说明书外文摘要

Title Research on Multi-view Low-dimensional Correlation Representation Based on Dictionary Learning

Abstract：Recently, the classification based on image set has attracted great attention, due to two main reasons: the representation methods based on single image are unstable and require higher quality of image sets and the digital age can easily obtain multiple images of one object. The key issue of classification is how to model the image sets and how to measure the resemblance between the two image sets. In this paper, we first use canonical correlation analysis (CCA) to extract features of all sets to reduce the computational complexity; then we use Fisher discriminant dictionary learning to train the gallery sets, and a sub-dictionary represent image sets of one subject. We make the coefficient matrix sparse so that the dictionary is more discriminative and stable. By modeling the query set as a regularized hull, we represent this hull collaboratively over all the gallery sets to keep relationship between gallery sets. Finally, distance between image sets is utilized to measure the similarity and classify. Experiments were performed on the Honda and Mobo datasets and achieved great results.

Keywords Image set classification, canonical correlation analysis, dictionary learning, collaborative representation

1 引言（或绪论）2

1.1 工程背景及意义2

1.2 相关技术的现状2

1.3 总体技术方案及其社会影响3

1.4 技术方案的经济因素分析3

1.5 论文章节安排3

2 特征提取4

2.1 典型相关分析4

2.2 特征融合6

3 费希尔有鉴别字典学习7

3.1 字典学习7

3.2 费希尔辨别保真项8

3.3 费希尔辨别系数项9

3.4 费希尔辨别字典学习模型及分类9

4 基于图像集的协同表示11

4.1 图像集分类11

4.2 基于集-集距离的包11

4.3 基于集-集距离的协同表示12

5 基于费希尔字典学习的集-集距离协同表示模型及分类14

6 实验分析19

6.1 参数设置19

6.2 Honda/UCSD数据集19

6.2 CMU Mobo数据集21

结论24

致谢25

参考文献26

附录28

1 引言（或绪论）

如今，多视图学习已经成为机器学习中不可或缺的重要内容和研究方向。这是由于基于单幅图像的分类识别方法有两大缺点：一是对分类的图像质量要求高，二是不稳定。相比较而言，基于图像集的表示方法纳入了远超单幅图像的有用信息，不仅能够更精细准确的表示原对象的特征，而且能有效的减少一些不利因素及异常点的影响，这就使得基于图像集的学习算法更加稳定和鲁棒。然而，基于单幅图像的子空间学习方法并不都适用于图像集的情形，因此，非常有必要研究图像集的低维表示方法，充分利用图像集中包含的丰富信息，构建其整体判别的有效算法，对于丰富图像集的多视图理论，提高相关学习算法的实际性能，具有重要的理论和应用价值。基于字典学习的多视图低维相关表示方法研究:http://www.chuibin.com/jisuanji/lunwen_205646.html