matlab图像处理外文翻译外文文献英文文献基于视觉的矿井救援机器人场景识别.doc

资源描述

《matlab图像处理外文翻译外文文献英文文献基于视觉的矿井救援机器人场景识别.doc》由会员分享，可在线阅读，更多相关《matlab图像处理外文翻译外文文献英文文献基于视觉的矿井救援机器人场景识别.doc（12页珍藏版）》请在三一办公上搜索。

1、附录A 英文原文Scene recognition for mine rescue robot localization based on visionCUI Yi-an(崔益安), CAI Zi-xing(蔡自兴), WANG Lu(王璐)Abstract：A new scene recognition system was presented based on fuzzy logic and hidden Markov model(HMM) that can be applied in mine rescue robot localization during emergencies.

2、The system uses monocular camera to acquire omni-directional images of the mine environment where the robot locates. By adopting center-surround difference method, the salient local image regions are extracted from the images as natural landmarks. These landmarks are organized by using HMM to repres

3、ent the scene where the robot is, and fuzzy logic strategy is used to match the scene and landmark. By this way, the localization problem, which is the scene recognition problem in the system, can be converted into the evaluation problem of HMM. The contributions of these skills make the system have

4、 the ability to deal with changes in scale, 2D rotation and viewpoint. The results of experiments also prove that the system has higher ratio of recognition and localization in both static and dynamic mine environments.Key words: robot location; scene recognition; salient image; matching strategy; f

5、uzzy logic; hidden Markov model1 IntroductionSearch and rescue in disaster area in the domain of robot is a burgeoning and challenging subject1. Mine rescue robot was developed to enter mines during emergencies to locate possible escape routes for those trapped inside and determine whether it is saf

6、e for human to enter or not. Localization is a fundamental problem in this field. Localization methods based on camera can be mainly classified into geometric, topological or hybrid ones2. With its feasibility and effectiveness, scene recognition becomes one of the important technologies of topologi

7、cal localization.Currently most scene recognition methods are based on global image features and have two distinct stages: training offline and matching online.During the training stage, robot collects the images of the environment where it works and processes the images to extract global features t

8、hat represent the scene. Some approaches were used to analyze the data-set of image directly and some primary features were found, such as the PCA method 3. However, the PCA method is not effective in distinguishing the classes of features. Another type of approach uses appearance features including

9、 color, texture and edge density to represent the image. For example, ZHOU et al4 used multidimensional histograms to describe global appearance features. This method is simple but sensitive to scale and illumination changes. In fact, all kinds of global image features are suffered from the change o

10、f environment.LOWE 5 presented a SIFT method that uses similarity invariant descriptors formed by characteristic scale and orientation at interest points to obtain the features. The features are invariant to image scaling, translation, rotation and partially invariant to illumination changes. But SI

11、FT may generate 1 000 or more interest points, which may slow down the processor dramatically.During the matching stage, nearest neighbor strategy(NN) is widely adopted for its facility and intelligibility6. But it cannot capture the contribution of individual feature for scene recognition. In exper

12、iments, the NN is not good enough to express the similarity between two patterns. Furthermore, the selected features can not represent the scene thoroughly according to the state-of-art pattern recognition, which makes recognition not reliable7.So in this work a new recognition system is presented,

13、which is more reliable and effective if it is used in a complex mine environment. In this system, we improve the invariance by extracting salient local image regions as landmarks to replace the whole image to deal with large changes in scale, 2D rotation and viewpoint. And the number of interest poi

14、nts is reduced effectively, which makes the processing easier. Fuzzy recognition strategy is designed to recognize the landmarks in place of NN, which can strengthen the contribution of individual feature for scene recognition. Because of its partial information resuming ability, hidden Markov model

15、 is adopted to organize those landmarks, which can capture the structure or relationship among them. So scene recognition can be transformed to the evaluation problem of HMM, which makes recognition robust.2 Salient local image regions detectionResearches on biological vision system indicate that or

16、ganism (like drosophila) often pays attention to certain special regions in the scene for their behavioral relevance or local image cues while observing surroundings 8. These regions can be taken as natural landmarks to effectively represent and distinguish different environments. Inspired by those,

17、 we use center-surround difference method to detect salient regions in multi-scale image spaces. The opponencies of color and texture are computed to create the saliency map.Follow-up, sub-image centered at the salient position in S is taken as the landmark region. The size of the landmark region ca

18、n be decided adaptively according to the changes of gradient orientation of the local image 11.Mobile robot navigation requires that natural landmarks should be detected stably when environments change to some extent. To validate the repeatability on landmark detection of our approach, we have done

19、some experiments on the cases of scale, 2D rotation and viewpoint changes etc. Fig.1 shows that the door is detected for its saliency when viewpoint changes. More detailed analysis and results about scale and rotation can be found in our previous works12.3 Scene recognition and localizationDifferent

20、 from other scene recognition systems, our system doesnt need training offline. In other words, our scenes are not classified in advance. When robot wanders, scenes captured at intervals of fixed time are used to build the vertex of a topological map, which represents the place where robot locates.

21、Although the maps geometric layout is ignored by the localization system, it is useful for visualization and debugging13 and beneficial to path planning. So localization means searching the best match of current scene on the map. In this paper hidden Markov model is used to organize the extracted la

22、ndmarks from current scene and create the vertex of topological map for its partial information resuming ability.Resembled by panoramic vision system, robot looks around to get omni-images. From Fig.1 Experiment on viewpoint changeseach image, salient local regions are detected and formed to be a se

23、quence, named as landmark sequence whose order is the same as the image sequence. Then a hidden Markov model is created based on the landmark sequence involving k salient local image regions, which is taken as the description of the place where the robot locates. In our system EVI-D70 camera has a v

24、iew field of 170. Considering the overlap effect, we sample environment every 45 to get 8 images. Let the 8 images as hidden state Si (1i8), the created HMM can be illustrated by Fig.2. The parameters of HMM, aij and bjk, are achieved by learning, using Baulm-Welch algorithm14. The threshold of conv

25、ergence is set as 0.001.As for the edge of topological map, we assign it with distance information between two vertices. The distances can be computed according to odometry readings.Fig.2 HMM of environmentTo locate itself on the topological map, robot must run its eye on environment and extract a l

26、andmark sequence L1 Lk , then search the map for the best matched vertex (scene). Different from traditional probabilistic localization15, in our system localization problem can be converted to the evaluation problem of HMM. The vertex with the greatest evaluation value, which must also be greater t

27、han a threshold, is taken as the best matched vertex, which indicates the most possible place where the robot is.4 Match strategy based on fuzzy logicOne of the key issues in image match problem is to choose the most effective features or descriptors to represent the original image. Due to robot mov

28、ement, those extracted landmark regions will change at pixel level. So, the descriptors or features chosen should be invariant to some extent according to the changes of scale, rotation and viewpoint etc. In this paper, we use 4 features commonly adopted in the community that are briefly described a

29、s follows.GO: Gradient orientation. It has been proved that illumination and rotation changes are likely to have less influence on it5.ASM and ENT: Angular second moment and entropy, which are two texture descriptors.H: Hue, which is used to describe the fundamental information of the image.Another

30、key issue in match problem is to choose a good match strategy or algorithm. Usually nearest neighbor strategy (NN) is used to measure the similarity between two patterns. But we have found in the experiments that NN cant adequately exhibit the individual descriptor or features contribution to simila

31、rity measurement. As indicated in Fig.4, the input image Fig.4(a) comes from different view of Fig.4(b). But the distance between Figs.4(a) and (b) computed by Jefferey divergence is larger than Fig.4(c).To solve the problem, we design a new match algorithm based on fuzzy logic for exhibiting the su

32、btle changes of each features. The algorithm is described as below.And the landmark in the database whose fused similarity degree is higher than any others is taken as the best match. The match results of Figs.2(b) and (c) are demonstrated by Fig.3. As indicated, this method can measure the similari

33、ty effectively between two patterns.Fig.3 Similarity computed using fuzzy strategy5 Experiments and analysisThe localization system has been implemented on a mobile robot, which is built by our laboratory. The vision system is composed of a CCD camera and a frame-grabber IVC-4200. The resolution of

34、image is set to be 400320 and the sample frequency is set to be 10 frames/s. The computer system is composed of 1 GHz processor and 512 M memory, which is carried by the robot. Presently the robot works in indoor environments.Because HMM is adopted to represent and recognize the scene, our system ha

35、s the ability to capture the discrimination about distribution of salient local image regions and distinguish similar scenes effectively. Table 1 shows the recognition result of static environments including 5 laneways and a silo. 10 scenes are selected from each environment and HMMs are created for

36、 each scene. Then 20 scenes are collected when the robot enters each environment subsequently to match the 60 HMMs above.In the table, “truth” means that the scene to be localized matches with the right scene (the evaluation value of HMM is 30% greater than the second high evaluation). “Uncertainty”

37、 means that the evaluation value of HMM is greater than the second high evaluation under 10%. “Error match” means that the scene to be localized matches with the wrong scene. In the table, the ratio of error match is 0. But it is possible that the scene to be localized cant match any scenes and new

38、vertexes are created. Furthermore, the “ratio of truth” about silo is lower because salient cues are fewer in this kind of environment.In the period of automatic exploring, similar scenes can be combined. The process can be summarized as: when localization succeeds, the current landmark sequence is

39、added to the accompanying observation sequence of the matched vertex un-repeatedly according to their orientation (including the angle of the image from which the salient local region and the heading of the robot come). The parameters of HMM are learned again.Compared with the approaches using appea

40、rance features of the whole image (Method 2, M2), our system (M1) uses local salient regions to localize and map, which makes it have more tolerance of scale, viewpoint changes caused by robots movement and higher ratio of recognition and fewer amount of vertices on the topological map. So, our syst

41、em has better performance in dynamic environment. These can be seen in Table 2. Laneways 1, 2, 4, 5 are in operation where some miners are working, which puzzle the robot.6 Conclusions1) Salient local image features are extracted to replace the whole image to participate in recognition, which improv

42、e the tolerance of changes in scale, 2D rotation and viewpoint of environment image.2) Fuzzy logic is used to recognize the local image, and emphasize the individual features contribution to recognition, which improves the reliability of landmarks.3) HMM is used to capture the structure or relations

43、hip of those local images, which converts the scene recognition problem into the evaluation problem of HMM.4) The results from the above experiments demonstrate that the mine rescue robot scene recognition system has higher ratio of recognition and localization. Future work will be focused on using

44、HMM to deal with the uncertainty of localization.附录B 中文翻译基于视觉的矿井救援机器人场景识别CUI Yi-an(崔益安), CAI Zi-xing(蔡自兴), WANG Lu(王璐)摘要：基于模糊逻辑和隐马尔可夫模型（HMM），论文提出了一个新的场景识别系统，可应用于紧急情况下矿山救援机器人的定位。该系统使用单眼相机获取机器人所处位置的全方位的矿井环境图像。通过采用中心环绕差分法，从图像中提取突出的位置图像区域作为自然的位置标志。这些标志通过使用HMM有机组织起来代表机器人坐在场景，模糊逻辑算法用来匹配场景和位置标志。通过这种方式，定位问

45、题，即系统的现场识别问题，可以转化为对HMM的评价问题。这些技术贡献使系统具有处理比率变化、二维旋转和视角变化的能力。实验结果还证明，该系统在静态和动态矿山环境中都具有较高的识别和定位的成功率。关键字：机器人定位；场景识别；突出图像；匹配算法；模糊逻辑；隐马尔可夫模型1 介绍在机器人领域搜索和救援灾区是一个新兴而富有挑战性的课题。矿井救援机器人的开发是为了在紧急情况下进入矿井为被困人员查找可能的逃生路线，并确定该线路是否安全。定位识别是这个领域的基本问题。基于摄像头的定位可以主要分为几何法、拓扑法或混合法。凭借其可行性和有效性，场景识别成为拓扑定位的重要技术之一。目前，大多数场景识别方法是基于

46、全局图像特征，有两个不同的阶段：离线培训和在线匹配。在训练阶段，机器人收集其所工作环境的图像，并处理这些图像提取出能表征该场景的全局特征。一些方法直接分析图像数据得到一些基本特征，比如PCA方法。但是，PCA方法是不能区分特征的类别。另一种方法使用外观特征包括颜色、纹理和边缘密度来表示图像。例如，周等人用多维直方图来描述全局外观特征。此方法简单，但对比率和光照变化敏感。事实上，各种全局图像特征，所受来自环境变化的影响。LOWE提出了SIFT方法，该方法利用关注点尺度和方向所形成的描述的相似性获得特征。这些特征对于图像缩放、平移、旋转和局部光照不变是稳定的。但SIFT可能产生1 000个或更多的

47、兴趣点，这可能使处理器大大减慢。在匹配阶段，近邻算法（NN）因其简单和可行而被广泛采用。但是它并不能捕捉到个别特征对场景识别的贡献。在实验中，NN在表达两种部分之间的相似性时效果并不足够好。此外，所选的特征并不能彻底地按照国家模式识别标准表示场景，这使得识别结果不可靠。因此，在这些分析中提出了一种新的识别系统，如果使用在复杂的矿井环境中它将更加可靠和有效。在这个系统中，我们通过提取突出的图像局部区域作为位置标志用以替代整个图像，改善了信息的稳定性，从而处理比率、二维旋转和视角的变化。兴趣点数量有效减少，这使得处理更加容易。模糊识别算法用以识别邻近位置的位置标志，它可以增强个别特征对场景识别的作

48、用。由于它的部分信息恢复能力，采用隐马尔可夫模型组织这些位置标志，它可以捕捉到的结构或标志之间的关系。因此，场景识别可以转化为对HMM评价问题，这使得识别具有鲁棒性。2 局部图像区域不变形的检测生物视觉系统的研究表明，生物体（像果蝇）在观察周围环境时，经常因为他们的行为习惯注意场景中确定的特殊区域或者局部图像信息。这些区域可以当作天然的位置标志有效地表示和区别不同环境。受这些启示，我们利用中心环绕差分法检测多尺度图像空间突出的区域。计算颜色和纹理的相似度用以绘制突出区域的地图。随后，以地图突出位置为中心的分图像，被定义为位置标志区域。位置标志区域的大小可以根据该区域图像梯度方向的变化自适应决定

49、。移动机器人的导航要求当环境有一定程度变化时自然位置标志能被稳定地检测出来。为了验证我们方法对位置标志检测的的可重复性，我们已经在图像比例、二维旋转和视角等变化时，做了一些实验。图1表明当视角变化时因为它的突出效果大门能被检测出来。关于比率和旋转更详细的分析和结果可以在我们以前的论文中发现。图1 关于视角变化的实验3 场景识别和定位与其他场景识别系统不同的是，我们的不需要离线培训。换句话说，在前进中，我们不必对场景分类。当机器人徘徊时，在固定时间间隔内捕获的场景用于生成拓扑地图的顶点，它表示了机器人所在位置。虽然地图的几何布局被定位系统忽视，但它对可视化调试是有用的，并对路径规划很有益处。因此，定位即意味在地图上搜索当前场景的最佳匹配位置。在论文中隐马尔

展开阅读全文

matlab图像处理 外文翻译 外文文献 英文文献 基于视觉的矿井救援机器人场景识别.doc

matlab图像处理外文翻译外文文献英文文献基于视觉的矿井救援机器人场景识别.doc