Object recognition supported by user interaction for service robots: latest articles

Active contours using a potential field
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048413
D. M. Honea, W. Snyder, G. Bilbro
In most implementations of active contours (snakes), the evolution of the snake depends only on image characteristics in the immediate neighborhood of the current snake points. This is true even when there is little edge data available in the current neighborhood, and even when the boundary of interest may be some distance away in the image. This paper proposes a vector potential field at each point in the image that is derived from the "pull" exerted by all edge points in the image; the pull for a given edge is inversely proportional to the square of the distance from the pixel it pulls. This potential field acts as a force, and snake points are moved based on the force at their current location, rather than moving to minimize energy at a candidate position. The resulting algorithm allows edges to influence snake evolution earlier and from a greater distance, and results in faster and better convergence to the final boundary under a variety of image characteristics.
Citations: 3
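The inverse-square "pull" described in the abstract above could be sketched roughly as follows. This is only an illustration under stated assumptions: the function names, the fixed step length, and the normalisation of the force are hypothetical choices, and the paper may combine this external force with other terms that the abstract does not describe.

```python
import math

def force_at(point, edge_points, eps=1e-9):
    """Sum the inverse-square pulls exerted on `point` by every edge point."""
    px, py = point
    fx = fy = 0.0
    for ex, ey in edge_points:
        dx, dy = ex - px, ey - py
        d2 = dx * dx + dy * dy
        if d2 < eps:
            # The point coincides with this edge pixel: skip it (assumption).
            continue
        d = math.sqrt(d2)
        # Magnitude 1/d^2 along the unit direction (dx, dy)/d.
        fx += dx / (d2 * d)
        fy += dy / (d2 * d)
    return fx, fy

def step_snake(snake, edge_points, step=1.0):
    """Move each snake point `step` pixels along the local force direction."""
    moved = []
    for p in snake:
        fx, fy = force_at(p, edge_points)
        norm = math.hypot(fx, fy)
        if norm == 0.0:
            moved.append(p)  # no net pull: leave the point where it is
        else:
            moved.append((p[0] + step * fx / norm, p[1] + step * fy / norm))
    return moved
```

Because every edge point contributes, a snake point far from any boundary still receives a non-zero force, which is the behaviour the abstract contrasts with neighbourhood-only energy minimisation.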
Topologies on the planar orthogonal grid
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048312
R. Klette
Reviews two different topological spaces for the orthogonal planar grid, the Alexandroff-Hopf and the Wyse topology. We show isomorphy and homeomorphy between different spaces which are used or applicable in image analysis.
Citations: 9
Hierarchical interpretation of human activities using competitive learning
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048308
H. Wechsler, Zoran Duric, Fayin Li
In this paper we describe a method of learning hierarchical representations for describing and recognizing gestures expressed as one- and two-arm movements using competitive learning methods. At the low end of the hierarchy, the atomic motions ("letters") corresponding to flow fields computed from successive color image frames are derived using Learning Vector Quantization (LVQ). At the next intermediate level, the atomic motions are clustered into actions ("words") using homogeneity criteria. The highest level combines actions into activities ("sentences") using proximity-driven clustering. We demonstrate the feasibility and the robustness of our approach on real color-image sequences, each consisting of several hundred frames corresponding to dynamic one- and two-arm movements.
Citations: 2
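For readers unfamiliar with Learning Vector Quantization, a single update step might look like the sketch below. The abstract only names LVQ, so the choice of the basic LVQ1 rule, the learning rate, and the plain lists standing in for flow-field feature vectors are all assumptions made for illustration.

```python
def nearest(prototypes, x):
    """Index of the prototype closest to x (squared Euclidean distance)."""
    def dist2(vec):
        return sum((a - b) ** 2 for a, b in zip(vec, x))
    return min(range(len(prototypes)), key=lambda i: dist2(prototypes[i][0]))

def lvq1_update(prototypes, x, label, lr=0.1):
    """One LVQ1 step on a list of (vector, label) prototypes.

    The winning prototype moves toward x when its label matches `label`
    and away from x otherwise. Returns the index of the winner.
    """
    i = nearest(prototypes, x)
    vec, proto_label = prototypes[i]
    sign = 1.0 if proto_label == label else -1.0
    prototypes[i] = ([w + sign * lr * (a - w) for w, a in zip(vec, x)], proto_label)
    return i
```

After training, each prototype plays the role of one atomic motion ("letter"), and a new flow-field vector is labelled by its nearest prototype.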
Towards log-polar fixation for mobile robots - analysis of corner tracking on the log-polar camera
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048299
A. Yeung, N. Barnes
Fixating on objects is fundamental to active vision tasks such as reaching, navigation and docking. Most techniques have been designed for space-invariant cameras. This research proposes a new method for corner tracking to facilitate point fixation for a mobile robot using a foveated camera. When the target point is in the centre of the image, it falls in the fovea and its position can be accurately tracked at high resolution. At the same time, the periphery has a reduced pixel count, which reduces the image processing computation compared to a uniform camera with the same field of view. If the target point suddenly moves into the periphery, it still appears in the lower-resolution part of the image, and coarser control can bring it back into the fovea. Our experimental results demonstrate the stability of the proposed method, and the performance of our implementation is adequate for real-time tracking applications.
Citations: 3
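The space-variant sampling behind a log-polar camera can be illustrated with the standard log-polar coordinate transform. The sketch below is generic and is not taken from the paper: the function names, the `r_min` clamp, and the use of continuous coordinates rather than a discrete sensor layout are assumptions.

```python
import math

def to_log_polar(x, y, cx, cy, r_min=1.0):
    """Map image point (x, y) to (log-radius, angle) about the centre (cx, cy)."""
    dx, dy = x - cx, y - cy
    # Clamp the radius so the logarithm is defined at the centre itself.
    r = max(math.hypot(dx, dy), r_min)
    return math.log(r / r_min), math.atan2(dy, dx)

def from_log_polar(rho, theta, cx, cy, r_min=1.0):
    """Inverse mapping back to Cartesian image coordinates."""
    r = r_min * math.exp(rho)
    return cx + r * math.cos(theta), cy + r * math.sin(theta)
```

Equal steps in log-radius cover ever larger rings of the image, which is why the periphery needs far fewer samples than the centre for the same field of view.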
A POCS-based method for reducing artifacts in BDCT compressed images
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048286
J. Zou, Hong Yan
The theory of projection onto convex sets (POCS) is applied to reduce blocking artifacts in compressed images coded by the block discrete cosine transform (BDCT). An image before compression is simulated by a triangular mesh, which is the basis of a proposed smoothness constraint set. The mesh is constructed by dividing each block into a set of triangles. The proposed method outperforms four existing methods subjectively and objectively.
Citations: 1
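The general idea of projection onto convex sets is to alternate projections until the estimate lies in (or close to) the intersection of all constraint sets. The sketch below shows that loop with two deliberately simple stand-in sets: a per-sample box, loosely analogous to a quantization constraint, and an equality constraint across two samples, loosely analogous to smoothness across a block boundary. It does not implement the paper's triangular-mesh smoothness constraint set, and all names are hypothetical.

```python
def project_box(x, lo, hi):
    """Projection onto the box {lo[i] <= x[i] <= hi[i]}: clip each sample."""
    return [min(max(v, l), h) for v, l, h in zip(x, lo, hi)]

def project_equal(x, i, j):
    """Projection onto the subspace {x[i] == x[j]}: replace both by their mean."""
    y = list(x)
    y[i] = y[j] = 0.5 * (x[i] + x[j])
    return y

def pocs(x, lo, hi, i, j, iterations=50):
    """Alternate the two projections for a fixed number of iterations."""
    for _ in range(iterations):
        x = project_equal(x, i, j)
        x = project_box(x, lo, hi)
    return x
```

When the two sets intersect, the iterates approach a point satisfying both constraints; in a deblocking setting the projections would act on image blocks and their DCT coefficients rather than on a short list.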
Feasibility of Hough-transform-based iris localisation for real-time-application
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048486
Klaus D. Tönnies, F. Behrens, Melanie Aurnhammer
We present a fast method for locating iris features in frontal face images based on the Hough transform. It consists of an initial iris detection step and a tracking step which uses iris features from initialisation to speed up computation. The purpose of the research was to evaluate the feasibility of the method for tracking at 200 frames per second or higher. Processing time of the prototypical implementation on a 266 MHz Pentium II PC is approximately 6 seconds for initial iris detection and about 0.05 seconds for each tracking step. Further speed-up using faster equipment seems feasible. The algorithm was applied to images of subjects taken under normal room lighting conditions. Tests showed robustness with respect to shadowing and partial occlusion of the iris. The localisation error was below two pixels. Accuracy for tracking was within one pixel. Reducing the number of pixels processed in the tracking step by 90% resulted in only a modest degradation of the results.
Citations: 43
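As a rough illustration of Hough-style voting for a circular feature such as the iris boundary, the sketch below accumulates votes for circle centres of a known radius. The abstract does not give the parameterisation used, so the fixed radius, the angular sampling, and the integer accumulator cells are assumptions, and the function name is hypothetical.

```python
import math
from collections import Counter

def hough_circle_centre(edge_points, radius, n_angles=72):
    """Vote for centres of circles with the given radius.

    Returns ((cx, cy), votes) for the accumulator cell with the most votes.
    """
    votes = Counter()
    for x, y in edge_points:
        cells = set()
        for k in range(n_angles):
            t = 2.0 * math.pi * k / n_angles
            # A centre at distance `radius` from this edge point, rounded to a cell.
            cells.add((round(x - radius * math.cos(t)),
                       round(y - radius * math.sin(t))))
        for cell in cells:
            votes[cell] += 1  # each edge point votes at most once per cell
    return votes.most_common(1)[0]
```

Restricting the vote to edge points near the previous estimate is one plausible way a tracking step could be made cheaper than the initial detection, although the abstract does not say how the speed-up is achieved.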
Membership authentication in dynamic face groups
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048317
Shaoning Pang, Daijin Kim, S. Bang
Presents a method to authenticate an individual's membership in a group without revealing the individual's identity and without restricting how the group's membership may change. The method is robust to variations in both the size of the group and its members.
Citations: 0
Visual contour tracking based on sequential importance sampling/resampling algorithm
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048366
P. Li, Tianwen Zhang
The Condensation algorithm can deal with non-Gaussian, nonlinear visual contour tracking in a unified way. Despite its simple implementation and generality, it has two main limitations. The first is that, in the sampling stage, the algorithm does not take advantage of the new measurements. As a result of this inefficient sampling strategy, the algorithm needs a large number of samples to represent the posterior distribution of the state. The second is that, in the selection step, resampling may introduce the problem of sample impoverishment. To address these two problems, we present an improved visual tracker based on an importance sampling/resampling algorithm. A Gaussian density for each sample is adopted as the sub-optimal importance proposal distribution, which steers the samples towards regions of high likelihood by taking the latest observations into account. We also adopt an effective sample size criterion to determine whether resampling is necessary. Experiments with real image sequences show that the performance of the new algorithm improves considerably for tracking in visual clutter.
Citations: 4
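The effective-sample-size test mentioned in the abstract above is commonly written as N_eff = 1 / sum(w_i^2) over normalised weights, with resampling triggered only when N_eff falls below a threshold. The sketch below follows that common form; the 0.5 threshold ratio, the multinomial resampling, and the function names are assumptions rather than details from the paper.

```python
import random

def effective_sample_size(weights):
    """N_eff = 1 / sum(w_i^2), computed on weights normalised to sum to 1."""
    total = sum(weights)
    normalised = [w / total for w in weights]
    return 1.0 / sum(w * w for w in normalised)

def maybe_resample(particles, weights, threshold_ratio=0.5, rng=random):
    """Resample only when N_eff drops below threshold_ratio * N.

    Returns (particles, weights, resampled_flag).
    """
    n = len(particles)
    if effective_sample_size(weights) >= threshold_ratio * n:
        return particles, weights, False
    # Multinomial resampling: draw n particles in proportion to their weights.
    chosen = rng.choices(particles, weights=weights, k=n)
    return chosen, [1.0 / n] * n, True
```

Skipping the resampling step while the weights are still well spread keeps distinct samples alive for longer, which is how such a criterion limits sample impoverishment.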
Automatic target detection using PMMW and LADAR imagery
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048385
M. R. Stevens, M. Snorrason, Sengvieng Amphay
The need for air-to-ground missiles with autonomous target acquisition (ATA) seekers is in large part driven by the failure of pilot-guided bombs in cloudy conditions (as demonstrated in Kosovo). Passive millimeter-wave (PMMW) sensors have the ability to see through clouds; in fact, they tend to show metallic objects (such as mobile ground targets) in high contrast regardless of weather conditions. However, their resolution is very low compared with other popular ATA sensors such as laser radar (LADAR). We present an ATA algorithm suite that combines the superior target detection potential of PMMW with the high-quality segmentation and recognition abilities of LADAR. Preliminary detection and segmentation results are presented for a set of image pairs of military vehicles that were collected for this project using an 89 GHz, 18" aperture PMMW sensor and a 1.06 μm very-high-resolution LADAR.
Citations: 2
Knowledge-based numeric open caption recognition for live sportscast
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048429
Si-Hun Sung, Woo-Sung Chun
Knowledge-based numeric open caption recognition is proposed that can recognize numeric captions generated by a character generator (CG) and automatically superimpose a modified caption using the recognized text, but only when a valid numeric caption appears in the targeted region of a live sportscast scene produced by another broadcasting station. In the proposed method, mesh features are extracted from an enhanced binary image as feature vectors, and the numeric information is then recovered from the image by recognizing each character with a multilayer perceptron (MLP) network. The result is verified using a knowledge-based rule set designed for a more stable and reliable output, and the modified information is then displayed on screen by the CG. MLB EyeCaption, based on the proposed algorithm, has already been used for regular Major League Baseball (MLB) programs broadcast live over a Korean nationwide TV network and has produced a favorable response from Korean viewers.
Citations: 7
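The abstract above does not define its mesh features. One common reading in character recognition is the fraction of foreground pixels in each cell of a grid laid over the binarised character, and the sketch below assumes that reading. The function name, the grid size, and the list-of-lists image format are hypothetical.

```python
def mesh_features(image, rows, cols):
    """Foreground fraction of each cell in a rows x cols grid over a binary image.

    `image` is a list of equal-length rows containing 0 (background) or
    1 (foreground). Features are returned cell by cell in row-major order.
    """
    height, width = len(image), len(image[0])
    features = []
    for r in range(rows):
        y0, y1 = r * height // rows, (r + 1) * height // rows
        for c in range(cols):
            x0, x1 = c * width // cols, (c + 1) * width // cols
            cell = [image[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            features.append(sum(cell) / len(cell) if cell else 0.0)
    return features
```

A vector like this would then serve as the input to a classifier such as the MLP network the abstract mentions.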