A Graph-Partitioning Framework for Aligning Hierarchical Topic Structures to Presentations

IEEE Transactions on Audio Speech and Language Processing, Pub Date: 2013-05-01, DOI: 10.1109/TASL.2013.2244084

Xiao-Dan Zhu, Colin Cherry, Gerald Penn

Citations: 0

Abstract

This paper studies the problem of imposing an existing hierarchical semantic structure onto a corresponding spoken document in which that structure is embedded, with the goal of indexing such documents for easier access. We propose a graph-partitioning framework that solves a semantic tree-to-string alignment problem by optimizing a normalized-cut criterion. We present models with different modeling capabilities and time complexities in this framework and provide experimental evidence of their performance. We relate graph partitioning to conventional dynamic time warping (DTW) as it applies to this problem, and show that the proposed framework can naturally include topic segmentation to accommodate cohesion constraints.
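To make the normalized-cut criterion mentioned in the abstract concrete, the following is a minimal sketch, not the paper's model. It assumes a symmetric similarity matrix over the utterances of a transcript and scores a contiguous segmentation with the standard K-way normalized cut, i.e. the sum over segments of cut(A, V \ A) / assoc(A, V). The names `normalized_cut`, `best_two_way_split`, and the toy matrix `sim` are hypothetical and chosen only for illustration; the tree-to-string alignment described in the abstract involves more structure than this linear example.

```python
from typing import List, Sequence


def normalized_cut(sim: Sequence[Sequence[float]], boundaries: List[int]) -> float:
    """Score a contiguous segmentation with the K-way normalized cut.

    sim is a symmetric n x n similarity matrix (an assumed input).
    boundaries holds the start index of every segment after the first,
    in increasing order; [2] splits 0..n-1 into [0, 1] and [2, ..., n-1].
    """
    n = len(sim)
    edges = [0] + list(boundaries) + [n]
    total = 0.0
    for start, end in zip(edges, edges[1:]):
        inside = range(start, end)
        # assoc(A, V): weight from the segment's nodes to all nodes.
        assoc = sum(sim[i][j] for i in inside for j in range(n))
        # cut(A, V \ A): weight that crosses the segment border.
        cut = sum(
            sim[i][j]
            for i in inside
            for j in range(n)
            if not (start <= j < end)
        )
        if assoc > 0:
            total += cut / assoc
    return total


def best_two_way_split(sim: Sequence[Sequence[float]]) -> int:
    """Exhaustively pick the single boundary with the lowest normalized cut."""
    n = len(sim)
    return min(range(1, n), key=lambda b: normalized_cut(sim, [b]))


# Toy data (made up): utterances 0-1 and 2-3 form two cohesive blocks.
sim = [
    [1.0, 0.9, 0.1, 0.1],
    [0.9, 1.0, 0.1, 0.1],
    [0.1, 0.1, 1.0, 0.9],
    [0.1, 0.1, 0.9, 1.0],
]
boundary = best_two_way_split(sim)  # the boundary falls between the two blocks
```

A lower score means segments that are internally cohesive and weakly connected to the rest of the sequence. The exhaustive search here is only practical for toy inputs.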

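Source journal

IEEE Transactions on Audio Speech and Language Processing (Engineering and Technology: Engineering, Electrical and Electronic)

Self-citation rate

0.00%

Articles published

Review time

24.0 months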

Journal description: The IEEE Transactions on Audio, Speech and Language Processing covers the sciences, technologies and applications relating to the analysis, coding, enhancement, recognition and synthesis of audio, music, speech and language. In particular, audio processing also covers auditory modeling, acoustic modeling and source separation. Speech processing also covers speech production and perception, adaptation, lexical modeling and speaker recognition. Language processing also covers spoken language understanding, translation, summarization, mining, general language modeling, as well as spoken dialog systems.