首页 > 最新文献

2010 Second International Conference on Knowledge and Systems Engineering最新文献

英文 中文
Extracting Parallel Texts from the Web 从网络中提取平行文本
Pub Date : 2010-10-07 DOI: 10.1109/KSE.2010.14
Le Quang Hung, L. Cuong
Parallel corpus is the valuable resource for some important applications of natural language processing such as statistical machine translation, dictionary construction, cross-language information retrieval. The Web is a huge resource of knowledge, which partly contains bilingual information in various kinds of web pages. It currently attracts many studies on building parallel corpora based on the Internet resource. However, obtaining a parallel corpus with high accuracy is still a challenge. This paper focuses on extracting parallel texts from bilingual web-sites of the English and Vietnamese language pair. We first propose a new way of designing content-based features, and then combining them with structural features under a framework of machine learning. In the experiment we obtain 88.2% of precision for the extracted parallel texts.
并行语料库是统计机器翻译、词典构建、跨语言信息检索等自然语言处理领域重要应用的宝贵资源。网络是一个巨大的知识资源,在各种网页中,部分包含双语信息。目前,基于网络资源构建并行语料库的研究备受关注。然而,获得高精度的并行语料库仍然是一个挑战。本文主要研究了从英语和越南语对双语网站中提取平行文本的方法。我们首先提出了一种新的基于内容的特征设计方法,然后在机器学习的框架下将其与结构特征相结合。在实验中,我们获得了88.2%的准确率提取平行文本。
{"title":"Extracting Parallel Texts from the Web","authors":"Le Quang Hung, L. Cuong","doi":"10.1109/KSE.2010.14","DOIUrl":"https://doi.org/10.1109/KSE.2010.14","url":null,"abstract":"Parallel corpus is the valuable resource for some important applications of natural language processing such as statistical machine translation, dictionary construction, cross-language information retrieval. The Web is a huge resource of knowledge, which partly contains bilingual information in various kinds of web pages. It currently attracts many studies on building parallel corpora based on the Internet resource. However, obtaining a parallel corpus with high accuracy is still a challenge. This paper focuses on extracting parallel texts from bilingual web-sites of the English and Vietnamese language pair. We first propose a new way of designing content-based features, and then combining them with structural features under a framework of machine learning. In the experiment we obtain 88.2% of precision for the extracted parallel texts.","PeriodicalId":158823,"journal":{"name":"2010 Second International Conference on Knowledge and Systems Engineering","volume":"189 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132504784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
SDA*: A Simple and Unifying Solution to Recent Bioinformatic Challenges for Conservation Genetics SDA*:一个简单和统一的解决方案,以最近的生物信息学挑战保护遗传学
Pub Date : 2010-10-07 DOI: 10.1109/KSE.2010.24
B. Minh, S. Klaere, A. von Haeseler
Recently, several algorithms have been proposed to tackle different conservation questions under phylogenetic diversity. Such questions are variants of the more general problem of budgeted reserve selection under split diversity, an NP-hard problem. Here, we present a novel framework, Split Diversity Algorithm* (SDA*), to unify all these attempts. More specifically, SDA* transforms the budgeted reserve selection problem into a binary linear programming(BLP), that can be solved by available linear optimization techniques. SDA* guarantees to find optimal solutions in reasonable time.
近年来,人们提出了几种算法来解决系统发育多样性下的不同保护问题。这些问题是分裂多样性下预算储备选择问题的变体,这是一个np困难问题。在这里,我们提出了一个新的框架,分裂分集算法* (SDA*),以统一所有这些尝试。更具体地说,SDA*将预算储备选择问题转化为一个二元线性规划(BLP)问题,可用线性优化技术求解。SDA*保证在合理的时间内找到最优解。
{"title":"SDA*: A Simple and Unifying Solution to Recent Bioinformatic Challenges for Conservation Genetics","authors":"B. Minh, S. Klaere, A. von Haeseler","doi":"10.1109/KSE.2010.24","DOIUrl":"https://doi.org/10.1109/KSE.2010.24","url":null,"abstract":"Recently, several algorithms have been proposed to tackle different conservation questions under phylogenetic diversity. Such questions are variants of the more general problem of budgeted reserve selection under split diversity, an NP-hard problem. Here, we present a novel framework, Split Diversity Algorithm* (SDA*), to unify all these attempts. More specifically, SDA* transforms the budgeted reserve selection problem into a binary linear programming(BLP), that can be solved by available linear optimization techniques. SDA* guarantees to find optimal solutions in reasonable time.","PeriodicalId":158823,"journal":{"name":"2010 Second International Conference on Knowledge and Systems Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131295152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Investigation of the Hemodynamic Response in Near Infrared Spectroscopy Data Analysis 近红外光谱数据分析中血流动力学响应的研究
Pub Date : 2010-10-07 DOI: 10.1109/KSE.2010.26
Le Hoa Nguyen, K. Hong
Near infrared spectroscopy (NIRS) is an effective technique for examining functional brain activity during cognitive tasks by enabling the measurement of the concentration changes of oxy-hemoglobin and deoxy-hemoglobin. In NIRS data analysis, accurate estimation of the hemodynamic response function (HRF) is still under investigation. Most existing methods assume that the shape of the HRF to be known. This assumption may not be appropriate when the HRF varies from subject to subject or from region to region. In this paper, a deconvolution algorithm to estimate the HRF is presented. The advantage of this method is no prior hypothesis about the shape of the HRF is required. In addition, in order to increase the sensitivity of NIRS to functional brain activity, an adaptive filter is designed to remove physiological noises from the noisy NIRS data. In order to verify the effectiveness of the proposed methods, numerical simulations were performed, the results of which are provided herein.
近红外光谱(NIRS)通过测量氧血红蛋白和脱氧血红蛋白的浓度变化,是一种有效的检测认知任务期间脑功能活动的技术。在近红外光谱数据分析中,血流动力学反应函数(HRF)的准确估计仍在研究中。大多数现有的方法都假定HRF的形状是已知的。当HRF因主题而异或因区域而异时,这种假设可能就不合适了。本文提出了一种估计HRF的反卷积算法。该方法的优点是不需要对HRF的形状进行先验假设。此外,为了提高近红外光谱对脑功能活动的敏感性,设计了一种自适应滤波器,从嘈杂的近红外光谱数据中去除生理噪声。为了验证所提方法的有效性,进行了数值模拟,并给出了仿真结果。
{"title":"Investigation of the Hemodynamic Response in Near Infrared Spectroscopy Data Analysis","authors":"Le Hoa Nguyen, K. Hong","doi":"10.1109/KSE.2010.26","DOIUrl":"https://doi.org/10.1109/KSE.2010.26","url":null,"abstract":"Near infrared spectroscopy (NIRS) is an effective technique for examining functional brain activity during cognitive tasks by enabling the measurement of the concentration changes of oxy-hemoglobin and deoxy-hemoglobin. In NIRS data analysis, accurate estimation of the hemodynamic response function (HRF) is still under investigation. Most existing methods assume that the shape of the HRF to be known. This assumption may not be appropriate when the HRF varies from subject to subject or from region to region. In this paper, a deconvolution algorithm to estimate the HRF is presented. The advantage of this method is no prior hypothesis about the shape of the HRF is required. In addition, in order to increase the sensitivity of NIRS to functional brain activity, an adaptive filter is designed to remove physiological noises from the noisy NIRS data. In order to verify the effectiveness of the proposed methods, numerical simulations were performed, the results of which are provided herein.","PeriodicalId":158823,"journal":{"name":"2010 Second International Conference on Knowledge and Systems Engineering","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128509444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Safe Typing for Transactional vs. Lock-Based Concurrency in Multi-threaded Java 多线程Java中事务性与基于锁的并发性的安全类型
Thi Mai Thuong Tran, Olaf Owe, M. Steffen
Many concurrency models have been developed for high-level programming languages such as Java. A trend here is towards more flexible concurrency control protocols, going beyond the original Java multi-threading treatment based on lexically-scoped concurrency control mechanism. Two proposals supporting flexible, non-lexical concurrency control are the lock-handling via the Lock-classes in Java5 and Transactional Featherweight Java (TFJ), an extension of Featherweight Java by transactions. Even if these two take quite different approaches towards dealing with concurrency —“pessimistic” or lock-based vs. “optimistic” or based on transactions— the added flexibility of non-lexical use of the corresponding concurrency operators comes at a similar price: improper usage leads to run-time exceptions and unwanted behavior. This is in contrast with the more disciplined use under a lexically scoped regime, where each entrance to a critical region is syntactically accompanied by a corresponding exit (as e.g. with traditional synchronized methods or as with so-called atomic blocks).To assure safe use of locking, resp. transactions in these settings, we present in this paper abstractions in the form of two static type and effect systems, which make sure that for instance, no lock is released by a thread which does not hold it, resp., that no commit is executed outside any transaction. We furthermore compare the two mentioned approaches to concurrency control on the basis of these type abstractions.
已经为Java等高级编程语言开发了许多并发模型。这里的趋势是朝着更灵活的并发控制协议发展,超越了基于词法作用域并发控制机制的原始Java多线程处理。支持灵活的非词法并发控制的两个建议是通过Java5中的lock类和事务性轻量级Java (Transactional Featherweight Java, TFJ)进行锁处理,TFJ是轻量级Java的一种事务扩展。即使这两种方法在处理并发性时采用完全不同的方法——“悲观”或基于锁与“乐观”或基于事务——非词法使用相应并发操作符所增加的灵活性也付出了相似的代价:不正确的使用会导致运行时异常和不想要的行为。这与在词法作用域下更严格的使用形成对比,在词法作用域下,每个关键区域的入口在语法上都伴随着相应的出口(例如,使用传统的同步方法或所谓的原子块)。为确保安全使用锁,请按在这些设置下的事务,我们以两种静态类型和效果系统的形式提出抽象,以确保例如,不持有锁的线程不会释放锁。,在任何事务之外都不会执行任何提交。我们进一步比较了上述两种基于这些类型抽象的并发控制方法。
{"title":"Safe Typing for Transactional vs. Lock-Based Concurrency in Multi-threaded Java","authors":"Thi Mai Thuong Tran, Olaf Owe, M. Steffen","doi":"10.1109/KSE.2010.9","DOIUrl":"https://doi.org/10.1109/KSE.2010.9","url":null,"abstract":"Many concurrency models have been developed for high-level programming languages such as Java. A trend here is towards more flexible concurrency control protocols, going beyond the original Java multi-threading treatment based on lexically-scoped concurrency control mechanism. Two proposals supporting flexible, non-lexical concurrency control are the lock-handling via the Lock-classes in Java5 and Transactional Featherweight Java (TFJ), an extension of Featherweight Java by transactions. Even if these two take quite different approaches towards dealing with concurrency —“pessimistic” or lock-based vs. “optimistic” or based on transactions— the added flexibility of non-lexical use of the corresponding concurrency operators comes at a similar price: improper usage leads to run-time exceptions and unwanted behavior. This is in contrast with the more disciplined use under a lexically scoped regime, where each entrance to a critical region is syntactically accompanied by a corresponding exit (as e.g. with traditional synchronized methods or as with so-called atomic blocks).To assure safe use of locking, resp. transactions in these settings, we present in this paper abstractions in the form of two static type and effect systems, which make sure that for instance, no lock is released by a thread which does not hold it, resp., that no commit is executed outside any transaction. We furthermore compare the two mentioned approaches to concurrency control on the basis of these type abstractions.","PeriodicalId":158823,"journal":{"name":"2010 Second International Conference on Knowledge and Systems Engineering","volume":"308 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131953131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2010 Second International Conference on Knowledge and Systems Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1