
Applied Corpus Linguistics: Latest Articles

Analysis of verb argument constructions (VACs) in L2 learners across proficiency levels: A corpus-based study in L1 Indonesian
Pub Date : 2024-06-16 DOI: 10.1016/j.acorp.2024.100097
Febriana Lestari

This study investigated the development of constructional knowledge among L1 Indonesian learners of English by examining nineteen Verb-Argument Constructions (VACs). The VACs examined consist of a verb followed by a preposition and a noun phrase, for example, “V about N” as in “He talked about the progress”. The study used the Indonesian subset of the Education First Cambridge Open Language Database (EFCAMDAT) corpus, covering beginner to advanced levels (CEFR A1 to C1; Council of Europe, 2001). This dataset comprises 2943 written texts (224,763 words) from 623 learners. Frequency analysis of types and tokens was conducted to examine the distribution of the 19 VACs in learner writing across levels. Growth analyses were conducted to investigate the verbs that learners most frequently associated with the most productive VACs. Correlational analyses were conducted to explore how closely the verb-VAC associations, and the verbs occupying them, were related across proficiency levels. The results indicate that learners’ constructional knowledge development was reflected in: (1) an increase in the frequency of VAC types and tokens from lower to higher proficiency levels, (2) the variety of verbs associated with VACs, and (3) an increase in construction schematicity, indicated by a shift from general to more specific verbs that was distinct to each proficiency level. The results suggest that English language learners need more exposure to lexicogrammatical features to facilitate VAC acquisition and usage.
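The type and token frequency analysis described in this abstract can be illustrated with a minimal Python sketch. It assumes that "tokens" means the number of occurrences of a VAC and "types" means the number of distinct verbs filling its verb slot; the abstract does not define these counts, and the records and the function name `vac_type_token_counts` are hypothetical, not taken from the study.

```python
from collections import defaultdict

# Hypothetical hand-made records: (CEFR level, verb lemma, preposition).
# Illustrative only; these are not data from the study.
observations = [
    ("A1", "talk", "about"),
    ("A1", "talk", "about"),
    ("A1", "go", "to"),
    ("B1", "talk", "about"),
    ("B1", "think", "about"),
    ("B1", "worry", "about"),
    ("B1", "go", "to"),
]


def vac_type_token_counts(records):
    """Return {level: {"V prep N": (token_count, type_count)}}.

    Assumption: tokens = occurrences of the pattern,
    types = distinct verbs observed in the pattern.
    """
    verbs_by_slot = defaultdict(list)
    for level, verb, prep in records:
        verbs_by_slot[(level, f"V {prep} N")].append(verb)

    summary = defaultdict(dict)
    for (level, vac), verbs in verbs_by_slot.items():
        summary[level][vac] = (len(verbs), len(set(verbs)))
    return {level: dict(vacs) for level, vacs in summary.items()}


counts = vac_type_token_counts(observations)
for level in sorted(counts):
    print(level, counts[level])
```

In this toy data, "V about N" grows in both tokens and types from A1 to B1, which is the kind of pattern the abstract reports as evidence of development.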

Citations: 0
From Argentina to Zimbabwe: Exploring the global appeal of the International Baccalaureate
Pub Date : 2024-06-15 DOI: 10.1016/j.acorp.2024.100096
Saira Fitzgerald

This paper presents the third stage of a larger research project examining perceptions of the International Baccalaureate (IB) to better understand its growing influence on education systems around the world. The first two stages involved a synchronic and diachronic analysis of IB discourse in a 27-million-word specialized corpus of global press articles, created as an unsolicited window into public opinion (Mautner, 2008). The present study uses the same corpus to explore how the IB is represented in different countries, what values and attitudes may be associated with it, and how it interacts with other global education actors. Bottom-up and top-down methods from corpus-assisted discourse studies (CADS) were used to analyze 34,104 newspaper articles from 56 countries. Frequency, collocation, and concordance analyses revealed four dominant discourses of deficiency connected to national education systems in countries across the ideological spectrum; these discourses helped to legitimize the inclusion of private actors in the provision of education. Results also showed unique discourses associated with the IB in North America, thereby highlighting the key role that this region plays in the IB world.

Open access PDF: https://www.sciencedirect.com/science/article/pii/S2666799124000133/pdfft?md5=2c0d3c0ae023763fdea6e64970af8fc6&pid=1-s2.0-S2666799124000133-main.pdf
Citations: 0
Attitudes, communicative functions, and lexicogrammatical features of anti-vaccine discourse on Telegram
Pub Date : 2024-05-14 DOI: 10.1016/j.acorp.2024.100095
Souad Boumechaal, Serge Sharoff

This paper reports the process of collecting a corpus with examples of anti-vaccine discourse and the results of its linguistic analysis. The overall aim of the project is to help public health authorities improve their communication campaigns by better understanding the conditions under which misinformation spreads via social media. More specifically, this paper analyses linguistic properties of a corpus of prominent misinformation channels on Telegram, compared with a more general COVID corpus and with a general-purpose English corpus. For this paper, the quantitative analysis relies on corpus querying to identify the most recurrent discourse patterns related to COVID vaccines. We use the appraisal framework to analyse the patterns with respect to the attitudes conveyed in the messages. We have also applied an automatic AI classifier to predict the communicative functions of these texts. This allows us to examine them more closely through the use of simple lexicogrammatical features following Biber, as well as their ideational processes following Halliday. The findings show that common collocations in the Telegram corpus containing misinformation draw on three attitudes: fear, insecurity, and mistrust in COVID vaccines, which are discursively constructed to promote vaccine hesitancy among social media users. Furthermore, the misinformation messages tend to occur more often in such communicative functions as promotional texts, news reporting, and texts presenting reference information.

Citations: 0
Wash your hands: CDC, WHO, and NHS tweets in the #COVID19 pandemic
Pub Date : 2024-05-13 DOI: 10.1016/j.acorp.2024.100094
Katherine A Ireland

This work tracks public health messaging and evidence of stability and change in corpora of the Centers for Disease Control and Prevention (CDC), World Health Organization (WHO), and National Health Service (NHS) official account tweets throughout 2020. Using corpus-based methods, including keyword analysis, major similarities and differences are identified across tweets by each organization over time. Larger macro-level and micro-level discourses and linguistic patterns are revealed, with specific applications relevant to public health and governmental messaging, especially regarding risk and health communication. Findings include the NHS providing the most comprehensive and varied messaging of the three organizations, including references to recommended actions, communities and individuals, and information. The WHO focuses predominantly on cases and region-specific information, while the CDC includes a variety of information, with a US-internal focus. Applications include further recommendations for public health communication, including the necessity of diverse linguistic patterns and interactive messaging tactics for governmental organizations.
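Keyword analysis, as mentioned in this abstract, compares a word's frequency in a target corpus with its frequency in a reference corpus. The abstract does not say which keyness statistic was used; the sketch below assumes the log-likelihood measure that is common in corpus linguistics, and the function name and the counts are hypothetical.

```python
import math


def log_likelihood(freq_target, size_target, freq_ref, size_ref):
    """Log-likelihood keyness of one word across two corpora.

    freq_* are the word's raw counts; size_* are corpus sizes in tokens.
    Assumption: this is one common keyness statistic, not necessarily
    the one used in the study.
    """
    total_freq = freq_target + freq_ref
    total_size = size_target + size_ref
    expected_target = size_target * total_freq / total_size
    expected_ref = size_ref * total_freq / total_size

    score = 0.0
    for observed, expected in (
        (freq_target, expected_target),
        (freq_ref, expected_ref),
    ):
        if observed > 0:  # a zero count contributes nothing to the sum
            score += observed * math.log(observed / expected)
    return 2 * score


# Hypothetical counts: a word seen 20 times in one 1,000-token corpus
# and 10 times in another corpus of the same size.
keyness = log_likelihood(20, 1000, 10, 1000)
print(round(keyness, 3))
```

A score of zero means the word is equally frequent, relative to corpus size, in both corpora; larger scores indicate a stronger frequency difference.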

Citations: 0
Applying corpus linguistics to the law
Pub Date : 2024-04-14 DOI: 10.1016/j.acorp.2024.100093
Jesse Egbert, Ute Römer-Barron
Citations: 0
Representations of obesity in Australian and UK news coverage: A diachronic comparison
Pub Date : 2024-03-18 DOI: 10.1016/j.acorp.2024.100092
Luke C. Collins, Paul Baker, Gavin Brookes

In both Australia and the UK, the number of adults living with obesity has been increasing over the last 30 years (AIHW, 2023; Baker, 2023). Although policy has emphasised ‘community-based interventions’ in Australia (AIHW, 2017) and ‘system-wide approaches’ in the UK (Ulijaszek and McLennan, 2016) for overcoming the challenges of obesity, previous research has shown that media coverage has been dominated by representations promoting individual responsibility (e.g., Kim & Willis, 2007). In this paper, we report our observations of representations documented in corpora of media coverage from Australia and the UK between 2008 and 2017. The corpora amount to 16.4 million tokens and 36 million tokens, respectively. We identify key semantic domains for each year of the corpora and discuss both consistent and shifting themes in the data. Our findings show that the Australian coverage provides a more sustained focus on responses to obesity at the societal level, referring to practices in the food industry and differences between communities that can lead to health disparities. By comparison, while the amount of UK press coverage referring to obesity increased, the content became more narrowly focussed on food consumption and weight loss over the study period. The findings demonstrate how media coverage contributes to public understanding of how to respond to the challenges of obesity.

Open access PDF: https://www.sciencedirect.com/science/article/pii/S2666799124000091/pdfft?md5=6e9ecc0d87ef63dc626b52509b233d53&pid=1-s2.0-S2666799124000091-main.pdf
Citations: 0
‘Luxurious’ metaphors in luxury hotel websites in Singapore and Hong Kong: A mixed-methods study
Pub Date : 2024-02-28 DOI: 10.1016/j.acorp.2024.100090
Joanna Zhuoan Chen, Kathleen Ahrens, Dennis Tay

Previous research has yielded a substantial body of empirical evidence regarding the use of metaphors in various types of discourse. However, limited research exists on the relationship between metaphor and more segmented economic industries, such as the luxury hospitality sector. The attention of this article is directed towards inspecting how metaphorical expressions are deployed by luxury hotels to construct their luxury identity and attract potential guests.

A corpus of 62 luxury hotel websites from Singapore and Hong Kong is used as the contextual background for the investigation of metaphor usage in this study. Using MIPVU (Metaphor Identification Procedure VU University Amsterdam), a total of 6990 metaphorical keywords, spanning a diverse range of 28 source domains, were observed. The five most productive source domains in the corpus are living organism, physical object, space, artifact, and motion. A mixed-methods approach that combines quantitative data analytics and qualitative discourse analysis reveals and interprets significant associations between source domains, hotel facilities, and regions, suggesting that the choice of metaphorical expressions is not arbitrary but is influenced by specific factors related to the hotel's offerings and cultures. This study emphasises that the analysis of lexical-conceptual patterns in promotional texts can generate deeper insights into positioning strategies.

Citations: 0
Using early LLMs for corpus linguistics: Examining ChatGPT's potential and limitations
Pub Date : 2024-02-23 DOI: 10.1016/j.acorp.2024.100089
Satoru Uchida

This study evaluates the extent to which information can be obtained from early Large Language Models (LLMs) for corpus linguistic research. Various tasks were conducted using ChatGPT 3.5, such as generating word frequency lists, collocations, words that fit certain grammatical patterns, and identifying genres. The generations were then compared with the search results from a large-scale general corpus (COCA). While favorable results were not achieved in identifying the genres of words or paragraphs, there was notable congruence in the frequency lists (75.0 %), collocations (42.8 %), and grammatical patterns (53.0 %) for the top 20 items. Even when the generated items did not perfectly match those from COCA, it was evident that high-frequency items were produced. Although LLMs may not be sufficient for rigorous academic research, the results are adequate for discerning overall trends or assisting learners. In addition, the results of this study show that the ability to search at the phrase level is an advantage of using LLMs for corpus research.
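The top-20 congruence figures in this abstract compare model-generated lists with corpus-derived lists. A minimal sketch of one way to compute such a figure follows; it assumes that congruence is the share of the top-n reference items that also appear among the top-n generated items, which the abstract does not spell out. The function name and the word lists are hypothetical.

```python
def top_n_overlap(generated, reference, n=20):
    """Percentage of the top-n items that both ranked lists share.

    Assumption: overlap ignores rank order within the top n.
    Both lists are expected to contain at least n items.
    """
    top_generated = set(generated[:n])
    top_reference = set(reference[:n])
    return 100 * len(top_generated & top_reference) / n


# Hypothetical ranked lists, truncated to four items for illustration.
generated_list = ["the", "of", "and", "to"]
reference_list = ["the", "and", "a", "in"]
overlap = top_n_overlap(generated_list, reference_list, n=4)
print(overlap)
```

Ignoring rank order keeps the measure simple; a rank-sensitive comparison would need a correlation coefficient instead.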

Open access PDF: https://www.sciencedirect.com/science/article/pii/S2666799124000066/pdfft?md5=322cc8730f1db87e3aee8190477b04ed&pid=1-s2.0-S2666799124000066-main.pdf
Citations: 0
here-, there-, and every where-: Exploring the role of pronominal adverbs in legal language
Pub Date : 2024-02-01 DOI: 10.1016/j.acorp.2024.100087
David Chandler, Brett Hashimoto

Many have claimed that pronominal adverbs (PAs), such as hereby, thereafter, and wherein, are frequent, distinctive, and problematic in legal language (Tiersma, 1999; Mellinkoff, 2004). The purpose of this study is to examine those claims empirically. In the present study, the prevalence of PAs in legal registers is compared to more general registers of contemporary American English to determine the extent to which these words are distinctly legal. The study also explores why different types of PAs may be (in)frequent in specific legal registers to better understand their use. The frequency of PAs was extracted from corpora that are designed to represent six registers of English (3 legal; 4 non-legal). Rates of occurrence of PAs per text were then compared across registers using Kruskal-Wallis tests with Dunn post-hoc tests and an eta-squared (η²) effect size. Subsequently, a functional analysis describing the uses of PAs was also conducted. The results indicate that PAs are highly restricted to legal registers because of the functions that they serve. The types of functions that PAs perform within a text are discussed. A closer examination of the PAs, considered both individually and grouped by locative adverb (i.e., here-, there-, and where-), indicates that some PAs are also more distinctive to certain legal registers for different reasons. This study opens the discussion as to the utility and necessity of PAs in legal language and provides suggestions for legal writers on how to use or remove PAs without inhibiting clarity or effectiveness.
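The register comparison in this abstract rests on a Kruskal-Wallis test with an eta-squared effect size. The sketch below computes the tie-corrected H statistic in plain Python and then an eta-squared value using the formula (H - k + 1) / (n - k), which is one commonly cited estimator; the abstract does not state which formula the authors used, so that choice is an assumption. The Dunn post-hoc step is omitted, and the per-text rates and register names are hypothetical.

```python
from collections import Counter


def kruskal_wallis_h(groups):
    """Tie-corrected Kruskal-Wallis H for a list of samples."""
    pooled = [value for group in groups for value in group]
    n_total = len(pooled)

    # Assign each distinct value the mean of the ranks it occupies.
    tie_counts = Counter(pooled)
    rank_of = {}
    ranks_used = 0
    for value in sorted(tie_counts):
        count = tie_counts[value]
        rank_of[value] = ranks_used + (count + 1) / 2
        ranks_used += count

    rank_term = sum(
        sum(rank_of[v] for v in group) ** 2 / len(group) for group in groups
    )
    h = 12 / (n_total * (n_total + 1)) * rank_term - 3 * (n_total + 1)

    # Correction factor equals 1 when there are no ties.
    ties = sum(c**3 - c for c in tie_counts.values())
    return h / (1 - ties / (n_total**3 - n_total))


def eta_squared_h(h, k, n_total):
    """Eta-squared from H (assumed estimator: (H - k + 1) / (n - k))."""
    return (h - k + 1) / (n_total - k)


# Hypothetical PA rates per text for three made-up registers.
registers = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
h_stat = kruskal_wallis_h(registers)
effect = eta_squared_h(h_stat, k=len(registers), n_total=9)
print(round(h_stat, 3), round(effect, 3))
```

A rank-based test suits per-text rates like these because such rates are often heavily skewed, with many texts containing no PAs at all.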

War in law: A corpus linguistic study of the lexical item war in the laws of war
Pub Date : 2024-01-28 DOI: 10.1016/j.acorp.2024.100088
Annabelle Lukin, Alexandra García Marrugo

As a crucial register of modernity, the laws of war provide a discursive environment for the production and/or maintenance of key categories associated with organized violence. The register hosts the concepts that are used to refer to mass organized violence (war, armed conflict), and has constructed and/or amplified categories of person that have been developed to legitimate war and give coherence to the international laws of war (e.g., prisoners of war, civilians). With the key texts of the international laws of war, including such well-known instances as the 1949 Geneva Conventions, now available in a searchable corpus format via the Sydney Corpus Lab, this paper explores the usage and meaning of war in this register where, in principle, the word war is a central part of a body of law which purports to put limits on organized violence. The method is essentially corpus-driven: it takes the usages of this lexical item in this register and explores its frequency, its typical local lexical environments, and its collocates. The analysis shows that while the concept of war is essential to the laws of war, it remains ill-defined, indeed virtually undefined, even as its collocational habits affirm its naturalness and legitimacy. As has been found elsewhere, in the laws of war, war and violence are treated as distinct phenomena, operating in distinct lexical environments. The paper is a contribution from corpus linguistics to the work of understanding the ideological effects of this highly significant legal register.
