{"title":"基于掩蔽语言模型的双向阅读深度学习","authors":"Hiroyuki Nishimoto","doi":"10.54941/ahfe1001178","DOIUrl":null,"url":null,"abstract":"Google BERT is a neural network that is good at natural language processing. It has two major strategies. One is “Masked language Model” to clear the word-level relationships, and the other is “Next Sentence Prediction” to clear sentence-level relationships. In the masked language model, with the task of masking some words in sentences, BERT learns to predict the original word from context. Some questions come to mind. Why BERT achieves effective learning by reading in two ways from fore and back? What is the difference between bidirectional reading? BERT learns to predict the original word using the surrounding words as context and to make two-way predictions by forward and backward readings in order to increase the precision. Besides, the bidirectional reading technique can be applied to scenario planning especially using back-casting from the future. This paper clarifies these mechanisms.","PeriodicalId":116806,"journal":{"name":"Human Systems Engineering and Design (IHSED2021) Future Trends and Applications","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Effective deep learning through bidirectional reading on masked language model\",\"authors\":\"Hiroyuki Nishimoto\",\"doi\":\"10.54941/ahfe1001178\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Google BERT is a neural network that is good at natural language processing. It has two major strategies. One is “Masked language Model” to clear the word-level relationships, and the other is “Next Sentence Prediction” to clear sentence-level relationships. In the masked language model, with the task of masking some words in sentences, BERT learns to predict the original word from context. Some questions come to mind. Why BERT achieves effective learning by reading in two ways from fore and back? What is the difference between bidirectional reading? BERT learns to predict the original word using the surrounding words as context and to make two-way predictions by forward and backward readings in order to increase the precision. Besides, the bidirectional reading technique can be applied to scenario planning especially using back-casting from the future. This paper clarifies these mechanisms.\",\"PeriodicalId\":116806,\"journal\":{\"name\":\"Human Systems Engineering and Design (IHSED2021) Future Trends and Applications\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Human Systems Engineering and Design (IHSED2021) Future Trends and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54941/ahfe1001178\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Systems Engineering and Design (IHSED2021) Future Trends and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54941/ahfe1001178","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
Google BERT是一个擅长自然语言处理的神经网络。它有两个主要策略。一种是“屏蔽语言模型”,用于清除词级关系;另一种是“下一句预测”,用于清除句子级关系。在掩蔽语言模型中,BERT以掩蔽句子中的某些单词为任务,学习从上下文中预测原单词。我想到了一些问题。为什么BERT通过前后两种方式的阅读来达到有效的学习?双向阅读的区别是什么?BERT学习使用周围的词作为上下文来预测原词,并通过向前和向后阅读进行双向预测,以提高精度。此外,双向阅读技术可以应用于情景规划,特别是使用从未来回溯的方法。本文阐明了这些机制。
Effective deep learning through bidirectional reading on masked language model
Google BERT is a neural network that is good at natural language processing. It has two major strategies. One is “Masked language Model” to clear the word-level relationships, and the other is “Next Sentence Prediction” to clear sentence-level relationships. In the masked language model, with the task of masking some words in sentences, BERT learns to predict the original word from context. Some questions come to mind. Why BERT achieves effective learning by reading in two ways from fore and back? What is the difference between bidirectional reading? BERT learns to predict the original word using the surrounding words as context and to make two-way predictions by forward and backward readings in order to increase the precision. Besides, the bidirectional reading technique can be applied to scenario planning especially using back-casting from the future. This paper clarifies these mechanisms.