{"title":"Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph","authors":"Chiara Alzetta, Ilaria Torre, Frosina Koceva","doi":"10.1007/s10758-023-09682-6","DOIUrl":null,"url":null,"abstract":"Abstract Extracting and formally representing the knowledge embedded in textbooks, such as the concepts explained and the relations between them, can support the provision of advanced knowledge-based services for learning environments and digital libraries. In this paper, we consider a specific type of relation in textbooks referred to as prerequisite relations (PR). PRs represent precedence relations between concepts aimed to provide the reader with the knowledge needed to understand a further concept(s). Their annotation in educational texts produces datasets that can be represented as a graph of concepts connected by PRs. However, building good-quality and reliable datasets of PRs from a textbook is still an open issue, not just for automated annotation methods but even for manual annotation. In turn, the lack of good-quality datasets and well-defined criteria to identify PRs affect the development and validation of automated methods for prerequisite identification. As a contribution to this issue, in this paper, we propose PREAP, a protocol for the annotation of prerequisite relations in textbooks aimed at obtaining reliable annotated data that can be shared, compared, and reused in the research community. PREAP defines a novel textbook-driven annotation method aimed to capture the structure of prerequisites underlying the text. The protocol has been evaluated against baseline methods for manual and automatic annotation. The findings show that PREAP enables the creation of prerequisite knowledge graphs that have higher inter-annotator agreement, accuracy, and alignment with text than the baseline methods. This suggests that the protocol is able to accurately capture the PRs expressed in the text. Furthermore, the findings show that the time required to complete the annotation using PREAP are significantly shorter than with the other manual baseline methods. The paper includes also guidelines for using PREAP in three annotation scenarios, experimentally tested. We also provide example datasets and a user interface that we developed to support prerequisite annotation.","PeriodicalId":46366,"journal":{"name":"Technology Knowledge and Learning","volume":"2 1","pages":"0"},"PeriodicalIF":3.0000,"publicationDate":"2023-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Technology Knowledge and Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10758-023-09682-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Extracting and formally representing the knowledge embedded in textbooks, such as the concepts explained and the relations between them, can support the provision of advanced knowledge-based services for learning environments and digital libraries. In this paper, we consider a specific type of relation in textbooks referred to as prerequisite relations (PR). PRs represent precedence relations between concepts aimed to provide the reader with the knowledge needed to understand a further concept(s). Their annotation in educational texts produces datasets that can be represented as a graph of concepts connected by PRs. However, building good-quality and reliable datasets of PRs from a textbook is still an open issue, not just for automated annotation methods but even for manual annotation. In turn, the lack of good-quality datasets and well-defined criteria to identify PRs affect the development and validation of automated methods for prerequisite identification. As a contribution to this issue, in this paper, we propose PREAP, a protocol for the annotation of prerequisite relations in textbooks aimed at obtaining reliable annotated data that can be shared, compared, and reused in the research community. PREAP defines a novel textbook-driven annotation method aimed to capture the structure of prerequisites underlying the text. The protocol has been evaluated against baseline methods for manual and automatic annotation. The findings show that PREAP enables the creation of prerequisite knowledge graphs that have higher inter-annotator agreement, accuracy, and alignment with text than the baseline methods. This suggests that the protocol is able to accurately capture the PRs expressed in the text. Furthermore, the findings show that the time required to complete the annotation using PREAP are significantly shorter than with the other manual baseline methods. The paper includes also guidelines for using PREAP in three annotation scenarios, experimentally tested. We also provide example datasets and a user interface that we developed to support prerequisite annotation.
期刊介绍:
Technology, Knowledge and Learning emphasizes the increased interest on context-aware adaptive and personalized digital learning environments. Rapid technological developments have led to new research challenges focusing on digital learning, gamification, automated assessment and learning analytics. These emerging systems aim to provide learning experiences delivered via online environments as well as mobile devices and tailored to the educational needs, the personal characteristics and the particular circumstances of the individual learner or a (massive) group of interconnected learners. Such diverse learning experiences in real-world and virtual situations generates big data which provides rich potential for in-depth intelligent analysis and adaptive feedback as well as scaffolds whenever the learner needs it. Novel manuscripts are welcome that account for how these new technologies and systems reconfigure learning experiences, assessment methodologies as well as future educational practices. Technology, Knowledge and Learning also publishes guest-edited themed special issues linked to the emerging field of educational technology.
Submissions can be empirical investigations, work in progress studies or emerging technology reports. Empirical investigations report quantitative or qualitative research demonstrating advances in digital learning, gamification, automated assessment or learning analytics. Work-in-progress studies provide early insights into leading projects or document progressions of excellent research within the field of digital learning, gamification, automated assessment or learning analytics. Emerging technology reports review new developments in educational technology by assessing the potentials for leading digital learning environments. Manuscripts submitted to Technology, Knowledge and Learning undergo a blind review process involving expert reviews and in-depth evaluations. Initial feedback is usually provided within eight weeks including in progress open-access abstracts and review snapshots.