[Short paper] Towards improved collaborative text editing CRDTs by using Natural Language Processing
Jim Bauwens, Kevin De Porre, Elisa Gonzalez Boix
Proceedings of the 10th Workshop on Principles and Practice of Consistency for Distributed Data, 2023-05-08
DOI: https://doi.org/10.1145/3578358.3591330
Citations: 0
Abstract
Collaborative text editing systems are used in a variety of cloud-based products. To ensure that documents remain consistent between users, these systems often rely on CRDTs, operational transformation, or other techniques for achieving (strong) eventual consistency. CRDT-based approaches are appealing as they incorporate strategies to ensure that concurrent updates cannot conflict. However, these strategies do not necessarily take into account program semantics and may result in unexpected behaviour from the end-user's perspective. For example, conflict resolution strategies in collaborative text editors may lead to duplicate words and incorrectly merged sentences. This position paper investigates the use of deterministic natural language processing (NLP) algorithms to improve the concurrency semantics of collaborative text editing systems that rely on CRDTs, aiming to provide a better end-user experience. We explore what is needed to ensure convergence, and highlight potential difficulties with the approach.
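The duplicate-word problem mentioned in the abstract can be illustrated with a toy example. The sketch below is not the authors' design: the element layout `(position, replica_id, word)`, the set-union `merge`, and the helper `collapse_adjacent_duplicates` are all hypothetical names chosen for illustration. It assumes a simplified sequence CRDT whose state is a grow-only set of positioned words, and it uses a trivial deterministic rule as a stand-in for a real NLP step.

```python
from typing import FrozenSet, List, Tuple

# Hypothetical element layout: (position, replica_id, word).
# Sorting by (position, replica_id) gives every replica the same total order.
Element = Tuple[float, str, str]


def merge(a: FrozenSet[Element], b: FrozenSet[Element]) -> FrozenSet[Element]:
    """Set union: commutative, associative, and idempotent, so replicas converge."""
    return a | b


def render(state: FrozenSet[Element]) -> List[str]:
    """Read the words in the order fixed by (position, replica_id)."""
    return [word for _, _, word in sorted(state)]


def collapse_adjacent_duplicates(words: List[str]) -> List[str]:
    """Stand-in for a deterministic NLP pass: drop a word that repeats its predecessor.

    This is a pure function of the converged state, so every replica
    computes the same view without extra coordination.
    """
    out: List[str] = []
    for w in words:
        if not out or out[-1].lower() != w.lower():
            out.append(w)
    return out


# Both replicas start from "fix bug" and concurrently insert "the" in the gap.
base = frozenset({(1.0, "a", "fix"), (3.0, "a", "bug")})
replica_a = base | {(2.0, "a", "the")}
replica_b = base | {(2.0, "b", "the")}

merged_ab = merge(replica_a, replica_b)
merged_ba = merge(replica_b, replica_a)

raw_view = render(merged_ab)                         # contains the duplicate word
clean_view = collapse_adjacent_duplicates(raw_view)  # duplicate removed in the view
```

Because the cleanup runs on the merged state rather than on individual operations, convergence of the underlying set is untouched; only the rendered view changes. The rule itself is deliberately naive: it would also collapse legitimate repetitions such as "had had", which hints at the kind of difficulty the abstract alludes to.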