{"title":"实现余弦相似度算法,提高血液学文本报表生成的灵活性","authors":"Aulia Amirullah, I. Aulia, Dedy Arisandy","doi":"10.1109/DATABIA50434.2020.9190549","DOIUrl":null,"url":null,"abstract":"The previous hematology textual summary representation system, which applies template based method of Natural Language Generation to produce hematology laboratory test results in natural language representation, was at the cutting edge to generate more detailed hematology reports. The produced reports manage to provide texts which break down the critical components and abnormal components of blood found in conventional hematology test results. The produced reports in natural language representation aimed to help patients to easily define, spot and point out which blood components are acting up. Templates provide slots to generate every single sentence to be replaced by the data that we provide. However, the previous system is only able to produce fixed unflexible slots of blood components which are defined by the system, named T-Gen System. It nearly got off the ground as it is very unflexible because the produced templates cannot hold all of both critical and abnormal components found in a produced laboratory examination result. Therefore, this research project implements cosine similarity algorithm to expand template flexibility. Testing and evaluation were carried out manually by examining given components into the system which will be added consecutively. The testing shows that every blood component which was added consecutively succesfully appeared in the produced texts.","PeriodicalId":165106,"journal":{"name":"2020 International Conference on Data Science, Artificial Intelligence, and Business Analytics (DATABIA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Implementing Cosine Similarity Algorithm to Increase the Flexibility of Hematology Text Report Generation\",\"authors\":\"Aulia Amirullah, I. Aulia, Dedy Arisandy\",\"doi\":\"10.1109/DATABIA50434.2020.9190549\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The previous hematology textual summary representation system, which applies template based method of Natural Language Generation to produce hematology laboratory test results in natural language representation, was at the cutting edge to generate more detailed hematology reports. The produced reports manage to provide texts which break down the critical components and abnormal components of blood found in conventional hematology test results. The produced reports in natural language representation aimed to help patients to easily define, spot and point out which blood components are acting up. Templates provide slots to generate every single sentence to be replaced by the data that we provide. However, the previous system is only able to produce fixed unflexible slots of blood components which are defined by the system, named T-Gen System. It nearly got off the ground as it is very unflexible because the produced templates cannot hold all of both critical and abnormal components found in a produced laboratory examination result. Therefore, this research project implements cosine similarity algorithm to expand template flexibility. Testing and evaluation were carried out manually by examining given components into the system which will be added consecutively. The testing shows that every blood component which was added consecutively succesfully appeared in the produced texts.\",\"PeriodicalId\":165106,\"journal\":{\"name\":\"2020 International Conference on Data Science, Artificial Intelligence, and Business Analytics (DATABIA)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Data Science, Artificial Intelligence, and Business Analytics (DATABIA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DATABIA50434.2020.9190549\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Data Science, Artificial Intelligence, and Business Analytics (DATABIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DATABIA50434.2020.9190549","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Implementing Cosine Similarity Algorithm to Increase the Flexibility of Hematology Text Report Generation
The previous hematology textual summary representation system, which applies template based method of Natural Language Generation to produce hematology laboratory test results in natural language representation, was at the cutting edge to generate more detailed hematology reports. The produced reports manage to provide texts which break down the critical components and abnormal components of blood found in conventional hematology test results. The produced reports in natural language representation aimed to help patients to easily define, spot and point out which blood components are acting up. Templates provide slots to generate every single sentence to be replaced by the data that we provide. However, the previous system is only able to produce fixed unflexible slots of blood components which are defined by the system, named T-Gen System. It nearly got off the ground as it is very unflexible because the produced templates cannot hold all of both critical and abnormal components found in a produced laboratory examination result. Therefore, this research project implements cosine similarity algorithm to expand template flexibility. Testing and evaluation were carried out manually by examining given components into the system which will be added consecutively. The testing shows that every blood component which was added consecutively succesfully appeared in the produced texts.