Nikolay A. Kolpakov, A. Molodchenkov, Anton V. Lukin
{"title":"从专利和科学出版物中提取生物医学信息的方法(以化合物为例)","authors":"Nikolay A. Kolpakov, A. Molodchenkov, Anton V. Lukin","doi":"10.22363/2658-4670-2023-31-1-64-74","DOIUrl":null,"url":null,"abstract":"This article proposes an algorithm for solving the problem of extracting information from biomedical patents and scientific publications. The introduced algorithm is based on machine learning methods. Experiments were carried out on patents from the USPTO database. Experiments have shown that the best extraction quality was achieved by a model based on BioBERT.","PeriodicalId":34192,"journal":{"name":"Discrete and Continuous Models and Applied Computational Science","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Methods of extracting biomedical information from patents and scientific publications (on the example of chemical compounds)\",\"authors\":\"Nikolay A. Kolpakov, A. Molodchenkov, Anton V. Lukin\",\"doi\":\"10.22363/2658-4670-2023-31-1-64-74\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article proposes an algorithm for solving the problem of extracting information from biomedical patents and scientific publications. The introduced algorithm is based on machine learning methods. Experiments were carried out on patents from the USPTO database. Experiments have shown that the best extraction quality was achieved by a model based on BioBERT.\",\"PeriodicalId\":34192,\"journal\":{\"name\":\"Discrete and Continuous Models and Applied Computational Science\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Discrete and Continuous Models and Applied Computational Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.22363/2658-4670-2023-31-1-64-74\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Discrete and Continuous Models and Applied Computational Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22363/2658-4670-2023-31-1-64-74","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Methods of extracting biomedical information from patents and scientific publications (on the example of chemical compounds)
This article proposes an algorithm for solving the problem of extracting information from biomedical patents and scientific publications. The introduced algorithm is based on machine learning methods. Experiments were carried out on patents from the USPTO database. Experiments have shown that the best extraction quality was achieved by a model based on BioBERT.