{"title":"基于语音的腹腔镜手术工具分割及其图像增强","authors":"Soorva Ram Shimgekar, Preetham Reddy Pathi, S. V","doi":"10.1109/i-PACT52855.2021.9696600","DOIUrl":null,"url":null,"abstract":"With more and more people preferring Laparo-scopic surgeries because of its benefits like smaller scars and being relatively painless in nature, there is immense need to solve the issues the doctors face while performing a Laparoscopic surgery. In this paper we propose a surgical tool segmentation and image enhancement method which can be controlled with a voice based interface. The proposed method solves problems like glare and fogging in addition to segmenting the surgical tools. A voice based interface was used so that the doctor does not have to get distracted while performing the surgery. This method can be helpful to surgeons and can also be helpful in achieving automated surgeries. For image segmentation we use Mask-RCNN and Kaldi Voice Recognition toolkit for the voice based interface. Kaldi toolkit has a great influence on the time taken by the system to understand the query.","PeriodicalId":335956,"journal":{"name":"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Voice based Segmentation of Laparoscopic Surgical Tools and its Image Enhancement\",\"authors\":\"Soorva Ram Shimgekar, Preetham Reddy Pathi, S. V\",\"doi\":\"10.1109/i-PACT52855.2021.9696600\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With more and more people preferring Laparo-scopic surgeries because of its benefits like smaller scars and being relatively painless in nature, there is immense need to solve the issues the doctors face while performing a Laparoscopic surgery. In this paper we propose a surgical tool segmentation and image enhancement method which can be controlled with a voice based interface. The proposed method solves problems like glare and fogging in addition to segmenting the surgical tools. A voice based interface was used so that the doctor does not have to get distracted while performing the surgery. This method can be helpful to surgeons and can also be helpful in achieving automated surgeries. For image segmentation we use Mask-RCNN and Kaldi Voice Recognition toolkit for the voice based interface. Kaldi toolkit has a great influence on the time taken by the system to understand the query.\",\"PeriodicalId\":335956,\"journal\":{\"name\":\"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/i-PACT52855.2021.9696600\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/i-PACT52855.2021.9696600","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Voice based Segmentation of Laparoscopic Surgical Tools and its Image Enhancement
With more and more people preferring Laparo-scopic surgeries because of its benefits like smaller scars and being relatively painless in nature, there is immense need to solve the issues the doctors face while performing a Laparoscopic surgery. In this paper we propose a surgical tool segmentation and image enhancement method which can be controlled with a voice based interface. The proposed method solves problems like glare and fogging in addition to segmenting the surgical tools. A voice based interface was used so that the doctor does not have to get distracted while performing the surgery. This method can be helpful to surgeons and can also be helpful in achieving automated surgeries. For image segmentation we use Mask-RCNN and Kaldi Voice Recognition toolkit for the voice based interface. Kaldi toolkit has a great influence on the time taken by the system to understand the query.