首页 > 最新文献

Journal of Korea Multimedia Society最新文献

英文 中文
U-Net Based Single Image Deraining Using the Wavelet Residue Channel Fusion Strategy 基于U-Net的小波残差信道融合单幅图像去噪
Pub Date : 2023-09-30 DOI: 10.9717/kmms.2023.26.9.1115
Tae-Hee Park
{"title":"U-Net Based Single Image Deraining Using the Wavelet Residue Channel Fusion Strategy","authors":"Tae-Hee Park","doi":"10.9717/kmms.2023.26.9.1115","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.9.1115","url":null,"abstract":"","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135040395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of Visual Attention on Korean Essay Book Covers Using Eye-tracking Devices 用眼动仪分析韩语随笔书封面的视觉注意
Pub Date : 2023-09-30 DOI: 10.9717/kmms.2023.26.9.1149
Minhee Park, Mahnwoo Kwon, Mikyung Hwang, Hyeonseong Kim
Today, with the advent of various communication media, the importance of the visual component of the book cover is increasing as the purchasing behavior pattern changes from paper books to e-books and from offline to online and mobile channels. In particular, social and environmental changes such as COVID-19 caused consumers
在各种传播媒介出现的今天,随着购买行为模式从纸质书转向电子书,从线下转向线上和移动渠道,图书封面的视觉成分的重要性日益增加。特别是新冠疫情等社会和环境变化给消费者带来的影响
{"title":"Analysis of Visual Attention on Korean Essay Book Covers Using Eye-tracking Devices","authors":"Minhee Park, Mahnwoo Kwon, Mikyung Hwang, Hyeonseong Kim","doi":"10.9717/kmms.2023.26.9.1149","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.9.1149","url":null,"abstract":"Today, with the advent of various communication media, the importance of the visual component of the book cover is increasing as the purchasing behavior pattern changes from paper books to e-books and from offline to online and mobile channels. In particular, social and environmental changes such as COVID-19 caused consumers","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135040365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attention Based-ConvLSTM-DNN Networks for Fine Dust Concentration Prediction 基于注意力的convlstm - dnn网络微尘浓度预测
Pub Date : 2023-08-31 DOI: 10.9717/kmms.2023.26.8.911
Joon-Min Lee, Kyeong-Tae Kim, Jae-Young Choi
Air pollution, particularly fine dust, poses a significant threat to public health and necessitates accurate prediction models for effective mitigation strategies. In this paper, we propose a so-called attention-based ConvLSTM-DNN networks for fine dust concentration prediction. Our proposed model integrates the feature extraction capabilities of a 2D Convolutional Neural Network (CNN) with the long-term memory retention of an LSTM, capturing spatial and temporal dependencies in the input data. We apply an attention mechanism to enhance the model
空气污染,特别是微细粉尘,对公众健康构成重大威胁,必须建立准确的预测模型,以便实施有效的缓解战略。在本文中,我们提出了一种所谓的基于注意力的ConvLSTM-DNN网络用于细尘浓度预测。我们提出的模型将2D卷积神经网络(CNN)的特征提取能力与LSTM的长期记忆保留相结合,捕获输入数据中的空间和时间依赖性。我们应用注意机制来增强模型
{"title":"Attention Based-ConvLSTM-DNN Networks for Fine Dust Concentration Prediction","authors":"Joon-Min Lee, Kyeong-Tae Kim, Jae-Young Choi","doi":"10.9717/kmms.2023.26.8.911","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.8.911","url":null,"abstract":"Air pollution, particularly fine dust, poses a significant threat to public health and necessitates accurate prediction models for effective mitigation strategies. In this paper, we propose a so-called attention-based ConvLSTM-DNN networks for fine dust concentration prediction. Our proposed model integrates the feature extraction capabilities of a 2D Convolutional Neural Network (CNN) with the long-term memory retention of an LSTM, capturing spatial and temporal dependencies in the input data. We apply an attention mechanism to enhance the model","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135991523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improvement of Prostate Cancer Aggressiveness Prediction Performance Using a Self-Supervised Learning Model Fine-Turned on Similar Medical Images from Multi-Parametric MR Images 利用自监督学习模型对多参数MR图像的相似医学图像进行微调,提高前列腺癌侵袭性预测性能
Pub Date : 2023-08-31 DOI: 10.9717/kmms.2023.26.8.995
Yejin Shin, Min-Jin Lee, Helen Hong, Sung-Il Hwang
In this paper, we propose a prostate cancer aggressiveness prediction model using self-supervised learning based on SimCLR with multi-parametric MR images. Self-supervised learning model is initially trained on the STL10 dataset, and then fine-tuned on the ProstateX dataset, which is similar to the downstream task dataset. To predict prostate cancer aggressiveness, downstream tasks are performed using each sequence of images from the multi-parametric MR dataset. The predicted results are combined using either majority voting or average voting for ensembling. Experimental results demonstrate that the self-supervised learning model fine-turned with similar images improves the performance by an average of 4.56% in accuracy, 20.69% in sensitivity, and 12.02% in negative predictive value. The ensemble method using majority voting with the self-supervised learning model fine-turned on similar images from the multi-parametric MR dataset yields the best performance in terms of accuracy (72.58%), balance accuracy (72.16%), and sensitivity (67.86%).
在本文中,我们提出了一种基于SimCLR的多参数MR图像自监督学习的前列腺癌侵袭性预测模型。自监督学习模型首先在STL10数据集上进行训练,然后在类似于下游任务数据集的ProstateX数据集上进行微调。为了预测前列腺癌的侵袭性,下游任务使用来自多参数MR数据集的每个图像序列进行。预测结果使用多数投票或平均投票进行组合。实验结果表明,对相似图像进行微调的自监督学习模型,准确率平均提高4.56%,灵敏度平均提高20.69%,负预测值平均提高12.02%。使用多数投票和自监督学习模型对多参数MR数据集中的相似图像进行微调的集成方法在准确率(72.58%)、平衡准确率(72.16%)和灵敏度(67.86%)方面表现最佳。
{"title":"Improvement of Prostate Cancer Aggressiveness Prediction Performance Using a Self-Supervised Learning Model Fine-Turned on Similar Medical Images from Multi-Parametric MR Images","authors":"Yejin Shin, Min-Jin Lee, Helen Hong, Sung-Il Hwang","doi":"10.9717/kmms.2023.26.8.995","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.8.995","url":null,"abstract":"In this paper, we propose a prostate cancer aggressiveness prediction model using self-supervised learning based on SimCLR with multi-parametric MR images. Self-supervised learning model is initially trained on the STL10 dataset, and then fine-tuned on the ProstateX dataset, which is similar to the downstream task dataset. To predict prostate cancer aggressiveness, downstream tasks are performed using each sequence of images from the multi-parametric MR dataset. The predicted results are combined using either majority voting or average voting for ensembling. Experimental results demonstrate that the self-supervised learning model fine-turned with similar images improves the performance by an average of 4.56% in accuracy, 20.69% in sensitivity, and 12.02% in negative predictive value. The ensemble method using majority voting with the self-supervised learning model fine-turned on similar images from the multi-parametric MR dataset yields the best performance in terms of accuracy (72.58%), balance accuracy (72.16%), and sensitivity (67.86%).","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135991682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of Bone Conduction Implants Piezoelectric Transducer Based on Rhombus Mechanism for Magnetic Resonance Compatibility Improvement 基于菱形机构的骨传导植入体压电换能器的磁共振相容性改善设计
Pub Date : 2023-08-31 DOI: 10.9717/kmms.2023.26.8.956
Dong-Ho Shin, Hyung-Gyu Lim, Myoung-Nam Kim, Ki-Woong Seong
This study introduces a novel piezoelectric transducer for bone conduction implants that combines piezoelectric elements with a rhombus mechanism to enhance compatibility with magnetic resonance environments. To derive the optimal design of the rhombus structure, various parameters were investigated using theoretical analysis and finite element analysis. A theoretical model of the rhombus structure was employed to identify the parameters affecting displacement amplification magnitude. Based on this, a parametric analysis was performed to calculate the displacement amplification ratio according to these parameters. The results showed that as the beam thickness and width increased, the amplification ratio reduced, while with an increased length, the amplification ratio was increased. Therefore, the optimal rhombus structure for the transducer featured beam dimensions of 0.15 mm thickness, 2 mm width, 3.5 mm length, and 5.5° inclination. This configuration amplified piezoelectric element displacement by a factor of 7.02. The amplification ratio remained constant as long as the mass applied to the rhombus frame to control mechanical resonance did not exceed the blocking force of the piezoelectric element. When a mass of 0.3 g was applied to the frame, mechanical resonance occurred at a frequency of 2 kHz, making it suitable as a transducer for bone conduction implants.
本研究介绍了一种新型骨传导植入物的压电换能器,该换能器将压电元件与菱形机构相结合,以增强与磁共振环境的兼容性。采用理论分析和有限元分析相结合的方法对各参数进行了研究,得出了菱形结构的最优设计方案。采用菱形结构的理论模型,确定了影响位移放大幅度的参数。在此基础上,进行了参数化分析,根据这些参数计算位移放大比。结果表明:随着光束厚度和宽度的增加,放大比减小,随着光束长度的增加,放大比增大;因此,换能器的最佳菱形结构的光束尺寸为0.15 mm厚,2mm宽,3.5 mm长,5.5°倾角。这种结构将压电元件的位移放大了7.02倍。只要施加在菱形框架上控制机械共振的质量不超过压电元件的阻挡力,放大比保持不变。当0.3 g的质量施加到框架上时,机械共振频率为2khz,使其适合作为骨传导植入物的换能器。
{"title":"Design of Bone Conduction Implants Piezoelectric Transducer Based on Rhombus Mechanism for Magnetic Resonance Compatibility Improvement","authors":"Dong-Ho Shin, Hyung-Gyu Lim, Myoung-Nam Kim, Ki-Woong Seong","doi":"10.9717/kmms.2023.26.8.956","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.8.956","url":null,"abstract":"This study introduces a novel piezoelectric transducer for bone conduction implants that combines piezoelectric elements with a rhombus mechanism to enhance compatibility with magnetic resonance environments. To derive the optimal design of the rhombus structure, various parameters were investigated using theoretical analysis and finite element analysis. A theoretical model of the rhombus structure was employed to identify the parameters affecting displacement amplification magnitude. Based on this, a parametric analysis was performed to calculate the displacement amplification ratio according to these parameters. The results showed that as the beam thickness and width increased, the amplification ratio reduced, while with an increased length, the amplification ratio was increased. Therefore, the optimal rhombus structure for the transducer featured beam dimensions of 0.15 mm thickness, 2 mm width, 3.5 mm length, and 5.5° inclination. This configuration amplified piezoelectric element displacement by a factor of 7.02. The amplification ratio remained constant as long as the mass applied to the rhombus frame to control mechanical resonance did not exceed the blocking force of the piezoelectric element. When a mass of 0.3 g was applied to the frame, mechanical resonance occurred at a frequency of 2 kHz, making it suitable as a transducer for bone conduction implants.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135991517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Corporate Bankruptcy Prediction Analysis Based on Financial and Non-Financial Information Using Deep Learning 基于深度学习的财务与非财务信息的企业破产预测分析研究
Pub Date : 2023-08-31 DOI: 10.9717/kmms.2023.26.8.1003
Joong-Hyun Park
In the past, research related to corporate bankruptcy has primarily conducted empirical analyses through bankruptcy prediction models using financial ratios. However, with the advancement of ICT technology, there has been a growing trend in applying artificial intelligence. In this study, both traditional corporate bankruptcy prediction methodologies and machine learning and deep learning methodologies from the field of deep learning were applied to present the results of corporate bankruptcy prediction models and their predictive power. The dataset used included corporate characteristics, including financial ratios and non-financial information, as well as macroeconomic indicators to account for economic conditions. Five models, SVM, RF, DNN, CNN, and LSTM, were designated, and the model reliability and prediction accuracy for each model were analyzed. The LSTM model demonstrated superior performance and the highest prediction accuracy among the models. When comparing different approaches using only financial ratios (Set 1), using financial ratios and corporate characteristics together (Set 2), and incorporating financial ratios, corporate characteristics, and macroeconomic indicators (Set 3), which included all of these factors, consistently exhibited the highest model reliability and prediction accuracy.
过去对企业破产的研究主要是利用财务比率进行破产预测模型的实证分析。然而,随着信息通信技术的进步,人工智能的应用呈现出日益增长的趋势。本研究运用传统的企业破产预测方法,以及深度学习领域的机器学习和深度学习方法来呈现企业破产预测模型的结果及其预测能力。所使用的数据集包括公司特征,包括财务比率和非财务信息,以及用于说明经济状况的宏观经济指标。设计了SVM、RF、DNN、CNN和LSTM 5种模型,并对每种模型的模型可靠性和预测精度进行了分析。LSTM模型表现出较好的预测性能和较高的预测精度。当比较仅使用财务比率(集合1)、同时使用财务比率和公司特征(集合2)以及合并财务比率、公司特征和宏观经济指标(集合3)的不同方法时,包括所有这些因素,始终表现出最高的模型可靠性和预测准确性。
{"title":"Research on Corporate Bankruptcy Prediction Analysis Based on Financial and Non-Financial Information Using Deep Learning","authors":"Joong-Hyun Park","doi":"10.9717/kmms.2023.26.8.1003","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.8.1003","url":null,"abstract":"In the past, research related to corporate bankruptcy has primarily conducted empirical analyses through bankruptcy prediction models using financial ratios. However, with the advancement of ICT technology, there has been a growing trend in applying artificial intelligence. In this study, both traditional corporate bankruptcy prediction methodologies and machine learning and deep learning methodologies from the field of deep learning were applied to present the results of corporate bankruptcy prediction models and their predictive power. The dataset used included corporate characteristics, including financial ratios and non-financial information, as well as macroeconomic indicators to account for economic conditions. Five models, SVM, RF, DNN, CNN, and LSTM, were designated, and the model reliability and prediction accuracy for each model were analyzed. The LSTM model demonstrated superior performance and the highest prediction accuracy among the models. When comparing different approaches using only financial ratios (Set 1), using financial ratios and corporate characteristics together (Set 2), and incorporating financial ratios, corporate characteristics, and macroeconomic indicators (Set 3), which included all of these factors, consistently exhibited the highest model reliability and prediction accuracy.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135991534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generation and Analysis of Webtoon Background Images Using GAN 基于GAN的网络漫画背景图像生成与分析
Pub Date : 2023-08-31 DOI: 10.9717/kmms.2023.26.8.1075
Je-Kyung Lee, Jeoung-Gi Kim, Jeong-In Ahn, Ji-Yeon Lim, Kyung-Ae Cha
In this paper, we propose a method to reduce the amount of manual work in webtoon creation and utilize creative contents derived from AI learning through deep learning-based technology that generates background images of various styles. To achieve this goal, we train CartoonGAN and AnimeGAN models that are specialized in creating images in the style of webtoons and animations, and create background images that can be used for webtoons. Recently, various Generative Adversarial Network (GAN) models have been actively used to create digital content, but cartoon-style images should be created with simplified textures and sharp outlines. In addition, when converting a real image into a cartoon style, it is necessary to create a simple and abstract image while maintaining the content expressed by the image. We build training data suitable for the production of these webtoon-style images, and analyze whether the images generated by the two GAN models can be used for webtoon production, and seek ways to utilize generative AI.
在本文中,我们提出了一种方法,通过基于深度学习的技术生成各种风格的背景图像,减少网络漫画创作中的手工工作量,并利用人工智能学习衍生的创意内容。为了实现这一目标,我们训练了专门用于创建网络漫画和动画风格图像的CartoonGAN和AnimeGAN模型,并创建可用于网络漫画的背景图像。最近,各种生成对抗网络(GAN)模型已被积极用于创建数字内容,但卡通风格的图像应该使用简化的纹理和清晰的轮廓来创建。此外,在将真实的图像转换为卡通风格时,在保持图像所表达的内容的同时,需要创造一个简单抽象的图像。我们构建适合制作这些网络漫画风格图像的训练数据,并分析两种GAN模型生成的图像是否可以用于网络漫画制作,并寻求利用生成式AI的方法。
{"title":"Generation and Analysis of Webtoon Background Images Using GAN","authors":"Je-Kyung Lee, Jeoung-Gi Kim, Jeong-In Ahn, Ji-Yeon Lim, Kyung-Ae Cha","doi":"10.9717/kmms.2023.26.8.1075","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.8.1075","url":null,"abstract":"In this paper, we propose a method to reduce the amount of manual work in webtoon creation and utilize creative contents derived from AI learning through deep learning-based technology that generates background images of various styles. To achieve this goal, we train CartoonGAN and AnimeGAN models that are specialized in creating images in the style of webtoons and animations, and create background images that can be used for webtoons. Recently, various Generative Adversarial Network (GAN) models have been actively used to create digital content, but cartoon-style images should be created with simplified textures and sharp outlines. In addition, when converting a real image into a cartoon style, it is necessary to create a simple and abstract image while maintaining the content expressed by the image. We build training data suitable for the production of these webtoon-style images, and analyze whether the images generated by the two GAN models can be used for webtoon production, and seek ways to utilize generative AI.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135991687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Analysis of Usability on Secondary Functions for the Hearing Impaired to Enjoy Metaverse Contents - Focusing on popular music concerts - 听觉障碍者欣赏虚拟世界内容的辅助功能可用性分析——以流行音乐会为例
Pub Date : 2023-08-31 DOI: 10.9717/kmms.2023.26.8.1064
Nam-Hee Kim, Soon-Bum Lim
Hearing-impaired people should be able to enjoy virtual reality content without discrimination. The development of metaverse has led to the success of virtual environment contents, including virtual concerts, but services for the deaf are still insufficient in this environment. In this paper, we present an auxiliary solution for hearing-impaired people to enjoy metaverse concerts and made video test samples for usability evaluation. To demonstrate that the solutions presented in this study help hearing-impaired people enjoy metaverse concerts, we made video test materials that provide secondary functions such as motion captions and haptic vibration. Test materials that provide basic subtitles were compared with test materials that applied motion captions, haptic vibrations, and both. To this end, an effectiveness evaluation and satisfaction evaluation using the Likert scale were conducted, and then a subjective satisfaction-oriented interview was conducted. Providing motion captions and haptic vibration together resulted in a relatively high usability evaluation effectiveness score, and specific requirements, problems, and improvements were identified through user interviews. If actual programs are developed and distributed based on this study, the quality of cultural life can be improved in the virtual environment of the hearing-impaired people.
听障人士应该能够不受歧视地享受虚拟现实内容。随着metaverse的发展,包括虚拟音乐会在内的虚拟环境内容获得了成功,但在这种环境下,对聋人的服务仍然不足。本文提出了一种帮助听障人士欣赏虚拟音乐会的辅助解决方案,并制作了视频测试样本进行可用性评估。为了证明本研究提出的解决方案可以帮助听障人士欣赏虚拟音乐会,我们制作了视频测试材料,提供运动字幕和触觉振动等次要功能。提供基本字幕的测试材料与应用运动字幕、触觉振动和两者兼而有之的测试材料进行了比较。为此,采用李克特量表进行有效性评价和满意度评价,然后进行主观满意度访谈。同时提供运动字幕和触觉振动导致了相对较高的可用性评估有效性得分,并通过用户访谈确定了具体的需求、问题和改进。如果在本研究的基础上开发和发布实际的节目,可以提高听障人群在虚拟环境中的文化生活质量。
{"title":"An Analysis of Usability on Secondary Functions for the Hearing Impaired to Enjoy Metaverse Contents - Focusing on popular music concerts -","authors":"Nam-Hee Kim, Soon-Bum Lim","doi":"10.9717/kmms.2023.26.8.1064","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.8.1064","url":null,"abstract":"Hearing-impaired people should be able to enjoy virtual reality content without discrimination. The development of metaverse has led to the success of virtual environment contents, including virtual concerts, but services for the deaf are still insufficient in this environment. In this paper, we present an auxiliary solution for hearing-impaired people to enjoy metaverse concerts and made video test samples for usability evaluation. To demonstrate that the solutions presented in this study help hearing-impaired people enjoy metaverse concerts, we made video test materials that provide secondary functions such as motion captions and haptic vibration. Test materials that provide basic subtitles were compared with test materials that applied motion captions, haptic vibrations, and both. To this end, an effectiveness evaluation and satisfaction evaluation using the Likert scale were conducted, and then a subjective satisfaction-oriented interview was conducted. Providing motion captions and haptic vibration together resulted in a relatively high usability evaluation effectiveness score, and specific requirements, problems, and improvements were identified through user interviews. If actual programs are developed and distributed based on this study, the quality of cultural life can be improved in the virtual environment of the hearing-impaired people.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135991525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning Network Algorithm Based on X-transfer Learning for Micro Object Detection 基于x -迁移学习的微目标检测深度学习网络算法
Pub Date : 2023-08-31 DOI: 10.9717/kmms.2023.26.8.925
Oh-Seol Kwon
In this paper, a low-resolution object detection algorithm was proposed based on X-transfer learning on GAN model. The proposed method is effective in improving detection of micro objects by optimizing with GAN network for super-resolution and an object recognition network. In addition, the proposed X-transfer learning technique alternately uses transfer learning and curriculum learning to overcome the lack of training data. This method can improve the accuracy, robustness, and localization performance of object recognition based on rich visual information on entire network. The proposed model was evaluated with remote sensing data sets. It was confirmed that the proposed method is more accurate than existing methods in terms of mAP@0.5 and F1 scores.
本文提出了一种基于GAN模型x -迁移学习的低分辨率目标检测算法。该方法通过对GAN超分辨率网络和目标识别网络进行优化,有效地提高了微目标的检测效果。此外,本文提出的x迁移学习技术交替使用迁移学习和课程学习来克服训练数据缺乏的问题。该方法可以提高基于全网丰富视觉信息的目标识别的准确性、鲁棒性和定位性能。利用遥感数据集对该模型进行了评价。结果表明,本文方法在mAP@0.5和F1得分方面均优于现有方法。
{"title":"Deep Learning Network Algorithm Based on X-transfer Learning for Micro Object Detection","authors":"Oh-Seol Kwon","doi":"10.9717/kmms.2023.26.8.925","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.8.925","url":null,"abstract":"In this paper, a low-resolution object detection algorithm was proposed based on X-transfer learning on GAN model. The proposed method is effective in improving detection of micro objects by optimizing with GAN network for super-resolution and an object recognition network. In addition, the proposed X-transfer learning technique alternately uses transfer learning and curriculum learning to overcome the lack of training data. This method can improve the accuracy, robustness, and localization performance of object recognition based on rich visual information on entire network. The proposed model was evaluated with remote sensing data sets. It was confirmed that the proposed method is more accurate than existing methods in terms of mAP@0.5 and F1 scores.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"148 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135991530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning Ar chitectur e for Choice-based Recommendation System: A Case Study of Flight Sear ch Engine 基于选择的推荐系统的深度学习架构——以航班搜索引擎为例
Pub Date : 2023-08-31 DOI: 10.9717/kmms.2023.26.8.1027
Hamdi Abdurhman Ahmed, Jihwan Lee, Donghyun Kim, ByeongSeok Yu
First, we propose a class of efficient models classed as choice-based recommendation (CBR) for parametric metrics, such as a logit model as a recommendation system using nonparametric approaches. The rest of the papers is organized as follow : we used a simple, streamlined architecture that uses a nonparametric approach such as a feedforward deep neural network (DNN). The study implemented a method to deal with a choice set with a fixed and variable-length option, investigate deep learning methods that consider each choice set as one sample point, the effect of embedding categorical features and accuracy impact, and the efficiency of batch normalization toward a more stable network. To check the performance of our approach, we conducted extensive experiments on multiple datasets and used the top-k accuracy as a metric. We then show the effectiveness of CBR across two industrial applications and use cases, including hotel booking and airline itineraries. The results show that the DNN outperforms the multinomial logit model (MNL) with significant top-k accuracy. The top-k accuracy was further divided into three different DNN models. Among the models, a model that included a layer with batch normalization embedding outperforms with top-k accuracy compared with the model that does not include both batch normalization and embedding layer in the proposed DNN architecture.
首先,我们针对参数度量提出了一类高效的基于选择的推荐(CBR)模型,例如使用非参数方法的logit模型作为推荐系统。其余的论文组织如下:我们使用了一个简单的流线型架构,使用非参数方法,如前馈深度神经网络(DNN)。本研究实现了一种处理具有固定和可变长度选项的选择集的方法,研究了将每个选择集视为一个样本点的深度学习方法,嵌入分类特征的效果和准确性影响,以及批归一化的效率,以实现更稳定的网络。为了检查我们的方法的性能,我们在多个数据集上进行了广泛的实验,并使用top-k精度作为度量。然后,我们展示了CBR在两个工业应用程序和用例中的有效性,包括酒店预订和航空公司行程。结果表明,DNN在top-k精度上优于多项logit模型(MNL)。top-k精度进一步划分为三种不同的DNN模型。在这些模型中,包含批归一化嵌入层的模型在top-k精度上优于不包含批归一化和嵌入层的模型。
{"title":"Deep Learning Ar chitectur e for Choice-based Recommendation System: A Case Study of Flight Sear ch Engine","authors":"Hamdi Abdurhman Ahmed, Jihwan Lee, Donghyun Kim, ByeongSeok Yu","doi":"10.9717/kmms.2023.26.8.1027","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.8.1027","url":null,"abstract":"First, we propose a class of efficient models classed as choice-based recommendation (CBR) for parametric metrics, such as a logit model as a recommendation system using nonparametric approaches. The rest of the papers is organized as follow : we used a simple, streamlined architecture that uses a nonparametric approach such as a feedforward deep neural network (DNN). The study implemented a method to deal with a choice set with a fixed and variable-length option, investigate deep learning methods that consider each choice set as one sample point, the effect of embedding categorical features and accuracy impact, and the efficiency of batch normalization toward a more stable network. To check the performance of our approach, we conducted extensive experiments on multiple datasets and used the top-k accuracy as a metric. We then show the effectiveness of CBR across two industrial applications and use cases, including hotel booking and airline itineraries. The results show that the DNN outperforms the multinomial logit model (MNL) with significant top-k accuracy. The top-k accuracy was further divided into three different DNN models. Among the models, a model that included a layer with batch normalization embedding outperforms with top-k accuracy compared with the model that does not include both batch normalization and embedding layer in the proposed DNN architecture.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135991685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Korea Multimedia Society
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1