首页 > 最新文献

Journal of Korea Multimedia Society最新文献

英文 中文
Modeling and Simulation of Periodic Review Inventory Policy in the Supply Chain Using DEVS 基于DEVS的供应链定期评审库存策略建模与仿真
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1288
Young-Dan Noh, Bo-Seung Kwon, Sang-Won Jung, Young-Shin Han, Jong Sik Lee
The enterprise seeks profit and aims to maximize it. Inventory costs are among the various costs that enterprise can incur. And this inventory cost refers to various costs incurred due to inventory. Inventory management aims to reduce inventory costs while satisfying customers
企业追求利润,以利润最大化为目标。存货成本是企业可能产生的各种成本之一。而存货成本是指因存货而产生的各种成本。库存管理的目的是在满足客户需求的同时降低库存成本
{"title":"Modeling and Simulation of Periodic Review Inventory Policy in the Supply Chain Using DEVS","authors":"Young-Dan Noh, Bo-Seung Kwon, Sang-Won Jung, Young-Shin Han, Jong Sik Lee","doi":"10.9717/kmms.2023.26.10.1288","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1288","url":null,"abstract":"The enterprise seeks profit and aims to maximize it. Inventory costs are among the various costs that enterprise can incur. And this inventory cost refers to various costs incurred due to inventory. Inventory management aims to reduce inventory costs while satisfying customers","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"6 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135978357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Zero-Shot Cell Image Super-Resolution 零拍摄单元图像超分辨率
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1261
Jeonghyun Noh, Jinsun Park
The shape of a cell is an important factor in cell examinations that diagnose cancer or certain disease, however, due to the limitations and nature of the microscope, low-resolution (LR) cell images can be obtained. LR images have limitations in analyzing the phenotype or morphological characteristics of cells. Therefore, they need to be restored to high-resolution (HR) images. In this paper, we propose a zero-shot super-resolution (ZSSR) algorithm to reconstruct cell shape information. In specific, a high-frequency filtering module (HFM) is adopted to calculate the difference between HR and LR by extracting various information such as the edge and corners of cells which are high-frequency information in an image. In addition, channel attention blocks (CAB) that suppress and emphasize feature information are used for SR without being confused with similar cell shapes in an image. It also improves the generalization performance of the network by sharing the network’s parameters. As a result, PSNR is improved by 0.04dB compared to that of the previous ZSSR. The source code will be made available at : https://github.com/JJeong-Gari/Cell-ZSSR/
细胞的形状是诊断癌症或某些疾病的细胞检查中的一个重要因素,然而,由于显微镜的局限性和性质,可以获得低分辨率(LR)细胞图像。LR图像在分析细胞表型或形态特征方面有局限性。因此,它们需要恢复为高分辨率(HR)图像。在本文中,我们提出了一种零镜头超分辨率(ZSSR)算法来重建细胞形状信息。其中,采用高频滤波模块(HFM),通过提取图像中作为高频信息的细胞的边缘、角落等各种信息来计算HR和LR的差值。此外,抑制和强调特征信息的通道注意块(CAB)用于SR,而不会与图像中相似的细胞形状混淆。通过共享网络参数,提高了网络的泛化性能。结果表明,与之前的ZSSR相比,PSNR提高了0.04dB。源代码将在https://github.com/JJeong-Gari/Cell-ZSSR/上提供
{"title":"Zero-Shot Cell Image Super-Resolution","authors":"Jeonghyun Noh, Jinsun Park","doi":"10.9717/kmms.2023.26.10.1261","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1261","url":null,"abstract":"The shape of a cell is an important factor in cell examinations that diagnose cancer or certain disease, however, due to the limitations and nature of the microscope, low-resolution (LR) cell images can be obtained. LR images have limitations in analyzing the phenotype or morphological characteristics of cells. Therefore, they need to be restored to high-resolution (HR) images. In this paper, we propose a zero-shot super-resolution (ZSSR) algorithm to reconstruct cell shape information. In specific, a high-frequency filtering module (HFM) is adopted to calculate the difference between HR and LR by extracting various information such as the edge and corners of cells which are high-frequency information in an image. In addition, channel attention blocks (CAB) that suppress and emphasize feature information are used for SR without being confused with similar cell shapes in an image. It also improves the generalization performance of the network by sharing the network’s parameters. As a result, PSNR is improved by 0.04dB compared to that of the previous ZSSR. The source code will be made available at : https://github.com/JJeong-Gari/Cell-ZSSR/","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"150 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135979350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Implementation of LoRA-Based College Entrance Examination and Related Information System 基于lora的高考及相关信息系统的设计与实现
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1353
Sungwook Yoon
This research primarily focused on the development of an LLM response system tailored for university information, leveraging the capabilities and efficiencies of the LoRA technique. LoRA presents a methodology for efficiently fine-tuning large language models for specific tasks, and its effectiveness and efficiency were substantiated through this study. Consequently, a high-accuracy university information response system was established even under constrained resources. Especially with the utilization of LoRA
本研究主要集中于开发一个专为大学信息定制的法学硕士响应系统,利用LoRA技术的能力和效率。LoRA提出了一种针对特定任务高效微调大型语言模型的方法,并通过本研究证实了其有效性和效率。从而在资源有限的情况下,建立了高精度的高校信息响应系统。尤其是LoRA的使用
{"title":"Design and Implementation of LoRA-Based College Entrance Examination and Related Information System","authors":"Sungwook Yoon","doi":"10.9717/kmms.2023.26.10.1353","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1353","url":null,"abstract":"This research primarily focused on the development of an LLM response system tailored for university information, leveraging the capabilities and efficiencies of the LoRA technique. LoRA presents a methodology for efficiently fine-tuning large language models for specific tasks, and its effectiveness and efficiency were substantiated through this study. Consequently, a high-accuracy university information response system was established even under constrained resources. Especially with the utilization of LoRA","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135978358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Application of Mapping Model for Emotion-Based Font Recommendation System 基于情感的字体推荐系统映射模型的设计与应用
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1303
YoungSeo Ji, DongWhan Kim, JaeHong Park, Soon-Bum Lim
Font usage is effective in accentuating meaning and establishing the overall tone of a message. Nevertheless, the process of selecting an appropriate font can be burdensome for users as it necessitates examining all available fonts. Furthermore, users with limited font usage experience might inadvertently choose an inappropriate font. To tackle this concern, we developed a system that recommends fonts by evaluating similarity between font keyword values and emotions extracted from content through deep learning emotion analysis. Considering the disparity in criteria utilized for classifying content emotions and font keywords, the necessity arose for a mapping model to evaluate the similarity between these two sets of criteria. Accordingly we designed our mapping model constructed based on the PAD model, a framework that represents emotions along three axes on a coordinate plane. We formulated two distinct methods to assess similarity: the first converts content and font characteristics into a single PAD value, subsequently discerning the distance; The second method analyzes the Pearson correlation coefficient between the criteria for emotional classification to determine the similarity. A comparative evaluation was conducted between these two methods. The results of the evaluation affirmed that the model reflecting the correlation coefficient yielded greater efficacy. As a result, we opted for this mapping model as the approach for calculating similarity between content and font.
字体的使用在强调意义和建立信息的整体基调方面是有效的。然而,选择合适字体的过程可能会给用户带来负担,因为它需要检查所有可用的字体。此外,字体使用经验有限的用户可能会无意中选择不合适的字体。为了解决这个问题,我们开发了一个系统,通过评估字体关键字值与通过深度学习情感分析从内容中提取的情感之间的相似性来推荐字体。考虑到用于分类内容情感和字体关键字的标准的差异,有必要建立一个映射模型来评估这两组标准之间的相似性。因此,我们设计了基于PAD模型构建的映射模型,该模型是一个在坐标平面上沿三个轴表示情感的框架。我们制定了两种不同的方法来评估相似性:第一种是将内容和字体特征转换为单个PAD值,然后识别距离;第二种方法通过分析情感分类标准之间的Pearson相关系数来确定相似性。对两种方法进行了比较评价。评价结果证实,反映相关系数的模型效果更好。因此,我们选择这个映射模型作为计算内容和字体之间相似性的方法。
{"title":"Design and Application of Mapping Model for Emotion-Based Font Recommendation System","authors":"YoungSeo Ji, DongWhan Kim, JaeHong Park, Soon-Bum Lim","doi":"10.9717/kmms.2023.26.10.1303","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1303","url":null,"abstract":"Font usage is effective in accentuating meaning and establishing the overall tone of a message. Nevertheless, the process of selecting an appropriate font can be burdensome for users as it necessitates examining all available fonts. Furthermore, users with limited font usage experience might inadvertently choose an inappropriate font. To tackle this concern, we developed a system that recommends fonts by evaluating similarity between font keyword values and emotions extracted from content through deep learning emotion analysis. Considering the disparity in criteria utilized for classifying content emotions and font keywords, the necessity arose for a mapping model to evaluate the similarity between these two sets of criteria. Accordingly we designed our mapping model constructed based on the PAD model, a framework that represents emotions along three axes on a coordinate plane. We formulated two distinct methods to assess similarity: the first converts content and font characteristics into a single PAD value, subsequently discerning the distance; The second method analyzes the Pearson correlation coefficient between the criteria for emotional classification to determine the similarity. A comparative evaluation was conducted between these two methods. The results of the evaluation affirmed that the model reflecting the correlation coefficient yielded greater efficacy. As a result, we opted for this mapping model as the approach for calculating similarity between content and font.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135979492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Study on the Nostalgic Cinematic Storytelling Characteristics of <Top Gun: Maverick> - Focusing on Intertextuality 《壮志凌云:独行侠》怀旧电影叙事特征研究——注重互文性
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1344
Dongha Shim, Wonsik Jung
This study examines the nostalgic cinematic storytelling characteristics of Top Gun: Maverick, which evokes nostalgia, especially focusing on the expression and use of intertextuality. To this end, we compare and analyze Top Gun: Maverick and Top Gun in terms of story structure and character characteristics, which are key elements of storytelling. As a result of the study, the story structure of Top Gun: Maverick is very similar to the preceding text Top Gun, and based on this, strong intertextuality is revealed. In addition, Top Gun: Maverick enhances intertextuality through the three-dimensional use of characters from the previous work and the nostalgic cinematic transformation of the characters
本研究考察了《壮志凌云:特立独行》的怀旧电影叙事特点,它唤起了怀旧情绪,尤其关注互文性的表达和使用。为此,我们对《壮志凌云:独行侠》和《壮志凌云》的故事结构和人物特征进行了比较分析,这是叙事的关键要素。研究结果表明,《壮志凌云:特立独行》的故事结构与前面的文本《壮志凌云》非常相似,并在此基础上显示出强烈的互文性。此外,《壮志凌云:独行侠》通过对前作人物的立体运用和对人物的怀旧电影化改造,增强了影片的互文性
{"title":"A Study on the Nostalgic Cinematic Storytelling Characteristics of &lt;Top Gun: Maverick&gt; - Focusing on Intertextuality","authors":"Dongha Shim, Wonsik Jung","doi":"10.9717/kmms.2023.26.10.1344","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1344","url":null,"abstract":"This study examines the nostalgic cinematic storytelling characteristics of Top Gun: Maverick, which evokes nostalgia, especially focusing on the expression and use of intertextuality. To this end, we compare and analyze Top Gun: Maverick and Top Gun in terms of story structure and character characteristics, which are key elements of storytelling. As a result of the study, the story structure of Top Gun: Maverick is very similar to the preceding text Top Gun, and based on this, strong intertextuality is revealed. In addition, Top Gun: Maverick enhances intertextuality through the three-dimensional use of characters from the previous work and the nostalgic cinematic transformation of the characters","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"46 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135978342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Class Activation Map based Random Erasing for Data Augmentation 基于类激活映射的随机擦除数据增强
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1231
Juhyeon Oh, Kyujoong Lee
Random erasing offers various levels of occlusion for data augmentation. However, due to its uniform distribution of random selection, it sometimes occludes regions that are unrelated to the object of interest. In this paper, we propose a novel method that utilizes Gradient Weighted Class Activation Mapping (Grad-CAM) for estimating the location of the object of interest and selectively erasing the surrounding areas. By utilizing Grad-CAM, we improve random erasing for CNN models without requiring additional modules or architectural changes. We generate Grad-CAM after the intermediate epochs where CNN models have sufficient representational power for the training data. The hyperparameter that restrict the erasing to the vicinity of the object is set based on Grad-CAM, and experiments were conducted accordingly. As a result of our experiments, we observed a 0.33% decrease in error-rate for image classification tasks using ResNet-20 on the CIFAR-10 dataset.
随机擦除为数据增强提供了不同级别的遮挡。然而,由于其随机选择的均匀分布,有时会遮挡与感兴趣对象无关的区域。在本文中,我们提出了一种利用梯度加权类激活映射(Gradient Weighted Class Activation Mapping, Grad-CAM)来估计感兴趣对象的位置并选择性地擦除周围区域的新方法。通过使用Grad-CAM,我们改进了CNN模型的随机擦除,而不需要额外的模块或架构更改。我们在中间时代之后生成Grad-CAM,其中CNN模型对训练数据具有足够的表征能力。基于Grad-CAM设置了将擦除限制在目标附近的超参数,并进行了相应的实验。通过实验,我们发现在CIFAR-10数据集上使用ResNet-20进行图像分类任务的错误率降低了0.33%。
{"title":"Class Activation Map based Random Erasing for Data Augmentation","authors":"Juhyeon Oh, Kyujoong Lee","doi":"10.9717/kmms.2023.26.10.1231","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1231","url":null,"abstract":"Random erasing offers various levels of occlusion for data augmentation. However, due to its uniform distribution of random selection, it sometimes occludes regions that are unrelated to the object of interest. In this paper, we propose a novel method that utilizes Gradient Weighted Class Activation Mapping (Grad-CAM) for estimating the location of the object of interest and selectively erasing the surrounding areas. By utilizing Grad-CAM, we improve random erasing for CNN models without requiring additional modules or architectural changes. We generate Grad-CAM after the intermediate epochs where CNN models have sufficient representational power for the training data. The hyperparameter that restrict the erasing to the vicinity of the object is set based on Grad-CAM, and experiments were conducted accordingly. As a result of our experiments, we observed a 0.33% decrease in error-rate for image classification tasks using ResNet-20 on the CIFAR-10 dataset.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"27 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135978570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Study on the Effect of Audience Experience on Audience Satisfaction with Interactive Movies 观众体验对互动电影观众满意度的影响研究
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1321
JiHeun Kong, JiaJun Xu, CheeYong Kim
Interactive movies are multimedia content characterized by the addition of nonlinear narratives to the traditional narrative structure of movies and the interaction mechanism of video games. The nonlinear structure of interactive movies enables a unique viewing experience by allowing the audience to participate in the narrative and the audience to compose their own plot. In this study, we tried to find out the effect of audience experience on audience satisfaction for interactive movies. The audience experience was divided into Flow Experience, Emotional Experience, Relational Experience, and Marketing Communication Experience, respectively, and a total of five variables, including Audience Satisaction, were defined, and the correlation between each variable was proved. Reliability, validity, and hypothesis were verified using SPSS 26.0 and AMOS 24.0 based on a total of 272 questionnaires. Studies have shown that Flow Experience has a significant positive effect on customers
互动电影是在传统电影叙事结构和电子游戏互动机制的基础上加入非线性叙事的多媒体内容。互动电影的非线性结构使观众能够参与到叙事中来,形成自己的情节,从而获得独特的观影体验。在本研究中,我们试图找出观众体验对互动电影观众满意度的影响。将受众体验分为Flow experience、Emotional experience、Relational experience和Marketing Communication experience,并定义了包括audience satisfaction在内的共5个变量,并证明了各变量之间的相关性。采用SPSS 26.0和AMOS 24.0对272份问卷进行信度、效度和假设检验。研究表明,心流体验对顾客有显著的积极影响
{"title":"A Study on the Effect of Audience Experience on Audience Satisfaction with Interactive Movies","authors":"JiHeun Kong, JiaJun Xu, CheeYong Kim","doi":"10.9717/kmms.2023.26.10.1321","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1321","url":null,"abstract":"Interactive movies are multimedia content characterized by the addition of nonlinear narratives to the traditional narrative structure of movies and the interaction mechanism of video games. The nonlinear structure of interactive movies enables a unique viewing experience by allowing the audience to participate in the narrative and the audience to compose their own plot. In this study, we tried to find out the effect of audience experience on audience satisfaction for interactive movies. The audience experience was divided into Flow Experience, Emotional Experience, Relational Experience, and Marketing Communication Experience, respectively, and a total of five variables, including Audience Satisaction, were defined, and the correlation between each variable was proved. Reliability, validity, and hypothesis were verified using SPSS 26.0 and AMOS 24.0 based on a total of 272 questionnaires. Studies have shown that Flow Experience has a significant positive effect on customers","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"25 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135979496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proposal of an Advanced Structure of YOLOX for Hornet Detection Accuracy Improvement 一种提高大黄蜂探测精度的YOLOX先进结构的提出
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1238
Yeongjae Kwon, Cheolhee Lee
In this paper, an advanced backbone structure for YOLOX is proposed to obtain better detection accuracy in small object detection such as hornet by replacing CSPLayer with ShuffleLayer. By this replacement, numbers of convolution operation are reduced in each layer of the backbone. This can conserve spatial information of small objects in each layer and through layers in backbone, reducing processing time. In order to evaluate the proposed method, four types of experiments were executed such as mAP comparison for our hornet dataset, another mAP comparison for the standard dataset VEDAI dedicated small objects, generalization test for RTMDet, and detection speed between the default YOLOX model and the proposed YOLOX model. As a result, the first mAP under 50% IoU condition for the hornet dataset showed 86.21% and 87.35% for the default and the proposed, respectively. The experiment, mAP test for the standard VEDAI, represented 47% and 41.7% for each model and also showed better accuracy by 5.3%. In the generalization test with RTMDet, the proposed model showed similar or higher accuracy according to IoU. In addition, in terms of speed the proposed ShuffleLayerbased backbone was faster than the default by 1.35 times due to reduced convolution parameters. Thus, experiments above verified that the proposed backbone structure for YOLOX can be effectively utilized to enhance accuracy and inference speed in real-time detection for small objects.
本文提出了一种改进的YOLOX骨干结构,用ShuffleLayer代替CSPLayer,在黄蜂等小目标检测中获得更好的检测精度。通过这种替换,减少了主干网每层的卷积操作次数。这样可以在每一层和骨干层之间保存小目标的空间信息,减少处理时间。为了评估所提出的方法,我们进行了四种类型的实验,包括对我们的大黄蜂数据集的mAP比较,对标准数据集VEDAI专用小目标的mAP比较,RTMDet的泛化测试,以及默认YOLOX模型和所提出的YOLOX模型的检测速度。结果表明,在50% IoU条件下,hornet数据集的首个mAP值分别为默认值86.21%和建议值87.35%。实验中,mAP测试对标准VEDAI的准确率分别为47%和41.7%,准确率也提高了5.3%。在RTMDet泛化检验中,所提出的模型在IoU上具有相似或更高的精度。此外,在速度方面,由于减少了卷积参数,所提出的基于shufflelayer的骨干比默认的快1.35倍。因此,上述实验验证了所提出的YOLOX骨干结构可以有效地提高小目标实时检测的精度和推理速度。
{"title":"Proposal of an Advanced Structure of YOLOX for Hornet Detection Accuracy Improvement","authors":"Yeongjae Kwon, Cheolhee Lee","doi":"10.9717/kmms.2023.26.10.1238","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1238","url":null,"abstract":"In this paper, an advanced backbone structure for YOLOX is proposed to obtain better detection accuracy in small object detection such as hornet by replacing CSPLayer with ShuffleLayer. By this replacement, numbers of convolution operation are reduced in each layer of the backbone. This can conserve spatial information of small objects in each layer and through layers in backbone, reducing processing time. In order to evaluate the proposed method, four types of experiments were executed such as mAP comparison for our hornet dataset, another mAP comparison for the standard dataset VEDAI dedicated small objects, generalization test for RTMDet, and detection speed between the default YOLOX model and the proposed YOLOX model. As a result, the first mAP under 50% IoU condition for the hornet dataset showed 86.21% and 87.35% for the default and the proposed, respectively. The experiment, mAP test for the standard VEDAI, represented 47% and 41.7% for each model and also showed better accuracy by 5.3%. In the generalization test with RTMDet, the proposed model showed similar or higher accuracy according to IoU. In addition, in terms of speed the proposed ShuffleLayerbased backbone was faster than the default by 1.35 times due to reduced convolution parameters. Thus, experiments above verified that the proposed backbone structure for YOLOX can be effectively utilized to enhance accuracy and inference speed in real-time detection for small objects.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"9 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135978356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reward and Sense of Belonging, A study of Transmedia Contents by Expectancy Theory of Motivation - Focused on the Movie ‘Along with the Gods’ 奖励与归属感:基于动机期望理论的跨媒体内容研究——以电影《与神同行》为例
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1333
Jeehae Park, Wonsik Jung
‘Along with the Gods’ can be said to be Korea’s representative transmedia franchise. This paper aims to study the statistical significance between ‘Along with God’ and Bloom’s Expectancy Theory and derive meaningful implications. For this purpose, a survey was conducted on 300 people in their teens to 30s and analysis was performed based on this. Audiences who use transmedia are considered not only participatory users but also producers or members of transmedia. Assuming that the audience is a member of transmedia, they gain expectations about the world view through the film ‘Along with God’. Additionally, they believe that they can sustain their worldview desires through secondary creative content. Furthermore, they gain a sense of belonging as a reward through user-created content.
《与众神同行》可以说是韩国最具代表性的跨媒体作品。本文旨在研究“与上帝同行”理论与布鲁姆期望理论之间的统计显著性,并得出有意义的启示。为此,对300多名10 ~ 30多岁的人进行了调查,并以此为基础进行了分析。使用跨媒体的受众不仅被认为是参与用户,也是跨媒体的生产者或成员。假设观众是跨媒体的一员,他们通过电影《与上帝同行》获得对世界观的期望。此外,他们相信他们可以通过次要的创造性内容来维持他们的世界观欲望。此外,他们还通过用户创造的内容获得归属感。
{"title":"Reward and Sense of Belonging, A study of Transmedia Contents by Expectancy Theory of Motivation - Focused on the Movie ‘Along with the Gods’","authors":"Jeehae Park, Wonsik Jung","doi":"10.9717/kmms.2023.26.10.1333","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1333","url":null,"abstract":"‘Along with the Gods’ can be said to be Korea’s representative transmedia franchise. This paper aims to study the statistical significance between ‘Along with God’ and Bloom’s Expectancy Theory and derive meaningful implications. For this purpose, a survey was conducted on 300 people in their teens to 30s and analysis was performed based on this. Audiences who use transmedia are considered not only participatory users but also producers or members of transmedia. Assuming that the audience is a member of transmedia, they gain expectations about the world view through the film ‘Along with God’. Additionally, they believe that they can sustain their worldview desires through secondary creative content. Furthermore, they gain a sense of belonging as a reward through user-created content.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"46 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135978355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic Super-Resolution Using a Transformer Model 使用变压器模型的语义超分辨率
Pub Date : 2023-10-31 DOI: 10.9717/kmms.2023.26.10.1251
Donghyun Ku, Hanhoon Park
This paper proposes an effective method to improve the performance of SwinIR, a vision Transformer-based super-resolution neural network model, by introducing a Transformer decoder with learnable category queries. The decoder allows to extract semantic information of each dataset belonging to different categories (e.g., text and face); the semantic information can improve category-specific texture reconstruction in the process of super-resolution. Experiments were conducted using decoders of different architectures to analyze the performance of the proposed method. The experimental results confirm that the use of decoder can improve the quality of super-resolution images produced by SwinIR qualitatively and quantitatively, although improvements may vary depending on the depth of the decoder and how semantic information is applied.
本文提出了一种有效的方法,通过引入具有可学习类别查询的Transformer解码器来提高基于视觉Transformer的超分辨率神经网络模型SwinIR的性能。解码器允许提取属于不同类别(如文本和人脸)的每个数据集的语义信息;在超分辨率过程中,语义信息可以改善分类纹理的重建。利用不同结构的解码器进行了实验,分析了所提方法的性能。实验结果证实,解码器的使用可以在定性和定量上提高SwinIR产生的超分辨率图像的质量,尽管改进可能取决于解码器的深度和语义信息的应用方式。
{"title":"Semantic Super-Resolution Using a Transformer Model","authors":"Donghyun Ku, Hanhoon Park","doi":"10.9717/kmms.2023.26.10.1251","DOIUrl":"https://doi.org/10.9717/kmms.2023.26.10.1251","url":null,"abstract":"This paper proposes an effective method to improve the performance of SwinIR, a vision Transformer-based super-resolution neural network model, by introducing a Transformer decoder with learnable category queries. The decoder allows to extract semantic information of each dataset belonging to different categories (e.g., text and face); the semantic information can improve category-specific texture reconstruction in the process of super-resolution. Experiments were conducted using decoders of different architectures to analyze the performance of the proposed method. The experimental results confirm that the use of decoder can improve the quality of super-resolution images produced by SwinIR qualitatively and quantitatively, although improvements may vary depending on the depth of the decoder and how semantic information is applied.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135978360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Korea Multimedia Society
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1