Signal & Image Processing Trends最新文献

英文中文

LaBelle: A Deep Learning APP that Helps You Learn Ballet LaBelle:一个深度学习应用程序，帮助你学习芭蕾

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121714

Sarah Fan, Kevin Guo, Yu Sun

Human Pose Estimation has proven versatility in improving real-world applications in healthcare, sports, etc. [1]. Proper stance, form and movement is instrumental to succeeding in these activities. This paper will explain the research process behind the deep learning mobile ballet app, LaBelle [2]. LaBelle takes in two short videos: one of a teacher, and one of a student. Utilizing MediaPipe Pose to identify, analyze, and store data about the poses and movements of both dancers, the app calculates the angles created between different joints and major body parts. The app’s AI Model uses a K-means clustering algorithm to create a group of clusters for both the student dataset and the teacher dataset [3]. Using the two sets of clusters, LaBelle identifies the key frames in the student-video and searches the teacher cluster set for a matching set of properties and frames. It evaluates the differences between the paired frames and produces a final score as well as feedback on the poses that need improving. We propose an unsupervised guided-learning approach with improved efficiency in video comparison, which is usually both time and resource consuming. This efficient model can be used not just in dance, but athletics and medicine (physical therapy like activities) as well, where stance, form, and movements are often hard to track with the naked eye.

人体姿势估计已被证明在改善医疗保健，体育等现实世界应用中的多功能性[1]。正确的站位、姿势和动作是这些活动成功的关键。本文将解释深度学习移动芭蕾应用LaBelle[2]背后的研究过程。LaBelle拍摄了两个短视频:一个是老师的，一个是学生的。该应用程序利用MediaPipe Pose来识别、分析和存储两位舞者的姿势和动作数据，计算不同关节和主要身体部位之间产生的角度。该应用的AI模型使用K-means聚类算法为学生数据集和教师数据集创建一组聚类[3]。使用这两组聚类，LaBelle识别学生视频中的关键帧，并在教师聚类集中搜索匹配的属性和帧集。它会评估配对帧之间的差异，并产生最终分数以及需要改进的姿势的反馈。我们提出了一种无监督的引导学习方法，提高了视频比较的效率，这种方法通常既费时又耗资源。这种有效的模型不仅可以用于舞蹈，也可以用于体育和医学(物理治疗之类的活动)，在这些领域，姿势、形式和动作通常很难用肉眼追踪。

{"title":"LaBelle: A Deep Learning APP that Helps You Learn Ballet","authors":"Sarah Fan, Kevin Guo, Yu Sun","doi":"10.5121/csit.2022.121714","DOIUrl":"https://doi.org/10.5121/csit.2022.121714","url":null,"abstract":"Human Pose Estimation has proven versatility in improving real-world applications in healthcare, sports, etc. [1]. Proper stance, form and movement is instrumental to succeeding in these activities. This paper will explain the research process behind the deep learning mobile ballet app, LaBelle [2]. LaBelle takes in two short videos: one of a teacher, and one of a student. Utilizing MediaPipe Pose to identify, analyze, and store data about the poses and movements of both dancers, the app calculates the angles created between different joints and major body parts. The app’s AI Model uses a K-means clustering algorithm to create a group of clusters for both the student dataset and the teacher dataset [3]. Using the two sets of clusters, LaBelle identifies the key frames in the student-video and searches the teacher cluster set for a matching set of properties and frames. It evaluates the differences between the paired frames and produces a final score as well as feedback on the poses that need improving. We propose an unsupervised guided-learning approach with improved efficiency in video comparison, which is usually both time and resource consuming. This efficient model can be used not just in dance, but athletics and medicine (physical therapy like activities) as well, where stance, form, and movements are often hard to track with the naked eye.","PeriodicalId":170432,"journal":{"name":"Signal & Image Processing Trends","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115693460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using Singular Value Decomposition in a Convolutional Neural Network to Improve Brain Tumor Segmentation Accuracy 利用卷积神经网络奇异值分解提高脑肿瘤分割精度

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121717

P. Ahadian, Maryam Babaei, K. Parand

A brain tumor consists of cells showing abnormal brain growth. The area of the brain tumor significantly affects choosing the type of treatment and following the course of the disease during the treatment. At the same time, pictures of Brain MRIs are accompanied by noise. Eliminating existing noises can significantly impact the better segmentation and diagnosis of brain tumors. In this work, we have tried using the analysis of eigenvalues. We have used the MSVD algorithm, reducing the image noise and then using the deep neural network to segment the tumor in the images. The proposed method's accuracy was increased by 2.4% compared to using the original images. With Using the MSVD method, convergence speed has also increased, showing the proposed method's effectiveness.

脑瘤由表现出大脑异常生长的细胞组成。脑肿瘤的面积显著影响治疗类型的选择和治疗期间的病程。与此同时，脑核磁共振成像的图像伴随着噪音。消除已有噪声对更好地分割和诊断脑肿瘤有重要影响。在这项工作中，我们尝试使用特征值分析。我们使用MSVD算法，降低图像噪声，然后使用深度神经网络对图像中的肿瘤进行分割。与使用原始图像相比，该方法的准确率提高了2.4%。使用MSVD方法后，收敛速度也有所提高，表明了该方法的有效性。

引用次数: 0

An Intelligent System to Automate the Detection of Online Cheating Activities using AI and Context Aware Techniques 一个使用人工智能和上下文感知技术自动检测在线作弊活动的智能系统

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121707

Qinyuhan Zhao, Mingze Gao, Yu Sun

In the environment of online courses and online exams, cheating in online courses is prevalent [1]. To better ensure fairness in exams, schools and educational institutions need to use technology to detect and deter cheating [2]. Starting from practical application, this paper discusses 3 different methods to detect cheating behavior, and proposes a new way. for online exam supervision.

在网络课程和网络考试的环境下，网络课程作弊现象十分普遍[1]。为了更好地确保考试的公平性，学校和教育机构需要使用技术来检测和阻止作弊行为[2]。本文从实际应用出发，讨论了三种不同的检测作弊行为的方法，并提出了一种新的方法。用于在线考试监督。

引用次数: 0

ComputerBank: A Community-based Computer Donation Platform using Machine Learning and NFT 计算机银行:基于社区的计算机捐赠平台，使用机器学习和NFT

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121704

Erick Gong, Yu Sun

Many people donate money to fund organizations, but very rarely do those donors have information about where those donations go. Donation platforms are both non-transparent and also leave a large portion of potential donors unnoticed: gamers [1]. This paper explores the concept of utilizing blockchain technology and its existence as a web3 token-based platform in order to provide transparency for donation routes, showing donors and other companies exactly where donations are coming from and where that money is going. Our application utilizes HTTP requests in order to greatly increase compatibility, and also uses multiple private key encryptions in order to ensure that any user data or information and monetary transactions are kept secure and private [2].

许多人向基金组织捐款，但很少有捐赠者知道这些捐款的去向。捐赠平台既不透明，也忽视了很大一部分潜在捐赠者:玩家[1]。本文探讨了利用区块链技术及其作为web3代币平台的概念，以便为捐赠路线提供透明度，向捐赠者和其他公司准确显示捐赠的来源和资金的去向。我们的应用程序利用HTTP请求来大大提高兼容性，并且还使用多个私钥加密来确保任何用户数据或信息和货币交易保持安全和私密性[2]。

引用次数: 0

Artificial Intelligence Designed For Attendance 为考勤设计的人工智能

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121711

Jasmin Liao, Yu Sun

Engaging online students is a challenge for many teachers. While I was a student, I saw teachers struggling to take attendance due to the number of students leaving their classes after attendance. Students would be held responsible for their work using facial recognition technology. To simplify the process of applying absences to students in each class, this paper proposes an application that would allow teachers to stay on top of their work. We applied our software to test “students” in the classroom and used various libraries/CSC styles to create a classroom that is easy for both the student and the teacher to read. Our designs are built upon OpenCV and PIL which are used as geometric classifiers to determine if the student is present. We tested several faces to see if the algorithm was suitable for the program. After conducting a qualitative evaluation of the approach, we’ve begun to implement registration, create new classrooms with different databases, and apply verification. With the addition of HTML code, we wereable to create a classroom that is safe, engaging, and easy to use.

对许多教师来说，吸引在线学生是一个挑战。当我还是个学生的时候，我看到老师们因为下课后离开的学生太多而很难点名。学生们将对使用面部识别技术的作业负责。为了简化每节课学生的缺勤申请流程，本文提出了一种应用程序，可以让教师掌握自己的工作。我们用我们的软件在教室里测试“学生”，并使用各种图书馆/CSC风格来创建一个学生和老师都易于阅读的教室。我们的设计基于OpenCV和PIL，它们被用作几何分类器来确定学生是否在场。我们测试了几张脸，看看算法是否适合这个程序。在对该方法进行定性评估后，我们开始实施注册，使用不同的数据库创建新教室，并应用验证。通过添加HTML代码，我们可以创建一个安全、吸引人且易于使用的教室。

引用次数: 0

A Data-Driven and Collaborative Mobile Application to Assist Sensors using Artificial Intelligence and Machine Learning 使用人工智能和机器学习辅助传感器的数据驱动和协作移动应用程序

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121712

A. Tse, Yu Sun

There are numerous arguments as to what the best GPS software is [1]. However, there are no definitive answers as to which is the best. In this paper, we use multiple GPS applications tracking user locations to determine what GPS is best in terms of tracking users through a mobile app [3]. The app utilizes a GPS as well as a Google Firebase Realtime Database to manage, pinpoint, and track users’ locations [2]. The application is explicitly applied to track locations of people that need care, such as the elderly. This will allow concerned caretakers to help keep track and take care of people in need.

关于什么是最好的GPS软件有很多争论[1]。然而，对于哪个是最好的，并没有明确的答案。在本文中，我们使用多个GPS应用程序跟踪用户位置，以确定哪种GPS在通过移动应用程序跟踪用户方面是最好的[3]。该应用程序利用GPS以及谷歌Firebase实时数据库来管理，精确定位和跟踪用户的位置[2]。该应用程序明确用于跟踪需要护理的人(如老年人)的位置。这将使关心的看护人能够帮助跟踪和照顾有需要的人。

引用次数: 0

Handwritten Digit Recognition System based on CNN and SVM 基于CNN和SVM的手写数字识别系统

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121701

Yousra Berrich, Z. Guennoun

The recognition of handwritten digits has aroused the interest of the scientific community and is the subject of a large number of research works thanks to its various applications. The objective of this paper is to develop a system capable of recognizing handwritten digits using a convolutional neural network (CNN) combined with machine learning approaches to ensure diversity in automatic classification tools. In this work, we propose a classification method based on deep learning, in particular the convolutional neural network for feature extraction, it is a powerful tool that has had great success in image classification, followed by the support vector machine (SVM) for higher performance. We used the dataset (MNIST), and the results obtained showed that the combination of CNN with SVM improves the performance of the model as well as the classification accuracy with a rate of 99.12%.

手写体数字识别由于其广泛的应用，引起了科学界的兴趣，并成为大量研究工作的主题。本文的目标是开发一个能够使用卷积神经网络(CNN)结合机器学习方法识别手写数字的系统，以确保自动分类工具的多样性。在这项工作中，我们提出了一种基于深度学习的分类方法，特别是卷积神经网络的特征提取，它是一个强大的工具，在图像分类中取得了巨大的成功，其次是支持向量机(SVM)的更高性能。我们使用数据集(MNIST)，得到的结果表明，CNN与SVM的结合提高了模型的性能，分类准确率达到99.12%。

引用次数: 0

Ensembles for Class Imbalance Problems in Various Domains 多领域类不平衡问题的集成

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121718

Deepakindresh N, Gauthum J, Jeffrin Harris, Harshavardhan J, Shivaditya Shivganesh

The paper is an analysis of class imbalance problems from various domains such as the medical field, sentiment analysis, software de-fects, water portability, and relationship status of students and summarizes the performance of data resampling techniques such as random undersampling and oversampling. Synthetic minority oversampling techniques combined with the power of ensemble methods such as bagging, boosting, and hybrid techniques are generally used to solve the class imbalance problem.

本文从医学领域、情感分析、软件缺陷、可携水性、学生关系状况等多个领域对班级失衡问题进行了分析，总结了随机欠采样和过采样等数据重采样技术的性能。综合少数派过采样技术与诸如bagging、boosting和hybrid技术等集成方法相结合，通常用于解决类不平衡问题。

引用次数: 0

Measurement Study on 5G NSA Architecture over Fading Channel 5G NSA架构衰落信道测量研究

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121703

B. S. D. Silva, Iury da S. Batalha

The 5G NR network with the Non-Standalone (NSA) architecture aims to advance with regard to throughput. When compared to fourth-generation mobile communication (4G LTE), the 5G has a higher data exchange capability through the gNB and the UE (User Equipment). For evaluation and optimization, it is necessary to carry out practical studies on the behaviour of the system in different environmental conditions, subject to attenuation processes, such as large-scale fading (Shading) and small-scale fading (Multipath propagation). This work has analysed the effect of the MCS (Modulation and Coding Scheme) variation on Throughput/BLER for, initially, a channel degraded by default AWGN, then the analysis extends to the multipath fading effect, which emulates more realistically a mobile communication network. The analysis confirmed the need for robust decision process algorithms in terms of MCS switching to maintain adequate data rates according to the requirement of each scenario with specific QoS (Quality of service), considering both 64 QAM and 256 QAM. The throughput degradation effect was more evident in higher-order modulations due to the higher probability of error inherent in the symbol arrangement. This study can be a key for understanding and developing huge modulation and coding schemes for fifth generation communications.

采用非独立(NSA)架构的5G NR网络旨在提高吞吐量。与第四代移动通信(4G LTE)相比，5G通过gNB和UE(用户设备)具有更高的数据交换能力。为了评估和优化，有必要对系统在不同环境条件下受衰减过程影响的行为进行实际研究，例如大规模衰落(Shading)和小规模衰落(Multipath propagation)。本工作分析了MCS(调制和编码方案)变化对吞吐量/BLER的影响，最初，信道被默认AWGN降级，然后分析扩展到多径衰落效应，更真实地模拟了移动通信网络。分析证实了在MCS切换方面需要健壮的决策过程算法，以根据具有特定QoS(服务质量)的每个场景的要求保持足够的数据速率，同时考虑64 QAM和256 QAM。在高阶调制中，由于符号排列中固有的错误概率更高，吞吐量下降效应更为明显。本研究是理解和开发第五代通信的大型调制和编码方案的关键。

{"title":"Measurement Study on 5G NSA Architecture over Fading Channel","authors":"B. S. D. Silva, Iury da S. Batalha","doi":"10.5121/csit.2022.121703","DOIUrl":"https://doi.org/10.5121/csit.2022.121703","url":null,"abstract":"The 5G NR network with the Non-Standalone (NSA) architecture aims to advance with regard to throughput. When compared to fourth-generation mobile communication (4G LTE), the 5G has a higher data exchange capability through the gNB and the UE (User Equipment). For evaluation and optimization, it is necessary to carry out practical studies on the behaviour of the system in different environmental conditions, subject to attenuation processes, such as large-scale fading (Shading) and small-scale fading (Multipath propagation). This work has analysed the effect of the MCS (Modulation and Coding Scheme) variation on Throughput/BLER for, initially, a channel degraded by default AWGN, then the analysis extends to the multipath fading effect, which emulates more realistically a mobile communication network. The analysis confirmed the need for robust decision process algorithms in terms of MCS switching to maintain adequate data rates according to the requirement of each scenario with specific QoS (Quality of service), considering both 64 QAM and 256 QAM. The throughput degradation effect was more evident in higher-order modulations due to the higher probability of error inherent in the symbol arrangement. This study can be a key for understanding and developing huge modulation and coding schemes for fifth generation communications.","PeriodicalId":170432,"journal":{"name":"Signal & Image Processing Trends","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132189632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MusicApp, A Music Sheet Transcribing Moblie Platform using Machine Learning and Nature Language Processing MusicApp，一个使用机器学习和自然语言处理的乐谱转录移动平台

Signal & Image Processing Trends

Pub Date : 2022-10-22 DOI: 10.5121/csit.2022.121709

Daniel Wang, Yu Sun

As technology advances, we have found more practical uses for it. This ranges from such things as cleaning the house using machines to serving restaurants with robots. Using technology, what if we can use machines to automatically write sheet music for us, transcribing it from audio [1]. This paper designs an application to do exactly that. We used Java to write a program and app that would be able to transcribe audio into sheet music and store it on an app. We applied our application to multiple cases and conducted a qualitative evaluation of the approach. The results show that it is possible with some fine tuning and may be usable in the near future.

随着技术的进步，我们发现了它的更多实际用途。这包括用机器打扫房间到用机器人为餐馆服务等。使用技术，如果我们可以使用机器自动为我们写乐谱，从音频中转录它[1]。本文设计了一个应用程序来做到这一点。我们使用Java编写了一个程序和应用程序，能够将音频转录成乐谱并存储在应用程序中。我们将应用程序应用于多个案例，并对该方法进行了定性评估。结果表明，经过一定的微调，该方法是可行的，在不久的将来可以使用。

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Signal & Image Processing Trends

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀

Signal &amp; Image Processing Trends最新文献

Signal & Image Processing Trends最新文献