International Journal of Next-Generation Computing最新文献

英文中文

Fruit Detection and Three-Stage Maturity Grading Using CNN 基于CNN的水果检测与三期成熟度分级

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1099

Harsh Mundhada, Sanskriti Sood, Saitejaswi Sanagavarapu, Rina Damdoo, Kanak Kalyani

Agriculture is a major sector for economic growth and development. The cultivation of fruit crops is a part of agriculture thus helping in the prosperity of our nation. In recent years, there has been a sudden hike in health problems and therefore, it has led to increasing demand for fruits and vegetables. Therefore, the use of innovative technologies is of significant importance for the fruit sector to give ripe and fresh fruits. Currently, Artificial Intelligence is a technology that is transforming every line of work. Particularly, Deep Learning (DL) has diverse applications due to its potential to learn mighty representations from images. A Convolutional Neural Network (CNN) is a noteworthy class of Deep Learning architecture that is built with the capability to bring out distinctive characteristics from image data. The utmost concern of many customers, vendors, and farmers is the quality of fruits and vegetables produced. Differentiating the fruits according to their ripening stages is the most crucialfactor in regulating the quality of fruits. This work used a high-quality dataset with 9997 images comprising 15 fruit classes. Moreover, based on the significant applications that Convolutional Neural Networks have had till now, it proposes an analysis of deep learning algorithms for fruit detection and three-stage maturity grading and achieves 90.24 percent accuracy. The results obtained will help in the development of fast and accurate detection of fruits and their quality

农业是经济增长和发展的主要部门。水果作物的种植是农业的一部分，因此有助于我们国家的繁荣。近年来，健康问题突然增加，因此导致对水果和蔬菜的需求增加。因此，使用创新技术对于水果部门提供成熟和新鲜的水果具有重要意义。目前，人工智能是一项正在改变各行各业的技术。特别是，深度学习(DL)由于其从图像中学习强大表示的潜力而具有多种应用。卷积神经网络(CNN)是一种值得注意的深度学习架构，它具有从图像数据中提取独特特征的能力。许多顾客、供应商和农民最关心的是所生产的水果和蔬菜的质量。根据成熟阶段来区分果实是调节果实品质的最关键因素。这项工作使用了一个包含15个水果类别的9997张图像的高质量数据集。此外，基于卷积神经网络迄今为止的重要应用，提出了一种用于水果检测和三阶段成熟度分级的深度学习算法分析，准确率达到90.24%。所得结果将有助于快速、准确地检测水果及其质量

{"title":"Fruit Detection and Three-Stage Maturity Grading Using CNN","authors":"Harsh Mundhada, Sanskriti Sood, Saitejaswi Sanagavarapu, Rina Damdoo, Kanak Kalyani","doi":"10.47164/ijngc.v14i1.1099","DOIUrl":"https://doi.org/10.47164/ijngc.v14i1.1099","url":null,"abstract":"Agriculture is a major sector for economic growth and development. The cultivation of fruit crops is a part of agriculture thus helping in the prosperity of our nation. In recent years, there has been a sudden hike in health problems and therefore, it has led to increasing demand for fruits and vegetables. Therefore, the use of innovative technologies is of significant importance for the fruit sector to give ripe and fresh fruits. Currently, Artificial Intelligence is a technology that is transforming every line of work. Particularly, Deep Learning (DL) has diverse applications due to its potential to learn mighty representations from images. A Convolutional Neural Network (CNN) is a noteworthy class of Deep Learning architecture that is built with the capability to bring out distinctive characteristics from image data. The utmost concern of many customers, vendors, and farmers is the quality of fruits and vegetables produced. Differentiating the fruits according to their ripening stages is the most crucialfactor in regulating the quality of fruits. This work used a high-quality dataset with 9997 images comprising 15 fruit classes. Moreover, based on the significant applications that Convolutional Neural Networks have had till now, it proposes an analysis of deep learning algorithms for fruit detection and three-stage maturity grading and achieves 90.24 percent accuracy. The results obtained will help in the development of fast and accurate detection of fruits and their quality","PeriodicalId":42021,"journal":{"name":"International Journal of Next-Generation Computing","volume":"61 1","pages":""},"PeriodicalIF":0.3,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74408355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Text-based Language Identifier using Multinomial Naïve Bayes Algorithm 基于文本的语言标识符使用多项Naïve贝叶斯算法

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1024

S. Rawat, Lakshita Werulkar, Sagarika Jaywant

Language Identification is among the crucial steps in any NLP based application. Text - based documents and webpages are rapidly increasing in the modern Internet. It is simple to locate documents written in different languages from all across the world that are available with just one click. Therefore, a language identifier is absolutely necessary in order to help the user interpret the content. Language identification has so far tended to be more concentrated on European languages and is still rather limited for Indian Traditional Languages. Many researchers have become more interested in the study of language identification for similar languages from popular languages. In this paper, Multinomial Na¨ıve Bayes Algorithm is used for detecting languages in Devanagari like Marathi, Sanskrit and Hindi, and three European languages French, Italian and English. An experiment done ondatasets of each language has produced satisfactorily accurate results after training and testing the model.

语言识别是任何基于自然语言处理的应用程序的关键步骤之一。在现代互联网中，基于文本的文档和网页正在迅速增加。只需点击一下，就可以轻松找到世界各地用不同语言编写的文档。因此，为了帮助用户解释内容，语言标识符是绝对必要的。到目前为止，语言识别倾向于更多地集中在欧洲语言上，而对印度传统语言的识别仍然相当有限。许多研究人员对大众语言中相似语言的语言识别研究越来越感兴趣。在本文中，多项Na¨ıve贝叶斯算法用于检测Devanagari语言，如马拉地语，梵语和印地语，以及三种欧洲语言法语，意大利语和英语。在每种语言的数据集上进行了实验，经过训练和测试，得到了令人满意的准确结果。

引用次数: 0

Forest Change Detection in the Amazon Rainforest 亚马逊雨林的森林变化检测

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1047

Tanisha Agrawal, Aarti Karandikar, Avinash Agrawal

Remote sensing is widely used in the prediction of forest cover. Forest plays an important role in the balance of the ecosystem. It helps to maintain the balance between climate. We depend a lot on forests for wood, oxygen, and also for the control of soil erosion. Hence it is our duty to maintain the forest cover on earth. Remote sensing images provide us with lots of information regarding the different landforms and materials. It is also used to predict natural disasters like forest fires, floods, etc. The normalized difference vegetation index is a simple graphical indicator that is used to analyze remote sensing measurements,(eg. space platform) predicting whether the target is live green vegetation or not. However, we have found out that it cannot be used for accurate prediction of forest land cover. With the help of time series data on the Amazon forest, it has been observed that the NDVI index fails to determine some of the important changes in the landform. To rectify this problem, the deep learning model was used to give an accuracy of 100 percent. The deep learning model gives similar results as observed results, hence making it the best-preferred method for predicting forest cover. With the help of multispectral analysis of the data, the deep learning model gives the best results for the red band, and near-infrared bands.

遥感在森林覆盖预测中有着广泛的应用。森林在生态系统的平衡中起着重要作用。它有助于维持气候之间的平衡。我们在很大程度上依赖森林提供木材、氧气，以及控制土壤侵蚀。因此，维护地球上的森林覆盖是我们的责任。遥感图像为我们提供了许多关于不同地形和物质的信息。它也被用来预测自然灾害，如森林火灾、洪水等。归一化植被指数是一种简单的图形指标，用于分析遥感测量结果，例如:空间平台)预测目标是否是活的绿色植被。然而，我们发现它不能用于准确预测森林土地覆盖。借助亚马逊森林的时间序列数据，可以观察到NDVI指数无法确定地形的一些重要变化。为了纠正这个问题，深度学习模型被用来给出100%的准确率。深度学习模型给出了与观测结果相似的结果，因此使其成为预测森林覆盖的最佳首选方法。在多光谱数据分析的帮助下，深度学习模型在红波段和近红外波段得到了最好的结果。

{"title":"Forest Change Detection in the Amazon Rainforest","authors":"Tanisha Agrawal, Aarti Karandikar, Avinash Agrawal","doi":"10.47164/ijngc.v14i1.1047","DOIUrl":"https://doi.org/10.47164/ijngc.v14i1.1047","url":null,"abstract":"Remote sensing is widely used in the prediction of forest cover. Forest plays an important role in the balance of the ecosystem. It helps to maintain the balance between climate. We depend a lot on forests for wood, oxygen, and also for the control of soil erosion. Hence it is our duty to maintain the forest cover on earth. Remote sensing images provide us with lots of information regarding the different landforms and materials. It is also used to predict natural disasters like forest fires, floods, etc. The normalized difference vegetation index is a simple graphical indicator that is used to analyze remote sensing measurements,(eg. space platform) predicting whether the target is live green vegetation or not. However, we have found out that it cannot be used for accurate prediction of forest land cover. With the help of time series data on the Amazon forest, it has been observed that the NDVI index fails to determine some of the important changes in the landform. To rectify this problem, the deep learning model was used to give an accuracy of 100 percent. The deep learning model gives similar results as observed results, hence making it the best-preferred method for predicting forest cover. With the help of multispectral analysis of the data, the deep learning model gives the best results for the red band, and near-infrared bands.","PeriodicalId":42021,"journal":{"name":"International Journal of Next-Generation Computing","volume":"37 1","pages":""},"PeriodicalIF":0.3,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79272769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

An Approach for generating best possible questions from the given text using Natural Language Processing 一种利用自然语言处理从给定文本生成最佳问题的方法

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1044

Neha Bhagwatkar, Kimaya Vaidya, Aditi Singh, Sneha Borikar, Hirkani Padwad

A crucial ability for every person is the capacity to ask pertinent questions. By automating the process of question formation, an automatic question generator is able to decrease the time and effort needed for manual question creation. Along with benefitting educational institutions like schools and colleges, automated question generation can be used in chatbots and for automated tutoring systems. Question Generation is an area in NLP that is still under research for greater accuracy. Research work has been done in many languages too. The goal of an automatic question generator is to generate syntactically and semantically correct questions, valid according to the given input. The Bidirectional Encoder Representations from Transformers (BERT) model is one of the pre-trained models adopted to implement the same. Additionally, we used Python packages, including NLTK, Spacy, and PKE. To test our findings, we evaluated the validity and relevance of generated questions using human-level cognition and evaluation. We were successful in creating inquiries that adequately reflected several of the peculiarities of English so that a person might comprehend them.

每个人的一项关键能力是提出相关问题的能力。通过自动化问题形成过程，自动问题生成器能够减少手动问题创建所需的时间和精力。除了有利于学校和大学等教育机构外，自动问题生成还可以用于聊天机器人和自动辅导系统。问题生成是NLP的一个领域，目前仍在研究更高的准确性。研究工作也以多种语言进行。自动问题生成器的目标是生成语法和语义正确的问题，根据给定的输入有效。双向编码器表示从变压器(BERT)模型是一种预训练模型采用实现相同的。此外，我们还使用了Python包，包括NLTK、Spacy和PKE。为了验证我们的发现，我们使用人类水平的认知和评估来评估生成问题的有效性和相关性。我们成功地创造了能够充分反映英语特点的查询，以便人们能够理解它们。

{"title":"An Approach for generating best possible questions from the given text using Natural Language Processing","authors":"Neha Bhagwatkar, Kimaya Vaidya, Aditi Singh, Sneha Borikar, Hirkani Padwad","doi":"10.47164/ijngc.v14i1.1044","DOIUrl":"https://doi.org/10.47164/ijngc.v14i1.1044","url":null,"abstract":"A crucial ability for every person is the capacity to ask pertinent questions. By automating the process of question formation, an automatic question generator is able to decrease the time and effort needed for manual question creation. Along with benefitting educational institutions like schools and colleges, automated question generation can be used in chatbots and for automated tutoring systems. Question Generation is an area in NLP that is still under research for greater accuracy. Research work has been done in many languages too. The goal of an automatic question generator is to generate syntactically and semantically correct questions, valid according to the given input. The Bidirectional Encoder Representations from Transformers (BERT) model is one of the pre-trained models adopted to implement the same. Additionally, we used Python packages, including NLTK, Spacy, and PKE. To test our findings, we evaluated the validity and relevance of generated questions using human-level cognition and evaluation. We were successful in creating inquiries that adequately reflected several of the peculiarities of English so that a person might comprehend them.","PeriodicalId":42021,"journal":{"name":"International Journal of Next-Generation Computing","volume":"33 1","pages":""},"PeriodicalIF":0.3,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75609358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Secure approach for point-to-point communication in a real time environment using a WebRtc framework 在使用WebRtc框架的实时环境中实现点对点通信的安全方法

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1036

P. Pardhi, P. Sonsare

With the progress of the internet from web 2.0 to web 3.0, the increased use of decentralized applications has emerged. Popularized by BlockChain but not limited to decentralized finance, decentralized applications have vast applications with regards to security, storage, and delivery of content over the web. In this paper we outlines the development of a web application prototype using JavaScript programming language, JavaScript based libraries such as BugOut and PeerJS, and the WebRTC (Web Real Time Communication) framework. We have also discuss the brief comparisons between the existing centralized applications and our proposed model. An essential component of this prototype is outlined via the use of P2P networking, which is the backbone of decentralization.

随着互联网从web 2.0到web 3.0的发展，分散式应用程序的使用越来越多。由区块链推广，但不限于去中心化金融，去中心化应用程序在安全性，存储和网络内容交付方面具有广泛的应用。在本文中，我们概述了使用JavaScript编程语言，基于JavaScript的库(如BugOut和PeerJS)和WebRTC (web Real Time Communication)框架开发web应用程序原型。我们还讨论了现有集中式应用程序与我们提出的模型之间的简要比较。这个原型的一个重要组成部分是通过使用P2P网络来概述的，P2P网络是去中心化的支柱。

引用次数: 0

Information Retrieval Based Legal Search System 基于信息检索的法律检索系统

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1004

Nilotpal Chatterjee, Inshal Khan, Mrigank Pagey, Anant Loiya, A. Agrawal, A. Zadgaonkar

Calculating the similarity between two legal documents to find similar legal judgments is an important challenge in legal information. Efficiently computing this similarity by expanding widely used information retrieval and search engine techniques has practical applications in a number of tasks, like locating pertinent prior cases for a specific case document. Programmed data recovery frameworks or reports are the main parts of today’s selected emotional support networks or web indexes to reduce data overload. Investigating methodologies to work on the presentation of report recovery frameworks and web search tools is a working area of research. Various methods have been pro- posed in this research paper to explore ways to search the common law system for cases with a similar outcome. Building a legal decision support system is intended to increase efficiency by assisting stakeholders—including judges and attorneys—in finding related rulings promptly. In order to prepare arguments, a lawyer typically has to review earlier decisions that are comparable to (or pertinent to) the current case. The attorney examines the judgement database to discover similar judgements. Legal rulings are complex in nature and refer to other judgments. For this, proper techniques are needed for quality analysis of judgments and correct deductions from them. A proper analysis of several types of similarity measures, such as all-term-based similarity methods, legal terms, co-citations, and bibliographic links, performed to look for comparable conclusions. According to experimental findings, the law term similarity approach outperforms all term cosine similarity methods. The out- comes also demonstrate that the co-citation approach performs worse than the bibliographic linkage similarity method and improves performance over the co-citation approach. After proper analysis of various methods in this field, proper comparison can be made between documents and similar legal documents can also be easily searched based on their similarity pattern and can be used to make meaningful deductions.

计算两份法律文书之间的相似度以寻找相似的法律判决是法律信息领域的一个重要挑战。通过扩展广泛使用的信息检索和搜索引擎技术，有效地计算这种相似性在许多任务中具有实际应用，例如为特定案例文档定位相关的先前案例。程序化数据恢复框架或报告是当今选择的情感支持网络或web索引的主要部分，以减少数据过载。调查报告恢复框架和网络搜索工具的呈现方法是研究的一个工作领域。本研究报告提出了各种方法，以探索如何在普通法体系中寻找具有类似结果的案例。建立法律决策支持系统的目的是通过帮助包括法官和律师在内的利益相关者及时发现相关裁决，从而提高效率。为了准备辩论，律师通常必须审查与当前案件相当(或相关)的早期判决。律师检查判决书数据库以发现相似的判决书。法律裁决本质上是复杂的，并涉及其他判决。为此，需要适当的技术对判断进行高质量的分析和正确的演绎。对几种类型的相似度量进行了适当的分析，例如基于所有术语的相似方法、法律术语、共引和书目链接，以寻找可比较的结论。实验结果表明，规律项相似度方法优于所有项余弦相似度方法。研究结果还表明，共被引方法比书目链接相似度方法性能差，但比共被引方法性能好。在对该领域的各种方法进行适当的分析后，可以对文件进行适当的比较，也可以根据相似模式轻松地搜索到类似的法律文件，并可以使用它们进行有意义的推论。

{"title":"Information Retrieval Based Legal Search System","authors":"Nilotpal Chatterjee, Inshal Khan, Mrigank Pagey, Anant Loiya, A. Agrawal, A. Zadgaonkar","doi":"10.47164/ijngc.v14i1.1004","DOIUrl":"https://doi.org/10.47164/ijngc.v14i1.1004","url":null,"abstract":"\u0000\u0000\u0000Calculating the similarity between two legal documents to find similar legal judgments is an important challenge in legal information. Efficiently computing this similarity by expanding widely used information retrieval and search engine techniques has practical applications in a number of tasks, like locating pertinent prior cases for a specific case document. Programmed data recovery frameworks or reports are the main parts of today’s selected emotional support networks or web indexes to reduce data overload. Investigating methodologies to work on the presentation of report recovery frameworks and web search tools is a working area of research. Various methods have been pro- posed in this research paper to explore ways to search the common law system for cases with a similar outcome. Building a legal decision support system is intended to increase efficiency by assisting stakeholders—including judges and attorneys—in finding related rulings promptly. In order to prepare arguments, a lawyer typically has to review earlier decisions that are comparable to (or pertinent to) the current case. The attorney examines the judgement database to discover similar judgements. Legal rulings are complex in nature and refer to other judgments. For this, proper techniques are needed for quality analysis of judgments and correct deductions from them. A proper analysis of several types of similarity measures, such as all-term-based similarity methods, legal terms, co-citations, and bibliographic links, performed to look for comparable conclusions. According to experimental findings, the law term similarity approach outperforms all term cosine similarity methods. The out- comes also demonstrate that the co-citation approach performs worse than the bibliographic linkage similarity method and improves performance over the co-citation approach. After proper analysis of various methods in this field, proper comparison can be made between documents and similar legal documents can also be easily searched based on their similarity pattern and can be used to make meaningful deductions.\u0000\u0000\u0000","PeriodicalId":42021,"journal":{"name":"International Journal of Next-Generation Computing","volume":"358 1","pages":""},"PeriodicalIF":0.3,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82628594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Plant Disease Detection using CNN Models 利用CNN模型进行植物病害检测

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1015

Shreyas Bobde, Kavita B. Kalambe, Anurag Tripathi, Kushal Deoda, Vyankatesh Haridas

In this modern planet it is very much important to have a good and healthy life for an individual to survive. Just as we humans have a lot of diseases, similarly many diseases are found in plants too. Many models have been made who detect these diseases, but they are not able to give such good accuracy to know which disease has happened. Recognizing plant infection in crops utilizing pictures is an inherently troublesome assignment.This research demonstrates the potential of disease detection models for plant leaves. It covers a number of stages, including picture capture, classification and many more. Extensive researches have already been done by using the CNN model. We have analyzed all these CNN models and on the basis of analysis we have made our own.

在这个现代星球上，一个人要想生存，拥有一个良好健康的生活是非常重要的。就像我们人类有很多疾病一样，植物也有很多疾病。已经建立了许多模型来检测这些疾病，但它们不能给出如此好的准确性来知道哪种疾病发生了。利用图片识别作物的植物侵染是一项棘手的任务。本研究证明了植物叶片疾病检测模型的潜力。它涵盖了许多阶段，包括图片捕获，分类等等。利用CNN模型已经做了大量的研究。我们分析了所有这些CNN模型，并在分析的基础上做出了自己的模型。

引用次数: 0

Crypto-Currency Price Prediction Using Deep Learning 使用深度学习的加密货币价格预测

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1029

Supriya S. Thombre, Aarti Devikar, Vaishnav Gangamwar, Pratik Majrikar, Tanmay Patil

After the price swings of crypto-currencies in past years, it has been considered as an asset. As crypto-currency is unpredictable, there arises the requirement of crypto-currency price prediction with greater level of accuracy. For this many researchers uses variety of ML and DL algorithms and are applying them to build a model which will predict crypto-currency price with improved accuracy. To build successful investment plan, accurate prediction is needed. The proposed method uses combination of LSTM and GRU for the bitcoin price prediction in order to find the closing price of bitcoin

在过去几年加密货币的价格波动之后，它被视为一种资产。由于加密货币具有不可预测性，因此对加密货币价格预测的准确性提出了更高的要求。为此，许多研究人员使用各种ML和DL算法，并将它们应用于构建一个模型，该模型将以更高的准确性预测加密货币价格。为了制定成功的投资计划，准确的预测是必要的。提出的方法结合LSTM和GRU对比特币价格进行预测，以找到比特币的收盘价

引用次数: 0

Incorporating Transfer Learning in CNN Architecture 在CNN建筑中应用迁移学习

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1052

Aparna Gurjar, Preeti S. Voditel

Machine learning (ML) is a data intensive process. For training of ML algorithms huge datasets are required.There are times when enough data is not available due to multitude of reasons. This could be due to lack ofavailability of annotated data in a particular domain or paucity of time in data collection process resulting innon-availability of enough data. Many a times data collection is very expensive and in few domains data collectionis very difficult. In such cases, if methods can be designed to reuse the knowledge gained in one domain havingenough training data, to some other related domain having less training data, then problems associated with lackof data can be overcome. Transfer Learning (TL) is one such method. TL improves the performance of the targetdomain through knowledge transfer from some different but related source domain. This knowledge transfer canbe in form of feature extraction, domain adaptation, rule extraction for advice and so on. TL also works withvarious kinds of ML tasks related to supervised, unsupervised and reinforcement learning. The ConvolutionalNeural Networks are well suited for the TL approach. The general features learned on a base network (source)are shifted to the target network. The target network then uses its own data and learns new features specific toits requirement.

机器学习(ML)是一个数据密集型的过程。对于机器学习算法的训练，需要大量的数据集。有时由于多种原因而无法获得足够的数据。这可能是由于在特定领域缺乏可用的注释数据，或者在数据收集过程中缺乏时间，导致无法获得足够的数据。很多时候，数据收集是非常昂贵的，在少数领域的数据收集是非常困难的。在这种情况下，如果方法可以被设计成将在一个有足够训练数据的领域中获得的知识重用到具有较少训练数据的其他相关领域，那么与缺乏数据相关的问题就可以克服。迁移学习(TL)就是这样一种方法。TL通过从一些不同但相关的源领域转移知识来提高目标领域的性能。这种知识转移可以通过特征提取、领域自适应、建议规则提取等形式进行。TL还适用于与监督学习、无监督学习和强化学习相关的各种ML任务。卷积神经网络非常适合TL方法。在基本网络(源)上学习到的一般特征被转移到目标网络上。然后，目标网络使用自己的数据并学习特定于其需求的新特征。

{"title":"Incorporating Transfer Learning in CNN Architecture","authors":"Aparna Gurjar, Preeti S. Voditel","doi":"10.47164/ijngc.v14i1.1052","DOIUrl":"https://doi.org/10.47164/ijngc.v14i1.1052","url":null,"abstract":"Machine learning (ML) is a data intensive process. For training of ML algorithms huge datasets are required.There are times when enough data is not available due to multitude of reasons. This could be due to lack ofavailability of annotated data in a particular domain or paucity of time in data collection process resulting innon-availability of enough data. Many a times data collection is very expensive and in few domains data collectionis very difficult. In such cases, if methods can be designed to reuse the knowledge gained in one domain havingenough training data, to some other related domain having less training data, then problems associated with lackof data can be overcome. Transfer Learning (TL) is one such method. TL improves the performance of the targetdomain through knowledge transfer from some different but related source domain. This knowledge transfer canbe in form of feature extraction, domain adaptation, rule extraction for advice and so on. TL also works withvarious kinds of ML tasks related to supervised, unsupervised and reinforcement learning. The ConvolutionalNeural Networks are well suited for the TL approach. The general features learned on a base network (source)are shifted to the target network. The target network then uses its own data and learns new features specific toits requirement.","PeriodicalId":42021,"journal":{"name":"International Journal of Next-Generation Computing","volume":"9 1","pages":""},"PeriodicalIF":0.3,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82324141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Effect of Stemming on Hindi Text Classification 词干提取对印地语文本分类的影响

IF 0.3

International Journal of Next-Generation Computing

Pub Date : 2023-02-15 DOI: 10.47164/ijngc.v14i1.1063

Dr. Anjusha Pimpalshende, Preety Singh, Dr. Archana Potnurwar

Abstract. Text classification is very useful to search large amount of textual data available online by dividing it into smaller relevant units. Now a day’s large amount of digital documents are available in Indian languages. Designing text classifiers in Indian languages is one of the research areas so that people can search and read required documents in their local languages. In proposed work tried to design Text classifier for Hindi text documents and tried to show how stemmer affects the performance of Hindi text classifiers. Stemming is a process to convert words in any language to its base or root words. Stemmers are used for written documents not for spoken languages. Performance of many applications such as text summarization, Information Retrieval (IR) system,text classification systems, syntactic parsing can be improved by applying stemmers. Stemmer eliminates suffix or prefix of the word and form original root word. These root words helps in the preprocessing step required in many algorithms. We applied various stemmers on Hindi text classification models. Experiments and results show that performance of the classifiers is improved by applying stemmers.

抽象的。文本分类是一种非常有用的方法，它可以将大量的在线文本数据划分成较小的相关单元。现在，每天大量的数字文件都有印度语言版本。设计印度语言的文本分类器是研究领域之一，以便人们可以用当地语言搜索和阅读所需的文档。在提议的工作中，试图为印地语文本文档设计文本分类器，并试图展示stemmer如何影响印地语文本分类器的性能。词干提取是将任何语言中的单词转换为其基础或词根的过程。词干用于书写文件而不是口语。系统的应用可以提高文本摘要、信息检索系统、文本分类系统、句法分析等应用的性能。词根去掉单词的后缀或前缀，形成原词根。这些词根有助于许多算法所需的预处理步骤。我们在印地语文本分类模型上应用了不同的词干。实验和结果表明，系统的应用提高了分类器的性能。

{"title":"Effect of Stemming on Hindi Text Classification","authors":"Dr. Anjusha Pimpalshende, Preety Singh, Dr. Archana Potnurwar","doi":"10.47164/ijngc.v14i1.1063","DOIUrl":"https://doi.org/10.47164/ijngc.v14i1.1063","url":null,"abstract":"Abstract. Text classification is very useful to search large amount of textual data available online by dividing it into smaller relevant units. Now a day’s large amount of digital documents are available in Indian languages. Designing text classifiers in Indian languages is one of the research areas so that people can search and read required documents in their local languages. In proposed work tried to design Text classifier for Hindi text documents and tried to show how stemmer affects the performance of Hindi text classifiers. Stemming is a process to convert words in any language to its base or root words. Stemmers are used for written documents not for spoken languages. Performance of many applications such as text summarization, Information Retrieval (IR) system,text classification systems, syntactic parsing can be improved by applying stemmers. Stemmer eliminates suffix or prefix of the word and form original root word. These root words helps in the preprocessing step required in many algorithms. We applied various stemmers on Hindi text classification models. Experiments and results show that performance of the classifiers is improved by applying stemmers.","PeriodicalId":42021,"journal":{"name":"International Journal of Next-Generation Computing","volume":"40 1","pages":""},"PeriodicalIF":0.3,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74309378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Journal of Next-Generation Computing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀