Semi-supervised learning is a potential solution for improving training data in low-resourced abusive language detection contexts such as South African abusive language detection on Twitter. However, existing semi-supervised learning methods have been skewed towards small amounts of labelled data and small feature spaces. This paper, therefore, presents a semi-supervised learning technique that improves the distribution of training data by assigning labels to unlabelled data based on majority voting over different feature sets of labelled and unlabelled data clusters. The technique is applied to South African English corpora consisting of labelled and unlabelled abusive tweets. The proposed technique is compared with state-of-the-art self-learning and active learning techniques based on syntactic and semantic features. The performance of these techniques with Logistic Regression, Support Vector Machine and Neural Network classifiers is evaluated. The proposed technique, with accuracy and F1-score of 0.97 and 0.95, respectively, outperforms existing semi-supervised learning techniques. The learning curves show that the training data was used more efficiently by the proposed technique compared to existing techniques. Overall, n-gram syntactic features with a Logistic Regression classifier record the highest performance. The paper concludes that the proposed semi-supervised learning technique effectively detected implicit and explicit South African abusive language on Twitter.
"Improved semi-supervised learning technique for automatic detection of South African abusive language on Twitter" by O. Oriola and E. Kotzé. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.847
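The cluster-based majority voting described in the abstract above can be illustrated with a minimal sketch. Everything here is hypothetical scaffolding rather than the authors' pipeline: we assume each feature set (e.g. n-grams, word embeddings) has already produced a pseudo-label for every unlabelled tweet, for instance the dominant gold label in that tweet's cluster, and a final label is assigned only when enough feature sets agree.

```python
from collections import Counter

def majority_vote_labels(pseudo_labels_per_featureset, min_agreement=2):
    """Combine per-feature-set pseudo-labels by majority voting.

    pseudo_labels_per_featureset: list of dicts, one per feature set,
    mapping example id -> pseudo-label derived from that feature set's
    clustering (a hypothetical input shape; the paper's exact pipeline
    may differ).
    Returns a dict mapping example id -> label for examples where at
    least `min_agreement` feature sets voted for the same label.
    """
    accepted = {}
    ids = set().union(*(d.keys() for d in pseudo_labels_per_featureset))
    for ex_id in ids:
        votes = Counter(
            d[ex_id] for d in pseudo_labels_per_featureset if ex_id in d
        )
        label, count = votes.most_common(1)[0]
        if count >= min_agreement:
            accepted[ex_id] = label
    return accepted
```

With three feature sets and `min_agreement=2`, an unlabelled tweet is added to the training data only when at least two feature sets vote the same way; examples without a sufficient majority remain unlabelled.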
South African Computer Journal has not passed through the Covid-19 pandemic unscathed. Fortunately, at the time of writing, none of our editors has contracted the virus. However, we have all lost time to unaccustomed activities like online courses. Another problem many academics have faced is large numbers of plagiarism cases, arising from having to switch quickly to online learning with inadequate time to prepare. These problems pale into insignificance compared with the massive socioeconomic destruction around the world. Despite all this, we are able to publish a second issue on schedule in December. In this issue, most of the papers are extended papers from the inaugural Artificial Intelligence research conference, the Forum on AI Research (FAIR), held in Cape Town, South Africa over 3–6 December 2019. I therefore defer the main part of editorialising on the content of the issue to the guest editors, Deshendran Moodley and Marelie Davel.
"Editorial: More Covid" by P. Machanick. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.916
Many novice programmers fail to comprehend source code and its related concepts in the same way that their instructors do. As emphasised in the Decoding the Disciplines (DtDs) framework, each discipline (including Computer Science) has its own unique set of mental operations. However, instructors often take certain important mental operations for granted and do not explain these 'hidden' steps explicitly when modelling problem solutions. A clear understanding of the underlying cognitive processes and related support strategies employed by experts during source code comprehension (SCC) could ultimately be utilised to help novice programmers to better execute the cognitive processes necessary to efficiently comprehend source code. Positioned within Step 2 of the DtDs framework, this study employed decoding interviews and observations, followed by narrative data analysis, to identify the underlying cognitive processes and related support (though often 'hidden') strategies utilised by a select group of experienced programming instructors during an SCC task. The insights gained were then used to formulate a set of important cognitive-related support strategies for efficient SCC. Programming instructors are encouraged to continuously emphasise strategies like these when modelling their expert ways of thinking regarding efficient SCC more explicitly to their novice students.
"Decoding the underlying cognitive processes and related support strategies utilised by expert instructors during source code comprehension" by Pakiso J. Khomokhoana and Liezel Nel. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.811
No framework exists that can explain and predict the generalisation ability of deep neural networks in general circumstances. In fact, this question has not been answered for some of the least complicated of neural network architectures: fully-connected feedforward networks with rectified linear activations and a limited number of hidden layers. For such an architecture, we show how adding a summary layer to the network makes it more amenable to analysis, and allows us to define the conditions that are required to guarantee that a set of samples will all be classified correctly. This process does not describe the generalisation behaviour of these networks, but produces a number of metrics that are useful for probing their learning and generalisation behaviour. We support the analytical conclusions with empirical results, both to confirm that the mathematical guarantees hold in practice, and to demonstrate the use of the analysis process.
"Using Summary Layers to Probe Neural Network Behaviour" by Marelie Hattingh Davel. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.861
Reinforcement learning has recently experienced increased prominence in the machine learning community. There are many approaches to solving reinforcement learning problems, with new techniques developed constantly. When solving problems using reinforcement learning, there are various difficult challenges to overcome. To ensure progress in the field, benchmarks are important for testing new algorithms and comparing them with other approaches. The reproducibility of results for fair comparison is therefore vital in ensuring that improvements are judged accurately. This paper provides an overview of different contributions to reinforcement learning benchmarking and discusses how they can assist researchers in addressing the challenges facing reinforcement learning. The contributions discussed are the most widely used and most recent in the literature. The paper discusses the contributions in terms of implementation, available tasks, and the algorithm implementations provided with the benchmarks. The survey aims to bring attention to the wide range of reinforcement learning benchmarking tasks available and to encourage research to take place in a standardised manner. Additionally, this survey acts as an overview for researchers not familiar with the different tasks that can be used to develop and test new reinforcement learning algorithms.
"A survey of benchmarks for reinforcement learning algorithms" by B. Stapelberg and K. Malan. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.746
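A common thread among benchmark suites of the kind this survey covers is a standardised environment interface, in the style popularised by OpenAI Gym. The toy environment below is an illustrative sketch of that reset/step convention, not one of the surveyed benchmarks; the class and its dynamics are invented for the example.

```python
import random

class CoinFlipEnv:
    """Toy environment following the Gym-style reset/step convention
    shared by many RL benchmarks (a sketch, not a real benchmark task)."""

    def __init__(self, horizon=10, seed=0):
        self.horizon = horizon
        self.rng = random.Random(seed)
        self.t = 0

    def reset(self):
        # Begin a new episode and return the initial observation.
        self.t = 0
        return 0

    def step(self, action):
        # Reward 1 when the agent's guess matches the coin, else 0.
        coin = self.rng.randint(0, 1)
        reward = 1 if action == coin else 0
        self.t += 1
        done = self.t >= self.horizon
        return coin, reward, done, {}

def run_episode(env, policy):
    """Roll out one episode: the same loop works for any environment
    exposing this interface, which is what makes benchmarks comparable."""
    obs, total, done = env.reset(), 0, False
    while not done:
        obs, reward, done, _ = env.step(policy(obs))
        total += reward
    return total
```

Because every benchmark task exposes the same loop, a new algorithm can be evaluated across many tasks without task-specific glue code, which is precisely what makes standardised comparison and reproduction possible.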
Datalog is a powerful language that can be used to represent explicit knowledge and compute inferences in knowledge bases. Datalog cannot, however, represent or reason about contradictory rules. This is a limitation as contradictions are often present in domains that contain exceptions. In this paper, we extend Datalog to represent contradictory and defeasible information. We define an approach to efficiently reason about contradictory information in Datalog and show that it satisfies the KLM requirements for a rational consequence relation. We introduce DDLV, a defeasible Datalog reasoning system that implements this approach. Finally, we evaluate the performance of DDLV.
"DDLV: A System for rational preferential reasoning for datalog" by Michael Harrison and T. Meyer. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.850
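Classical Datalog's bottom-up (forward-chaining) evaluation, and the limitation the paper addresses, can be sketched on ground atoms. The helper below is a hypothetical toy, not DDLV itself: it derives every consequence of strict rules, so a "penguins are birds, birds fly" knowledge base happily concludes that Tweety flies, with no way to express the defeasible exception that penguins do not.

```python
def forward_chain(facts, rules):
    """Naive bottom-up Datalog evaluation over ground atoms.

    facts: set of atoms (strings); rules: list of (body, head) pairs,
    meaning `head` is derived once every atom in `body` is known.
    Classical Datalog has no mechanism to withhold or retract `head`
    when a contradictory rule also fires -- the gap that a defeasible
    extension such as DDLV is designed to fill.
    """
    derived = set(facts)
    changed = True
    while changed:  # iterate to a fixed point
        changed = False
        for body, head in rules:
            if head not in derived and all(b in derived for b in body):
                derived.add(head)
                changed = True
    return derived
```

For example, from the fact `penguin(tweety)` and the strict rules penguin ⊆ bird and bird ⊆ flies, the fixed point contains `flies(tweety)`; a defeasible rule "birds typically fly" with the exception "penguins do not fly" cannot be stated in this classical fragment.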
The Discrete Pulse Transform decomposes a signal into pulses, with the most recent and effective implementation being a graph-based algorithm called the Roadmaker’s Pavage. Although the implementation is efficient, the theoretical structure results in a slow, deterministic algorithm. This paper examines the use of the spectral domain of graphs and designs graph filter banks to downsample the algorithm, investigating the extent to which this speeds up the algorithm. Converting graph signals to the spectral domain is costly, so estimation for filter banks is examined, as well as the design of a reusable filter bank. The sampled version requires hyperparameters to reconstruct the same textures of the image as the original algorithm, preventing a large-scale study. Here an objective and efficient way of comparing results between the original algorithm and our proposed Filtered Roadmaker’s Pavage is provided. The method makes use of the Ht-index, separating the distribution of information at scale intervals. Empirical research using benchmark datasets shows that the proposed algorithm consistently runs faster and uses fewer computational resources, while maintaining a positive SSIM with low variance.
This provides an informative and faster approximation to the nonlinear DPT, a property not standardly achievable.
"Ht-index for empirical evaluation of the sampled graph-based Discrete Pulse Transform" by Mark De Lancey and I. Fabris-Rotelli. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.849
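The Ht-index itself can be computed with head/tail breaks. The sketch below follows one common formulation (repeatedly split the data at its mean while the "head" above the mean remains a strict minority, incrementing the index each time); the paper's exact variant and threshold may differ.

```python
def ht_index(values):
    """Ht-index via head/tail breaks (one common formulation).

    Repeatedly split at the mean; while the head (values above the
    mean) is a strict minority, recurse into the head and increment
    the index. Larger results indicate more levels of heavy-tailed
    'far more small things than large things' scaling.
    """
    h = 1
    data = list(values)
    while len(data) > 1:
        mean = sum(data) / len(data)
        head = [x for x in data if x > mean]
        if head and len(head) / len(data) < 0.5:
            h += 1
            data = head
        else:
            break
    return h
```

A heavy-tailed sample such as eight 1s, four 2s, two 10s and one 100 yields an Ht-index of 3 (three nested minority heads), while a uniform sample like four 5s yields the minimum of 1, since no value ever exceeds the mean.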
In this special issue, we feature selected papers from the inaugural Forum for Artificial Intelligence Research (FAIR), established and hosted by the Centre for Artificial Intelligence Research (CAIR). FAIR 2019 was held at the UCT Graduate School of Business Conference Centre in Cape Town, between 3 and 6 December 2019. The Department of Science and Technology’s (DST) latest White Paper on Science, Technology and Innovation (2019) identifies Artificial Intelligence (AI) and advanced Information and Communication Technologies (ICTs) as priority areas for South Africa. It recognises that these technologies will change the way the South African society and economy function. The potential of AI is already being unlocked in key areas of South African society. For example, South Africa’s power utility, Eskom, has identified AI as a future area for research and innovation, and is exploring the use of machine learning for real-time monitoring and fault prediction at their power stations (Bhugwandin et al., 2019). The South African Revenue Service is aggressively building an in-house AI capability for analysing and detecting non-compliance in tax returns (South African Revenue Services, 2020). The South African AI research community has also grown substantially over the last few years. While AI is generally considered to be a subdiscipline of Computer Science (Stone et al., 2016), it is at heart multidisciplinary: active AI research groups in South African universities can be found in Computer Science, Engineering, Philosophy, Information Systems, Statistics and Applied Mathematics departments. Within this context, FAIR was established to provide a venue for South African AI researchers from a broad range of disciplines to meet, interact and publish their work. Research contributions were solicited in five tracks, namely applications of AI, ethics and AI, knowledge representation, machine learning, and other topics in AI.
A total of 72 submissions were received, consisting of full papers, work in progress and extended abstracts (of work under review or published elsewhere). Full paper submissions were blind reviewed by at least two independent reviewers from the relevant disciplines and 20 full papers were accepted for publication in the conference proceedings (Davel & Barnard, 2019).
"Guest Editorial: FAIR 2019 special issue" by Deshendran Moodley and Marelie Hattingh Davel. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.915
Feedforward neural networks provide the basis for complex regression models that produce accurate predictions in a variety of applications. However, they generally do not explicitly provide any information about the utility of each of the input parameters in terms of their contribution to model accuracy. With this in mind, we develop the pairwise network, an adaptation to the fully connected feedforward network that allows the ranking of input parameters according to their contribution to model output. The application is demonstrated in the context of a space physics problem. Geomagnetic storms are multi-day events characterised by significant perturbations to the magnetic field of the Earth, driven by solar activity. Previous storm forecasting efforts typically use solar wind measurements as input parameters to a regression problem tasked with predicting a perturbation index such as the 1-minute cadence symmetric-H (Sym-H) index. We re-visit the task of predicting Sym-H from solar wind parameters, with two ‘twists’: (i) Geomagnetic storm phase information is incorporated as model inputs and shown to increase prediction performance. (ii) We describe the pairwise network structure and training process – first validating ranking ability on synthetic data, before using the network to analyse the Sym-H problem.
"Pairwise networks for feature ranking of a geomagnetic storm model" by J. Beukes, Marelie Hattingh Davel and S. Lotz. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.860
Marthinus W. Theunissen, Marelie Hattingh Davel, E. Barnard
The understanding of generalisation in machine learning is in a state of flux, in part due to the ability of deep learning models to interpolate noisy training data and still perform appropriately on out-of-sample data, thereby contradicting long-held intuitions about the bias-variance tradeoff in learning. We expand upon relevant existing work by discussing local attributes of neural network training within the context of a relatively simple framework. We describe how various types of noise can be compensated for within the proposed framework in order to allow the deep learning model to generalise in spite of interpolating spurious function descriptors. Empirically, we support our postulates with experiments involving overparameterised multilayer perceptrons and controlled training data noise. The main insights are that deep learning models are optimised for training data modularly, with different regions in the function space dedicated to fitting distinct types of sample information. Additionally, we show that models tend to fit uncorrupted samples first. Based on this finding, we propose a conjecture to explain an observed instance of the epoch-wise double-descent phenomenon. Our findings suggest that the notion of model capacity needs to be modified to consider the distributed way training data is fitted across sub-units.
"Benign interpolation of noise in deep learning" by Marthinus W. Theunissen, Marelie Hattingh Davel and E. Barnard. South African Computer Journal 32(2), 8 December 2020. DOI: https://doi.org/10.18489/sacj.v32i2.833