Prediction of foreign currency exchange rates using an attention-based long short-term memory network

Shahram Ghahremani, Uyen Trang Nguyen
{"title":"Prediction of foreign currency exchange rates using an attention-based long short-term memory network","authors":"Shahram Ghahremani,&nbsp;Uyen Trang Nguyen","doi":"10.1016/j.mlwa.2025.100648","DOIUrl":null,"url":null,"abstract":"<div><div>We propose an <u>a</u>ttention-based <u>L</u>STM model for predicting <u>f</u>orex r<u>a</u>tes (ALFA). The prediction process consists of three stages. First, an LSTM model captures temporal dependencies within the forex time series. Next, an attention mechanism assigns different weights (importance scores) to the features of the LSTM model’s output. Finally, a fully connected layer generates predictions of forex rates. We conducted comprehensive experiments to evaluate and compare the performance of ALFA against several models used in previous work and against state-of-the-art deep learning models such as temporal convolutional networks (TCN) and Transformer. Experimental results show that ALFA outperforms the baseline models in most cases, across different currency pairs and feature sets, thanks to its attention mechanism that filters out irrelevant or redundant data to focus on important features. ALFA consistently ranks among the top three of the seven models evaluated and ranks first in most cases. We validated the effectiveness of ALFA by applying it to actual trading scenarios using several currency pairs. In these evaluations, ALFA achieves estimated annual return rates comparable to those of professional traders.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"20 ","pages":"Article 100648"},"PeriodicalIF":4.9000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827025000313","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We propose an attention-based LSTM model for predicting forex rates (ALFA). The prediction process consists of three stages. First, an LSTM model captures temporal dependencies within the forex time series. Next, an attention mechanism assigns different weights (importance scores) to the features of the LSTM model’s output. Finally, a fully connected layer generates predictions of forex rates. We conducted comprehensive experiments to evaluate and compare the performance of ALFA against several models used in previous work and against state-of-the-art deep learning models such as temporal convolutional networks (TCN) and Transformer. Experimental results show that ALFA outperforms the baseline models in most cases, across different currency pairs and feature sets, thanks to its attention mechanism that filters out irrelevant or redundant data to focus on important features. ALFA consistently ranks among the top three of the seven models evaluated and ranks first in most cases. We validated the effectiveness of ALFA by applying it to actual trading scenarios using several currency pairs. In these evaluations, ALFA achieves estimated annual return rates comparable to those of professional traders.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用基于注意力的长短期记忆网络预测外汇汇率
我们提出了一种用于预测外汇汇率的基于注意力的 LSTM 模型(ALFA)。预测过程包括三个阶段。首先,LSTM 模型捕捉外汇时间序列中的时间依赖性。接下来,注意力机制为 LSTM 模型输出的特征分配不同的权重(重要性分数)。最后,全连接层生成外汇汇率预测。我们进行了全面的实验,对 ALFA 的性能进行了评估,并与之前工作中使用的几个模型以及时序卷积网络(TCN)和 Transformer 等最先进的深度学习模型进行了比较。实验结果表明,在不同货币对和特征集的大多数情况下,ALFA 的性能都优于基线模型,这要归功于它的注意力机制,该机制可以过滤掉无关或冗余数据,从而将注意力集中在重要特征上。在评估的七个模型中,ALFA 一直名列前三,并在大多数情况下名列第一。我们将 ALFA 应用于多个货币对的实际交易场景,验证了它的有效性。在这些评估中,ALFA 实现了与专业交易员相当的估计年收益率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Machine learning with applications
Machine learning with applications Management Science and Operations Research, Artificial Intelligence, Computer Science Applications
自引率
0.00%
发文量
0
审稿时长
98 days
期刊最新文献
Causal Digital Twins for cyber–physical security in water systems: A framework for robust anomaly detection Beyond text: Multimodal stance detection in Arabic tweets SAFE AI metrics: An integrated approach Adaptive multi-domain uncertainty quantification for digital twin water forecasting DefMoN: A reproducible framework for theory-grounded synthetic data generation in affective AI
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1