Modeling of Ambient Comfort Affect Reward based on multi-agents in cloud interconnection environment for developing the sustainable home controller

A. A. Bielskis, E. Guseinoviene, L. Zutautas, Darius Drungilas, D. Dzemydienė, Gediminas Gricius
{"title":"Modeling of Ambient Comfort Affect Reward based on multi-agents in cloud interconnection environment for developing the sustainable home controller","authors":"A. A. Bielskis, E. Guseinoviene, L. Zutautas, Darius Drungilas, D. Dzemydienė, Gediminas Gricius","doi":"10.1109/EVER.2013.6521585","DOIUrl":null,"url":null,"abstract":"The paper presents a research based on a vision of a multi-agent model working for the ambient comfort measurement and environment control system. Such means are used for developing the Smarter Eco-Social Laboratory (SrESL). The human Ambient Comfort Affect Reward (ACAR) index is proposed for development of the Reinforcement Learning Based Ambient Comfort Controller (RL-ACC) for experiments using equipment of SrESL. The ACAR index is recognized as dependent on human physiological parameters, such as the temperature, the electrocardiogram (ECG) and the electro-dermal activity (EDA). The fuzzy logic is used to approximate the ACAR index function by defining two fuzzy inference systems: the Arousal-Valence System, and the Ambient Comfort Affect Reward (ACAR) System. The goal of the RL-ACC is to find such the environmental state characteristics that create an optimal comfort for people affected by this environment. The Radial Basis Neural Network is used as the main component of the RL-ACC to performing of two roles: the policy structure, known as the Actor, used to select actions, and the estimated value function, known as the Critic that criticizes the actions made by the Actor. The Actor which manages Critic processes was used as a value function approximation of the continuous learning tasks of the RL-ACC and presented in this paper.","PeriodicalId":386323,"journal":{"name":"2013 Eighth International Conference and Exhibition on Ecological Vehicles and Renewable Energies (EVER)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Eighth International Conference and Exhibition on Ecological Vehicles and Renewable Energies (EVER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EVER.2013.6521585","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The paper presents a research based on a vision of a multi-agent model working for the ambient comfort measurement and environment control system. Such means are used for developing the Smarter Eco-Social Laboratory (SrESL). The human Ambient Comfort Affect Reward (ACAR) index is proposed for development of the Reinforcement Learning Based Ambient Comfort Controller (RL-ACC) for experiments using equipment of SrESL. The ACAR index is recognized as dependent on human physiological parameters, such as the temperature, the electrocardiogram (ECG) and the electro-dermal activity (EDA). The fuzzy logic is used to approximate the ACAR index function by defining two fuzzy inference systems: the Arousal-Valence System, and the Ambient Comfort Affect Reward (ACAR) System. The goal of the RL-ACC is to find such the environmental state characteristics that create an optimal comfort for people affected by this environment. The Radial Basis Neural Network is used as the main component of the RL-ACC to performing of two roles: the policy structure, known as the Actor, used to select actions, and the estimated value function, known as the Critic that criticizes the actions made by the Actor. The Actor which manages Critic processes was used as a value function approximation of the continuous learning tasks of the RL-ACC and presented in this paper.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
云互联环境下基于多智能体的环境舒适影响奖励建模及可持续家居控制器开发
本文提出了一种基于多智能体模型的环境舒适性测量与环境控制系统的研究。这些手段被用于开发智能生态社会实验室(SrESL)。为了开发基于强化学习的环境舒适控制器(RL-ACC),提出了人类环境舒适影响奖励(ACAR)指标,用于在SrESL设备上进行实验。ACAR指数被认为依赖于人体的生理参数,如温度、心电图(ECG)和皮肤电活动(EDA)。通过定义唤醒效价系统和环境舒适影响奖励(ACAR)系统两个模糊推理系统,利用模糊逻辑近似ACAR指标函数。RL-ACC的目标是找到这样的环境状态特征,为受这种环境影响的人创造最佳的舒适度。径向基神经网络作为RL-ACC的主要组成部分,用于执行两个角色:用于选择操作的策略结构(称为Actor)和用于批评Actor所做操作的估计值函数(称为Critic)。本文将管理批评过程的Actor作为RL-ACC连续学习任务的值函数逼近。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Ideal analytical expression of the magnetic circuit in a permanent magnet linear machine Topologies of flux-switching machines for in-wheel traction Influence of machine control strategy on electric vehicle range Design, sizing and set up of a specific low cost electronic load for PV modules characterization Regenerative braking in a small low cost plug-in hybrid electric vehicle for urban use
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1