Article Information

Title: Ambient Lighting Controller Based on Reinforcement Learning Components of Multi-Agents
Authors: Bielskis, A. A.; Guseinoviene et al.
Journal: Studies About Languages
Print ISSN: 2029-7203
Year of publication: 2012
Volume: 121
Issue: 5
Pages: 79-84
DOI: 10.5755/j01.eee.121.5.1656
Language: English
Publisher: Faculty of Humanities, Kaunas University of Technology
Abstract: The paper presents a vision of a university-type sustainable laboratory, the ESLab, which develops the idea of implementing the smart eco-social apartment recently proposed by the authors. A controller model of the ambient comfort measurement and environment control system is presented, which will be used for the development of the ESLab. The human Ambient Lighting Affect Reward index AAAP (ALAR) proposed in the paper is applied to developing a reinforcement-learning-based ambient comfort controller for the ESLab laboratory. The AAAP (ALAR) index depends on human physiological parameters: temperature, ECG (electrocardiogram), and EDA (electrodermal activity). Fuzzy logic is used to approximate the AAAP (ALAR) index function by applying two fuzzy inference systems: the arousal-valence system and the AAAP (ALAR) system of the person's surroundings. The goal of the developed reinforcement-learning-based ambient lighting controller PMGAAV (RLBACC) is to promote those environment control characteristics that create optimal comfort for the people affected by this environment. The controller model is based on radial basis neural networks, which implement the actor's policy structure for selecting suitable actions and compute the value function, known as the critic, which criticizes the actions taken by the actor. In this paper the critic was used as a value function approximation for the continuous learning tasks of the PMGAAV (RLBACC). Ill. 9, bibl. 7 (in English; abstracts in English and Lithuanian).
Other abstract: The paper presents a vision of a sustainable eco-social laboratory, the ESLab, which might be used to speed up development of the Smart Eco-Social Apartment recently proposed by the authors. A multi-agent model of the ambient comfort measurement and environment control system, to be used for the development of the ESLab, is presented. The human Ambient Lighting Affect Reward (ALAR) index is proposed and used for the first time to develop the Reinforcement Learning Based Ambient Comfort Controller (RLBACC) for the ESLab. The ALAR index depends on human physiological parameters: temperature, ECG (electrocardiogram), and EDA (electro-dermal activity). Fuzzy logic is used to approximate the ALAR index function by defining two fuzzy inference systems: the Arousal-Valence System and the Ambient Lighting Affect Reward (ALAR) System. The goal of the RLBACC is to find the environmental state characteristics that create optimal comfort for the people affected by this environment. A Radial Basis Neural Network is used as the main component of the RLBACC to perform two roles: the policy structure, known as the Actor, which selects actions, and the estimated value function, known as the Critic, which criticizes the actions made by the Actor. In this paper the Critic was used as a value function approximation for the continuous learning tasks of the RLBACC. Ill. 9, bibl. 7 (in English; abstracts in English and Lithuanian). DOI: http://dx.doi.org/10.5755/j01.eee.121.5.1656