Basic article information

Title: SIMULACIÓN DEL DILEMA DEL PRISIONERO A PARTIR DE MODELOS CONEXIONISTAS DE APRENDIZAJE POR REFORZAMIENTO
Authors: Julián Tejada H.; Lina María Perilla R.; Sara Victoria Serrato V.; et al.
Journal: Suma Psicológica
Print ISSN: 0121-4381
Electronic ISSN: 2145-9797
Publication year: 2004
Volume: 11
Issue: 1
Pages: 29-51
Language: Spanish
Publisher: Fundación Universitaria Konrad Lorenz
Abstract: El desarrollo de los computadores ha permitido la generación de modelos que permiten simular el comportamiento de los organismos vivos en condiciones controladas donde la manipulación de las variables se puede hacer de manera precisa. En la actualidad, los modelos de simulación se basan en el comportamiento de sistemas dinámicos, como las Redes Neuronales. Dentro de estos modelos se destaca uno que se basa en el condicionamiento operante y se denomina Aprendizaje por Reforzamiento. En la presente investigación se simuló a través de este modelo el Dilema del Prisionero (DP), manipulando una variable que determinaba un nivel motivacional de los organismos que los incitaba a ser cooperativos. Se realizaron alrededor de 187.800 ensayos en los que los organismos digitales tenían que enfrentarse al DP manipulando 6 niveles de motivación. Los resultados permiten identificar una característica intrínseca al DP y es que bajo ciertas condiciones los organismos optaron por no confesar de manera consistente, sin que por esto se pueda afirmar que están siendo cooperativos o autocontrolados. Lo anterior se debe a que en la simulación se decidió que los organismos no iban a tener conocimiento de la existencia del otro ni del efecto que sus acciones tenían sobre las consecuencias que su compañero recibía.
Other abstract: The development of computers has made it possible to build models that simulate the behavior of living organisms under controlled conditions, where variables can be manipulated precisely. Currently, simulation models are based on the behavior of dynamic systems such as neural networks. Among these models, one stands out that is based on operant conditioning and is called Reinforcement Learning. In the present study, the Prisoner's Dilemma (PD) was simulated with this model while manipulating a variable that set the organisms' motivational level, which prompted them to be cooperative. About 187,800 trials were run in which the digital organisms faced the PD under 6 levels of motivation. The results reveal an intrinsic characteristic of the PD: under certain conditions the organisms consistently chose not to confess, yet this does not justify claiming that they were being cooperative or self-controlled. The reason is that, in the simulation, the organisms were given no knowledge of the other's existence or of the effect their actions had on the consequences their partner received.
Keywords: Dilema del prisionero; aprendizaje por reforzamiento; cooperación; autocontrol; conexionismo; simulación.
Other keywords: Prisoner's dilemma; reinforcement learning; cooperation; self-control; connectionism; simulation.