<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">ellibs</journal-id><journal-title-group><journal-title xml:lang="ru">Электронные библиотеки</journal-title><trans-title-group xml:lang="en"><trans-title>Russian Digital Libraries Journal</trans-title></trans-title-group></journal-title-group><issn pub-type="epub">1562-5419</issn><publisher><publisher-name>Казанский (Приволжский) федеральный университет</publisher-name></publisher></journal-meta><article-meta><article-id custom-type="elpub" pub-id-type="custom">ellibs-17</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>Статьи</subject></subj-group></article-categories><title-group><article-title>Разработка системы эмоциональной оценки на основе обучения с подкреплением и нейробиологически инспирированных методов</article-title><trans-title-group xml:lang="en"><trans-title>The system of emotional appraisal based on reinforcement learning and bio-inspired methods</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Майорова</surname><given-names>Е. Ю.</given-names></name></name-alternatives><email xlink:type="simple">eugeniamaiorova@gmail.com</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Таланов</surname><given-names>М. 
О.</given-names></name></name-alternatives><email xlink:type="simple">max.talanov@gmail.com</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Лоу</surname><given-names>Р.</given-names></name></name-alternatives><email xlink:type="simple">robert.lowe@his.se</email><xref ref-type="aff" rid="aff-2"/></contrib></contrib-group><aff xml:lang="ru" id="aff-1"><institution>Казанский (Приволжский) федеральный университет</institution><country>Russian Federation</country></aff><aff xml:lang="ru" id="aff-2"><institution>University of Gothenburg</institution><country>Sweden</country></aff><pub-date pub-type="collection"><year>2016</year></pub-date><pub-date pub-type="epub"><day>28</day><month>06</month><year>2016</year></pub-date><volume>19</volume><issue>3</issue><fpage>193</fpage><lpage>215</lpage><permissions><copyright-statement>Copyright &#x00A9; Майорова Е.Ю., Таланов М.О., Лоу Р., 2016</copyright-statement><copyright-year>2016</copyright-year><copyright-holder xml:lang="ru">Майорова Е.Ю., Таланов М.О., Лоу Р.</copyright-holder><copyright-holder xml:lang="en">Maiorova E.Yu., Talanov M.O., Lowe R.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://ellibs.elpub.ru/jour/article/view/17">https://ellibs.elpub.ru/jour/article/view/17</self-uri><abstract><p>Объектом проведенного исследования является эмоциональная оценка искусственного 
интеллекта. В качестве системы реализации эмоциональной оценки выбрана система обучения с подкреплением. В результате симуляции построенной модели получены графики, показывающие активность структур мозга, участвующих в процессе их воздействия друг на друга. В ходе настройки системы удалось добиться четырех вспышек активности на таламусе вместо ожидаемых пяти.
</p></abstract><trans-abstract xml:lang="en"><p>The object of this study is emotional appraisal in artificial intelligence. A reinforcement learning system was chosen to implement emotional appraisal. Simulating the constructed model produced plots showing the activity of the brain structures involved as they act on one another. After tuning the system, four bursts of activity were obtained in the thalamus instead of the expected five.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>куб Лёвхейма</kwd><kwd>эмоциональная оценка</kwd></kwd-group><kwd-group xml:lang="en"><kwd>NEST</kwd><kwd>NeuCogAR</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Максим Таланов. Эмоциональные вычисления. URL: http://postnauka.ru/video/45297.</mixed-citation><mixed-citation xml:lang="en">Maxim Talanov. Emotional computing (in Russian). URL: http://postnauka.ru/video/45297.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Loewenstein G., Lerner J.S. The role of affect in decision-making // In R. Davidson, K. Scherer, H. Goldsmith (Eds.) Handbook of Affective Science. New York: Oxford University Press, 2003. P. 619–642.</mixed-citation><mixed-citation xml:lang="en">Loewenstein G., Lerner J.S. The role of affect in decision-making // In R. Davidson, K. Scherer, H. Goldsmith (Eds.) Handbook of Affective Science. New York: Oxford University Press, 2003. P. 619–642.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Максим Таланов. Эмоциональный искусственный интеллект. URL: http://postnauka.ru/video/45296.</mixed-citation><mixed-citation xml:lang="en">Maxim Talanov. Emotional artificial intelligence (in Russian). URL: http://postnauka.ru/video/45296.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Tom Ziemke, Robert Lowe. On the Role of Emotion in Embodied Cognitive Architectures: From Organisms to Robots. Springer Science+Business Media, LLC 2009. P. 
71–73.</mixed-citation><mixed-citation xml:lang="en">Tom Ziemke, Robert Lowe. On the Role of Emotion in Embodied Cognitive Architectures: From Organisms to Robots. Springer Science+Business Media, LLC 2009. P. 71–73.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">David Sander, Didier Grandjean, Klaus R. Scherer. A systems approach to appraisal mechanisms in emotion. Geneva Emotion Research Group, Department of Psychology, University of Geneva, 2005. P. 140–148.</mixed-citation><mixed-citation xml:lang="en">David Sander, Didier Grandjean, Klaus R. Scherer. A systems approach to appraisal mechanisms in emotion. Geneva Emotion Research Group, Department of Psychology, University of Geneva, 2005. P. 140–148.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Petta P. The role of emotion in a tractable architecture for situated cognizers // In: Trappl R., Petta P., Payr S. Eds. Emotions in Humans and Artifacts. Cambridge, MA: MIT Press, 2003. P. 87–88.</mixed-citation><mixed-citation xml:lang="en">Petta P. The role of emotion in a tractable architecture for situated cognizers // In: Trappl R., Petta P., Payr S. Eds. Emotions in Humans and Artifacts. Cambridge, MA: MIT Press, 2003. P. 87–88.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Minsky Marvin. The Emotion Machine: Commonsense Thinking, Artificial Intelligence, and the Future of the Human Mind. Simon and Schuster, 2007. P. 256–258.</mixed-citation><mixed-citation xml:lang="en">Minsky Marvin. The Emotion Machine: Commonsense Thinking, Artificial Intelligence, and the Future of the Human Mind. Simon and Schuster, 2007. P. 
256–258.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Wörgötter F., Porr B. Temporal Sequence Learning, Prediction, and Control – a Review of different models and their relation to biological mechanisms. Department of Psychology, University of Stirling, 2005. P. 45.</mixed-citation><mixed-citation xml:lang="en">Wörgötter F., Porr B. Temporal Sequence Learning, Prediction, and Control – a Review of different models and their relation to biological mechanisms. Department of Psychology, University of Stirling, 2005. P. 45.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Ortony A., Norman D., Revelle W. Affect and proto-affect in effective functioning // In: Fellous J-M, Arbib M.A., Eds. Who needs emotions? New York: Oxford University Press, 2005.</mixed-citation><mixed-citation xml:lang="en">Ortony A., Norman D., Revelle W. Affect and proto-affect in effective functioning // In: Fellous J-M, Arbib M.A., Eds. Who needs emotions? New York: Oxford University Press, 2005.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Damasio A.R. The feeling of what happens: body, emotion and the making of consciousness. Heinemann: London, 1999. 400 p.</mixed-citation><mixed-citation xml:lang="en">Damasio A.R. The feeling of what happens: body, emotion and the making of consciousness. Heinemann: London, 1999. 400 p.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Rolls E. Emotion explained. Oxford: Oxford University Press, 2005.</mixed-citation><mixed-citation xml:lang="en">Rolls E. Emotion explained. 
Oxford: Oxford University Press, 2005.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Phelps E. Emotion and cognition: Insights from studies of the human amygdala // Annu. Rev. Psychol. 2006. V. 57. P. 27–53.</mixed-citation><mixed-citation xml:lang="en">Phelps E. Emotion and cognition: Insights from studies of the human amygdala // Annu. Rev. Psychol. 2006. V. 57. P. 27–53.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Scherer K.R., Ekman P. On the nature and function of emotion: a component process approach // In: Approaches to Emotion. Hillsdale, N.J.: Lawrence Erlbaum, 1984. P. 293–317.</mixed-citation><mixed-citation xml:lang="en">Scherer K.R., Ekman P. On the nature and function of emotion: a component process approach // In: Approaches to Emotion. Hillsdale, N.J.: Lawrence Erlbaum, 1984. P. 293–317.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Paulus Martin P., Angela J.Yu. Emotion and decision-making: affect-driven belief systems in anxiety and depression // Trends in Cognitive Sciences. September 2012. V. 16, No 9. P. 476–483.</mixed-citation><mixed-citation xml:lang="en">Paulus Martin P., Angela J.Yu. Emotion and decision-making: affect-driven belief systems in anxiety and depression // Trends in Cognitive Sciences. September 2012. V. 16, No 9. P. 476–483.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Kahneman D., Tversky A. Prospect theory: an analysis of decision under risk // Econometrica. 1979. V. 47. P. 263–291.</mixed-citation><mixed-citation xml:lang="en">Kahneman D., Tversky A. Prospect theory: an analysis of decision under risk // Econometrica. 1979. V. 47. P. 
263–291.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Mukherjee K. A dual system model of preferences under risk // Psychol. Rev. 2010. V. 117. P. 243–255.</mixed-citation><mixed-citation xml:lang="en">Mukherjee K. A dual system model of preferences under risk // Psychol. Rev. 2010. V. 117. P. 243–255.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Hsee C.K., Rottenstreich Y. Music, pandas, and muggers: on the affective psychology of value // J. Exp. Psychol. Gen. 2004. V. 133. P. 23–30.</mixed-citation><mixed-citation xml:lang="en">Hsee C.K., Rottenstreich Y. Music, pandas, and muggers: on the affective psychology of value // J. Exp. Psychol. Gen. 2004. V. 133. P. 23–30.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Kusev P., van Schaik P. Preferences under risk: content-dependent behavior and psychological processing // Front. Psychol. 2011. V. 2. P. 269–271.</mixed-citation><mixed-citation xml:lang="en">Kusev P., van Schaik P. Preferences under risk: content-dependent behavior and psychological processing // Front. Psychol. 2011. V. 2. P. 269–271.</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Breazeal C. Designing sociable robots. Cambridge, MA: MIT Press, 2002. 244 p.</mixed-citation><mixed-citation xml:lang="en">Breazeal C. Designing sociable robots. Cambridge, MA: MIT Press, 2002. 244 p.</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">Kelley A.E. Neurochemical networks encoding emotion and motivation: An evolutionary perspective // In: Fellous J-M., Arbib M.A., Eds. Who needs emotions? The brain meets the robot. 
New York: Oxford University Press, 2005.</mixed-citation><mixed-citation xml:lang="en">Kelley A.E. Neurochemical networks encoding emotion and motivation: An evolutionary perspective // In: Fellous J-M., Arbib M.A., Eds. Who needs emotions? The brain meets the robot. New York: Oxford University Press, 2005.</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">Max Talanov, Jordi Vallverdu, Salvatore Distefano, Manuel Mazzara, Radhakrishnan Delhibabu. Neuromodulating Cognitive Architecture: Towards Biomimetic Emotional AI // Advanced Information Networking and Applications (AINA), 2015 IEEE 29th International Conference. P. 587–592.</mixed-citation><mixed-citation xml:lang="en">Max Talanov, Jordi Vallverdu, Salvatore Distefano, Manuel Mazzara, Radhakrishnan Delhibabu. Neuromodulating Cognitive Architecture: Towards Biomimetic Emotional AI // Advanced Information Networking and Applications (AINA), 2015 IEEE 29th International Conference. P. 587–592.</mixed-citation></citation-alternatives></ref><ref id="cit22"><label>22</label><citation-alternatives><mixed-citation xml:lang="ru">Аллахвердов В.М., Богданова С.И. и др. Психология: учеб. / отв. ред. А.А. Крылов. М.: Проспект, 2005. С. 214–217.</mixed-citation><mixed-citation xml:lang="en">Allakhverdov V.M., Bogdanova S.I. et al. Psychology: a textbook (in Russian) / Ed. A.A. Krylov. Moscow: Prospekt, 2005. P. 214–217.</mixed-citation></citation-alternatives></ref><ref id="cit23"><label>23</label><citation-alternatives><mixed-citation xml:lang="ru">Vernon David. Artificial Cognitive Systems: a Primer. The MIT Press Cambridge, Massachusetts London, England, 2014. 288 p.</mixed-citation><mixed-citation xml:lang="en">Vernon David. Artificial Cognitive Systems: a Primer. The MIT Press Cambridge, Massachusetts London, England, 2014. 
288 p.</mixed-citation></citation-alternatives></ref><ref id="cit24"><label>24</label><citation-alternatives><mixed-citation xml:lang="ru">McCarthy J., Hayes P.J. Some philosophical problems from the standpoint of artificial intelligence // In: Meltzer B., Michie D., Eds. Machine Intelligence. Edinburgh: Edinburgh University Press, 1969. No 4. P. 463–502.</mixed-citation><mixed-citation xml:lang="en">McCarthy J., Hayes P.J. Some philosophical problems from the standpoint of artificial intelligence // In: Meltzer B., Michie D., Eds. Machine Intelligence. Edinburgh: Edinburgh University Press, 1969. No 4. P. 463–502.</mixed-citation></citation-alternatives></ref><ref id="cit25"><label>25</label><citation-alternatives><mixed-citation xml:lang="ru">Таланов Максим. Марвин Минский и эмоциональные машины. URL: https://postnauka.ru/faq/58727.</mixed-citation><mixed-citation xml:lang="en">Talanov Maxim. Marvin Minsky and emotional machines (in Russian). URL: https://postnauka.ru/faq/58727.</mixed-citation></citation-alternatives></ref><ref id="cit26"><label>26</label><citation-alternatives><mixed-citation xml:lang="ru">Lövheim H. A new three-dimensional model for emotions and monoamine neurotransmitters // Med Hypotheses. 2012. V. 78. P. 341–348.</mixed-citation><mixed-citation xml:lang="en">Lövheim H. A new three-dimensional model for emotions and monoamine neurotransmitters // Med Hypotheses. 2012. V. 78. P. 341–348.</mixed-citation></citation-alternatives></ref><ref id="cit27"><label>27</label><citation-alternatives><mixed-citation xml:lang="ru">Tomkins S. Affect theory // In: P. Ekman, W. Friesen, P. Ellsworth, Eds. Emotions in the Human Face. Cambridge: Cambridge University Press, 1982. P. 355–395.</mixed-citation><mixed-citation xml:lang="en">Tomkins S. Affect theory // In: P. Ekman, W. Friesen, P. Ellsworth, Eds. Emotions in the Human Face. 
Cambridge: Cambridge University Press, 1982. P. 355–395.</mixed-citation></citation-alternatives></ref><ref id="cit28"><label>28</label><citation-alternatives><mixed-citation xml:lang="ru">Smith Craig A., Lazarus Richard S. Emotion and Adaptation // In: L.A. Pervin, Ed. Handbook of Personality: Theory and Research. New York: Guilford, 1990. P. 609–637.</mixed-citation><mixed-citation xml:lang="en">Smith Craig A., Lazarus Richard S. Emotion and Adaptation // In: L.A. Pervin, Ed. Handbook of Personality: Theory and Research. New York: Guilford, 1990. P. 609–637.</mixed-citation></citation-alternatives></ref><ref id="cit29"><label>29</label><citation-alternatives><mixed-citation xml:lang="ru">Lazarus Richard S. Progress on a cognitive-motivational-relational theory of emotion // American Psychologist. 1991. V. 46, No 8. P. 819–834.</mixed-citation><mixed-citation xml:lang="en">Lazarus Richard S. Progress on a cognitive-motivational-relational theory of emotion // American Psychologist. 1991. V. 46, No 8. P. 819–834.</mixed-citation></citation-alternatives></ref><ref id="cit30"><label>30</label><citation-alternatives><mixed-citation xml:lang="ru">Niv Yael. Reinforcement learning in the brain // Psychology Department &amp; Princeton Neuroscience Institute, Princeton University, 2009.</mixed-citation><mixed-citation xml:lang="en">Niv Yael. Reinforcement learning in the brain // Psychology Department &amp; Princeton Neuroscience Institute, Princeton University, 2009.</mixed-citation></citation-alternatives></ref><ref id="cit31"><label>31</label><citation-alternatives><mixed-citation xml:lang="ru">Barto A.G. Adaptive critic and the basal ganglia // In J.C. Houk, J.L. Davis, D.G. Beiser, Eds. Models of information processing in the basal ganglia. Cambridge: MIT Press, 1995. P. 215–232.</mixed-citation><mixed-citation xml:lang="en">Barto A.G. Adaptive critic and the basal ganglia // In J.C. Houk, J.L. Davis, D.G. Beiser, Eds. 
Models of information processing in the basal ganglia. Cambridge: MIT Press, 1995. P. 215–232.</mixed-citation></citation-alternatives></ref><ref id="cit32"><label>32</label><citation-alternatives><mixed-citation xml:lang="ru">Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward // Science. 1997. V. 275. P. 1593–1599.</mixed-citation><mixed-citation xml:lang="en">Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward // Science. 1997. V. 275. P. 1593–1599.</mixed-citation></citation-alternatives></ref><ref id="cit33"><label>33</label><citation-alternatives><mixed-citation xml:lang="ru">Wickens J.R., Kotter R. Cellular models of reinforcement // In: J.C. Houk, J.L. Davis, D.G. Beiser, Eds. Models of information processing in the basal ganglia. MIT Press, 1995. P. 187–214.</mixed-citation><mixed-citation xml:lang="en">Wickens J.R., Kotter R. Cellular models of reinforcement // In: J.C. Houk, J.L. Davis, D.G. Beiser, Eds. Models of information processing in the basal ganglia. MIT Press, 1995. P. 187–214.</mixed-citation></citation-alternatives></ref><ref id="cit34"><label>34</label><citation-alternatives><mixed-citation xml:lang="ru">Barto A.G., Sutton R.S., Watkins C.J.C.H. Learning and sequential decision making // In: M. Gabriel, J. Moore, Eds. Learning and computational neuroscience: Foundations of adaptive networks. Cambridge, MA: MIT Press, 1990. P. 593–602.</mixed-citation><mixed-citation xml:lang="en">Barto A.G., Sutton R.S., Watkins C.J.C.H. Learning and sequential decision making // In: M. Gabriel, J. Moore, Eds. Learning and computational neuroscience: Foundations of adaptive networks. Cambridge, MA: MIT Press, 1990. P. 593–602.</mixed-citation></citation-alternatives></ref><ref id="cit35"><label>35</label><citation-alternatives><mixed-citation xml:lang="ru">Bertsekas D.P., Tsitsiklis J.N. Neuro-dynamic programming. Athena Scientific, 1996. 
512 p.</mixed-citation><mixed-citation xml:lang="en">Bertsekas D.P., Tsitsiklis J.N. Neuro-dynamic programming. Athena Scientific, 1996. 512 p.</mixed-citation></citation-alternatives></ref><ref id="cit36"><label>36</label><citation-alternatives><mixed-citation xml:lang="ru">Sutton R.S., Barto A.G. Reinforcement Learning. An Introduction. Bradford Books, MIT Press, Cambridge, MA, 2002 edition, 1998. 320 p.</mixed-citation><mixed-citation xml:lang="en">Sutton R.S., Barto A.G. Reinforcement Learning. An Introduction. Bradford Books, MIT Press, Cambridge, MA, 2002 edition, 1998. 320 p.</mixed-citation></citation-alternatives></ref><ref id="cit37"><label>37</label><citation-alternatives><mixed-citation xml:lang="ru">Bellman R.E. Dynamic Programming. Princeton: Princeton University Press, 1957. 392 p.</mixed-citation><mixed-citation xml:lang="en">Bellman R.E. Dynamic Programming. Princeton: Princeton University Press, 1957. 392 p.</mixed-citation></citation-alternatives></ref><ref id="cit38"><label>38</label><citation-alternatives><mixed-citation xml:lang="ru">Sutton R.S. Learning to predict by the methods of temporal differences // Machine Learning. August 1988. V. 3, Issue 1. P. 9–44.</mixed-citation><mixed-citation xml:lang="en">Sutton R.S. Learning to predict by the methods of temporal differences // Machine Learning. August 1988. V. 3, Issue 1. P. 9–44.</mixed-citation></citation-alternatives></ref><ref id="cit39"><label>39</label><citation-alternatives><mixed-citation xml:lang="ru">Sutton R.S. Generalization in reinforcement learning: successful examples using sparse coarse coding // In: D.S. Touretzky, M.C. Mozer, M.E. Hasselmo, Eds. Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference. Cambridge, MA, 1996. P. 1038–1044.</mixed-citation><mixed-citation xml:lang="en">Sutton R.S. Generalization in reinforcement learning: successful examples using sparse coarse coding // In: D.S. Touretzky, M.C. Mozer, M.E. Hasselmo, Eds. 
Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference. Cambridge, MA, 1996. P. 1038–1044.</mixed-citation></citation-alternatives></ref><ref id="cit40"><label>40</label><citation-alternatives><mixed-citation xml:lang="ru">Rummery G.A. Problem solving with reinforcement learning. PhD thesis. Cambridge University, Cambridge, 1995. 52 p.</mixed-citation><mixed-citation xml:lang="en">Rummery G.A. Problem solving with reinforcement learning. PhD thesis. Cambridge University, Cambridge, 1995. 52 p.</mixed-citation></citation-alternatives></ref><ref id="cit41"><label>41</label><citation-alternatives><mixed-citation xml:lang="ru">Watkins C.J.C.H. Learning from delayed rewards. PhD thesis. University of Cambridge, Cambridge, England, 1989. 234 p. URL: https://www.cs.rhul.ac.uk/home/chrisw/new_thesis.pdf.</mixed-citation><mixed-citation xml:lang="en">Watkins C.J.C.H. Learning from delayed rewards. PhD thesis. University of Cambridge, Cambridge, England, 1989. 234 p. URL: https://www.cs.rhul.ac.uk/home/chrisw/new_thesis.pdf.</mixed-citation></citation-alternatives></ref><ref id="cit42"><label>42</label><citation-alternatives><mixed-citation xml:lang="ru">Watkins C.J.C.H., Dayan P. Technical note: Q-Learning // Machine Learning. 1992. V. 8, Issue 3–4. P. 279–292. URL: http://www.gatsby.ucl.ac.uk/~dayan/papers/cjch.pdf.</mixed-citation><mixed-citation xml:lang="en">Watkins C.J.C.H., Dayan P. Technical note: Q-Learning // Machine Learning. 1992. V. 8, Issue 3–4. P. 279–292. URL: http://www.gatsby.ucl.ac.uk/~dayan/papers/cjch.pdf.</mixed-citation></citation-alternatives></ref><ref id="cit43"><label>43</label><citation-alternatives><mixed-citation xml:lang="ru">Pavlov I.P. Conditioned reflexes. London: Oxford University Press, 1927. URL: http://s-f-walker.org.uk/pubsebooks/pdfs/Conditioned-Reflexes-Pavlov.pdf.</mixed-citation><mixed-citation xml:lang="en">Pavlov I.P. Conditioned reflexes. London: Oxford University Press, 1927. 
URL: http://s-f-walker.org.uk/pubsebooks/pdfs/Conditioned-Reflexes-Pavlov.pdf.</mixed-citation></citation-alternatives></ref><ref id="cit44"><label>44</label><citation-alternatives><mixed-citation xml:lang="ru">Воронцов К.В. Обучение с подкреплением (Reinforcement Learning) URL: http://www.machinelearning.ru/wiki/images/archive/3/35/20140621071329!Voron-ML-RL-slides.pdf.</mixed-citation><mixed-citation xml:lang="en">Vorontsov K.V. Reinforcement Learning (in Russian). URL: http://www.machinelearning.ru/wiki/images/archive/3/35/20140621071329!Voron-ML-RL-slides.pdf.</mixed-citation></citation-alternatives></ref><ref id="cit45"><label>45</label><citation-alternatives><mixed-citation xml:lang="ru">Bellman R. A Markovian decision process // Journal of Mathematics and Mechanics. 1957. No 6. P. 716–719.</mixed-citation><mixed-citation xml:lang="en">Bellman R. A Markovian decision process // Journal of Mathematics and Mechanics. 1957. No 6. P. 716–719.</mixed-citation></citation-alternatives></ref><ref id="cit46"><label>46</label><citation-alternatives><mixed-citation xml:lang="ru">Rescorla R.A., Wagner A.R. A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement // In: A.H. Black, W.F. Prokasy, Eds. Classical conditioning II: Current research and theory. New York, NY: Appleton-Century-Crofts, 1972. P. 64–99.</mixed-citation><mixed-citation xml:lang="en">Rescorla R.A., Wagner A.R. A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement // In: A.H. Black, W.F. Prokasy, Eds. Classical conditioning II: Current research and theory. New York, NY: Appleton-Century-Crofts, 1972. P. 64–99.</mixed-citation></citation-alternatives></ref><ref id="cit47"><label>47</label><citation-alternatives><mixed-citation xml:lang="ru">Gewaltig Marc-Oliver, Diesmann Markus. NEST (NEural Simulation Tool) // Scholarpedia. 2007. V. 2, No 4. P. 1430. 
URL: http://www.scholarpedia.org/article/NEST_(NEural_Simulation_Tool).</mixed-citation><mixed-citation xml:lang="en">Gewaltig Marc-Oliver, Diesmann Markus. NEST (NEural Simulation Tool) // Scholarpedia. 2007. V. 2, No 4. P. 1430. URL: http://www.scholarpedia.org/article/NEST_(NEural_Simulation_Tool).</mixed-citation></citation-alternatives></ref><ref id="cit48"><label>48</label><citation-alternatives><mixed-citation xml:lang="ru">Supercomputers Ready for Use as Discovery Machines for Neuroscience // Frontiers in Neuroinformatics. November 2012. V. 6. P. 1–12.</mixed-citation><mixed-citation xml:lang="en">Supercomputers Ready for Use as Discovery Machines for Neuroscience // Frontiers in Neuroinformatics. November 2012. V. 6. P. 1–12.</mixed-citation></citation-alternatives></ref><ref id="cit49"><label>49</label><citation-alternatives><mixed-citation xml:lang="ru">Diesmann M., Gewaltig M. NEST: an environment for neural systems simulations // Forschung und wissenschaftliches Rechnen, Beiträge zum Heinz-Billing-Preis. 2001. Bd. 58. S. 43–70.</mixed-citation><mixed-citation xml:lang="en">Diesmann M., Gewaltig M. NEST: an environment for neural systems simulations // Forschung und wissenschaftliches Rechnen, Beiträge zum Heinz-Billing-Preis. 2001. Bd. 58. S. 43–70.</mixed-citation></citation-alternatives></ref><ref id="cit50"><label>50</label><citation-alternatives><mixed-citation xml:lang="ru">Picard R.W. Affective Computing. MIT Press, 1997.</mixed-citation><mixed-citation xml:lang="en">Picard R.W. Affective Computing. MIT Press, 1997.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
