Proposed FPGA hardware architecture for the Q-learning reinforcement learning technique

Author: Silva, Lucileide Medeiros Dantas da
Contributor: Fernandes, Marcelo Augusto Costa
Dates: 2017-03-22; 2017-03-22; 2016-11-18
Citation: SILVA, Lucileide Medeiros Dantas da. Proposta de arquitetura em Hardware para FPGA da técnica Qlearning de aprendizagem por reforço. 2016. 72f. Dissertação (Mestrado em Engenharia Elétrica e de Computação) - Centro de Tecnologia, Universidade Federal do Rio Grande do Norte, Natal, 2016.
URI: https://repositorio.ufrn.br/jspui/handle/123456789/22395

Abstract: Q-learning is an off-policy reinforcement learning technique whose main advantage is that it can obtain an optimal policy by interacting with an environment whose model is unknown. This work proposes a parallel fixed-point architecture for the Q-learning algorithm, implemented on an FPGA. Optimizing the system's processing time is central to this approach. Convergence results are presented. Processing time and occupied area are analyzed for different scenarios and for various fixed-point formats. Implementation details of the architecture are described. The entire project was developed using the System Generator platform (Xilinx), with a Virtex-6 xc6vcx240t-1ff1156 as the target FPGA.

Access: Open Access
Keywords: FPGA; Q-learning; Reinforcement learning; Hardware
Type: masterThesis
Subject area: CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA E DE COMPUTAÇÃO