摘要:This paper studies the distribution of FFEs (fire fighting equipments) carried by UAVs (unmanned aerial vehicles) from FFUs (fire fighting units) under the background of multi-wave forest fire. The objective is to allocate the FFEs of each FFU to minimize the sum of the probabilities of each fire site’s unsuccessful extinguishment. In order to solve the multi-wave equipment distribution problem of the FFUs, a distributed reinforcement learning algorithm is designed in this paper. In the algorithm, agents cooperate to find the optimal distribution of FFEs based on information exchange, and a local Q-function is established for each agent to find the optimal FFE distribution combination. Simulation results demonstrate the effectiveness of the algorithm.