Combining Reinforcement Learning Algorithm and Genetic Algorithm to Solve the Traveling Salesman Problem

Yaqi Ruan; Weihong Cai; Jiaying Wang

doi:10.22541/au.171623645.58126524/v1

loading page

Combining Reinforcement Learning Algorithm and Genetic Algorithm to Solve the Traveling Salesman Problem

Yaqi Ruan,
Weihong Cai,
Jiaying Wang

Abstract

With the growing recognition of the unique advantages of reinforcement learning and genetic algorithms in addressing combinatorial optimization problems, this study aims to integrate these two methods to collectively tackle the classic combinatorial optimization challenge of the Traveling Salesman Problem (TSP). The Traveling Salesman Problem (TSP) stands as a quintessential combinatorial optimization challenge, tasked with determining the shortest path among designated cities. This paper introduces an innovative approach by amalgamating reinforcement learning’s path selection prowess with genetic algorithms’ global search strategy, aiming to uncover superior solutions in TSP. Specifically, the experiment employs a dual Q-learning algorithm within reinforcement learning to identify multiple optimal paths, serving as progenitors for the genetic algorithm to further enhance performance. The paper meticulously outlines the problem modeling process, elucidating TSP instance definitions, descriptions, and precise objective function definitions. Experimental findings underscore the substantial enhancements achievable in TSP optimization through this comprehensive approach, offering a fresh perspective and methodology for tackling combinatorial optimization challenges.