Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Attention-based Reinforcement Learning for Combinatorial Optimization: Application to Job Shop Scheduling Problem (2401.16580v2)

Published 29 Jan 2024 in cs.AI

Abstract: Job shop scheduling problems represent a significant and complex facet of combinatorial optimization problems, which have traditionally been addressed through either exact or approximate solution methodologies. However, the practical application of these solutions is often challenged due to the complexity of real-world problems. Even when utilizing an approximate solution approach, the time required to identify a near-optimal solution can be prohibitively extensive, and the solutions derived are generally not applicable to new problems. This study proposes an innovative attention-based reinforcement learning method specifically designed for the category of job shop scheduling problems. This method integrates a policy gradient reinforcement learning approach with a modified transformer architecture. A key finding of this research is the ability of our trained learners within the proposed method to be repurposed for larger-scale problems that were not part of the initial training set. Furthermore, empirical evidence demonstrates that our approach surpasses the results of recent studies and outperforms commonly implemented heuristic rules. This suggests that our method offers a promising avenue for future research and practical application in the field of job shop scheduling problems.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (46)
  1. The shifting bottleneck procedure for job shop scheduling. Management science, 34(3):391–401.
  2. An actor-critic algorithm for sequence prediction. arXiv preprint arXiv:1607.07086.
  3. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473.
  4. Neural combinatorial optimization with reinforcement learning.
  5. An efficient approach to job shop scheduling problem using simulated annealing. International Journal of Hybrid Information Technology, 8(11):273–284.
  6. A deep reinforcement learning framework based on an attention mechanism and disjunctive graph embedding for the job-shop scheduling problem. IEEE Transactions on Industrial Informatics, 19(2):1322–1331.
  7. Accelerated simulated annealing algorithm applied to the flexible job shop scheduling problem. International Transactions in Operational Research, 24(5):1119–1137.
  8. Learning heuristics for the tsp by policy gradient. In Integration of Constraint Programming, Artificial Intelligence, and Operations Research: 15th International Conference, CPAIOR 2018, Delft, The Netherlands, June 26–29, 2018, Proceedings 15, pages 170–181. Springer.
  9. Solving fuzzy job-shop scheduling problem using de algorithm improved by a selection mechanism. IEEE Transactions on Fuzzy Systems, 28(12):3265–3275.
  10. A hybrid genetic algorithm for the job shop scheduling problem. European journal of operational research, 167(1):77–95.
  11. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International conference on machine learning, pages 1861–1870. PMLR.
  12. IBM Decision Optimization (Accessed 2024). Constraint programming modeling for python (docplex.cp). DOcplex.CP: Constraint Programming Modeling for Python V2.25 documentation.
  13. Reactive tabu search for job-shop scheduling problems. In 2016 11th International Conference on Computer Science & Education (ICCSE), pages 97–102. IEEE.
  14. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
  15. Attention, learn to solve routing problems! arXiv preprint arXiv:1803.08475.
  16. Mixed integer programming models for job shop scheduling: A computational analysis. Computers & Operations Research, 73:165–173.
  17. Pomo: Policy optimization with multiple optima for reinforcement learning. Advances in Neural Information Processing Systems, 33:21188–21198.
  18. Matrix encoding networks for neural combinatorial optimization. Advances in Neural Information Processing Systems, 34:5138–5149.
  19. Li, S. E. (2023). Deep reinforcement learning. In Reinforcement Learning for Sequential Decision and Optimal Control, pages 365–402. Springer.
  20. Actor-critic deep reinforcement learning for solving job shop scheduling problems. Ieee Access, 8:71752–71762.
  21. Combinatorial optimization by graph pointer networks and hierarchical reinforcement learning. arXiv preprint arXiv:1911.04936.
  22. Applying deep learning and reinforcement learning to traveling salesman problem. In 2018 international conference on computing, electronics & communications engineering (ICCECE), pages 65–70. IEEE.
  23. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602.
  24. A guided local search with iterative ejections of bottleneck operations for the job shop scheduling problem. Computers & Operations Research, 90:60–71.
  25. Reinforcement learning for solving the vehicle routing problem. Advances in neural information processing systems, 31.
  26. A hybrid genetic algorithm for the job shop scheduling problems. Computers & industrial engineering, 45(4):597–613.
  27. Schedulenet: Learn to solve multi-agent scheduling problems with reinforcement learning. arXiv preprint arXiv:2106.03051.
  28. Learning to schedule job-shop problems: representation and policy learning using graph neural network and reinforcement learning. International Journal of Production Research, 59(11):3360–3377.
  29. A column generation-based approach for proportionate flexible two-stage no-wait job shop scheduling. International journal of production research, 58(2):487–508.
  30. A tabu search/path relinking algorithm to solve the job shop scheduling problem. Computers & Operations Research, 53:154–164.
  31. Learning to locomote: Understanding how environment design matters for deep reinforcement learning. In Proceedings of the 13th ACM SIGGRAPH Conference on Motion, Interaction and Games, pages 1–10.
  32. High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438.
  33. Hybrid algorithm based on improved extended shifting bottleneck procedure and ga for assembly job shop scheduling problem. International Journal of Production Research, 58(9):2604–2625.
  34. Shylo, O. (2010). Job shop scheduling problem instances.
  35. Reinforcement learning: An introduction. MIT press.
  36. A reinforcement learning environment for job-shop scheduling. arXiv preprint arXiv:2104.03760.
  37. Attention is all you need. Advances in neural information processing systems, 30.
  38. Pointer networks. Advances in neural information processing systems, 28.
  39. A two-stage optimization method for energy-saving flexible job-shop scheduling based on energy dynamic characterization. Journal of Cleaner Production, 188:575–588.
  40. Review on flexible job shop scheduling. IET Collaborative Intelligent Manufacturing, 1(3):67–77.
  41. Reinforcement learning with multiple relational attention for solving vehicle routing problems. IEEE Transactions on Cybernetics, 52(10):11107–11120.
  42. Mini-batch algorithms with online step size. Knowledge-Based Systems, 165:228–240.
  43. Solving job shop scheduling problems via deep reinforcement learning. Applied Soft Computing, 143:110436.
  44. Learning to dispatch for job shop scheduling via deep reinforcement learning. Advances in Neural Information Processing Systems, 33:1621–1632.
  45. Review of job shop scheduling research and its new perspectives under industry 4.0. Journal of Intelligent Manufacturing, 30(4):1809–1830.
  46. Flexible assembly job-shop scheduling with sequence-dependent setup times and part sharing in a dynamic environment: Constraint programming model, mixed-integer programming model, and dispatching rules. IEEE Transactions on Engineering Management, 65(3):487–504.
Citations (3)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets