L2S¶
L2S proposes a DRL-based improvement heuristic for JSSP. It encodes complete solutions as disjunctive graphs, leverages a dual-module GNN (TPM for topology, CAM for job–machine semantics), and trains a policy via n-step REINFORCE to generate swap operations without full neighborhood evaluation.
Usage¶
You can run the following command lines to execute the code.
python train.py/eval.py settings=l2s_settings mode={train/test} problem=jssp num_job={num_job} num_machine={num_machine}
decoder_strategy={sampling/greedy}
Tasks¶
Supported Tasks: JSSP.
Required Data Generator:
Training¶
L2s applies autoregressive reinforcement learning class ARREINFORCELightning() for training.