Graph Neural Network Based Action Ranking for Planning

Rajesh Mangananvar             Stefan Lee             Alan Fern             Prasad Tadepalli            

Oregon State University Logo

Shifts from learning global value functions to ranking local actions—a simpler, graph-based approach that generalizes from small training examples to much larger real-world problems, achieving 89% success on tasks 8× larger than training data (vs. 6.5% for value-based methods).

Abstract

We propose a novel approach to learn relational policies for classical planning based on learning to rank actions. We introduce a new graph representation that explicitly captures action information and propose a Graph Neural Network (GNN) architecture augmented with Gated Recurrent Units (GRUs) to learn action rankings. Unlike value-function based approaches that must learn a globally consistent function, our action ranking method only needs to learn locally consistent ranking. Our model is trained on data generated from small problem instances that are easily solved by planners and is applied to significantly larger instances where planning is computationally prohibitive. Experimental results across standard planning benchmarks demonstrate that our action-ranking approach not only achieves better generalization to larger problems than those used in training but also outperforms multiple baselines (value function and action ranking) methods in terms of success rate and plan quality.

Why Learning for Planning? [Expand All]


The Problem with Existing Approaches [Expand All]


GABAR's Key Insight [Expand All]

GABAR Graph Construction Example

Method Overview [Expand All]

GABAR Framework

Results [Expand All]

Table 2 (from paper): Main results showing GABAR vs. Baselines and Ablations.
Domain Diff Baselines GABAR Ablations
GPL ASNets GRAPL GABAR GABAR-ACT GABAR-CD GABAR-RANK
C↑P↑ C↑P↑ C↑P↑ C↑P↑ C↑P↑ C↑P↑ C↑P↑
BlocksE 1001.1 1001.6 640.65 1001.5 440.65 1000.92 290.79
M 450.68 1001.5 480.44 1001.6 140.49 920.81 210.71
H 100.33 921.4 380.28 1001.7 40.35 810.80 90.61
MiconicE 970.97 1001.0 680.56 1001.0 350.55 970.88 420.67
M 370.56 1000.98 650.54 1000.97 180.33 940.86 290.37
H 190.29 900.92 600.49 1000.95 20.27 880.83 160.29
SpannerE 731.1 780.86 220.65 941.1 310.65 870.98 570.82
M 420.56 600.69 50.55 930.99 110.27 810.93 420.77
H 30.18 420.61 0- 890.91 0- 620.79 120.45
GripperE 1001.0 780.98 260.95 1001.1 310.56 951.0 550.58
M 560.85 540.91 120.67 1000.99 230.40 920.93 430.41
H 210.74 420.88 0- 1000.96 90.28 870.86 210.33
VisitallE 691.3 940.96 921.1 931.1 721.2 911.1 520.64
M 150.76 860.93 881.0 911.0 640.93 891.1 460.56
H 00 640.81 780.99 881.1 440.67 831.2 390.54
GridE 740.89 520.81 200.38 1000.91 210.56 790.87 170.54
M 170.61 450.66 30.28 970.85 80.46 710.65 120.28
H 00 210.60 0- 920.74 0- 540.53 0-
LogisticsE 560.61 390.71 320.81 900.75 120.64 310.86 410.65
M 70.21 220.55 90.45 760.65 30.49 250.54 210.49
H 00 40.39 0- 710.59 0- 60.35 0-
RoversE 640.99 670.96 210.35 871.0 220.75 440.81 330.67
M 90.32 560.87 50.19 820.96 60.66 370.63 90.56
H 00 310.64 0- 770.97 0- 190.57 0-
CombinedE 79.10.98 760.98 43.50.67 95.51.04 33.50.69 780.93 40.20.67
M 28.50.56 65.40.88 29.30.51* 92.21.01 18.40.50 72.70.80 27.80.51
H 6.50.39* 48.50.78 22.10.58* 89.20.99 7.40.39* 600.73 12.10.44*
Table 4 (from paper): GABAR substantially outperforms SOTA LLMs, which fail at complex planning.
Domain Diff OpenAI-O3 Gemini-2.5-Pro GABAR
C↑P↑ C↑P↑ C↑P↑
BlocksE 731.03 811.1 1001.5
M 410.95 470.86 1001.6
H 40.61 120.81 1001.7
MiconicE 560.81 790.86 1001.0
M 120.69 360.58 1000.97
H 0- 120.51 1000.95
SpannerE 380.81 420.75 941.1
M 130.77 100.64 930.99
H 0- 0- 890.91
CombinedE 33.40.85 44.00.8 95.51.04
M 11.60.77* 17.10.68* 92.21.01
H 0.40.61* 1.50.51* 89.20.99

Key Findings from Ablations [Expand All]


Technical Details [Expand All]


Why This Matters [Expand All]

BibTeX


      @inproceedings{mangannavargraph,
        title={Graph Neural Network Based Action Ranking for Planning},
        author={Mangannavar, Rajesh Devaraddi and Lee, Stefan and Fern, Alan and Tadepalli, Prasad},
        booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems}
      }