Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
reinforcement-learning pytorch knowledge-graph policy-gradient reward-shaping action-dropout multi-hop-reasoning
-
Updated
Oct 4, 2024 - Jupyter Notebook