Rooted Absorbed Prefix Trajectory Balance with Submodular Replay for GFlowNet Training
arXiv, 2026
Mode collapse remains a critical challenge in training Generative Flow Networks (GFlowNets), especially when fine-tuning language models for molecular generation. We propose Rooted Absorbed Prefix Trajectory Balance, a new training objective that strengthens early-stage learning signals, and pair it with a submodular replay strategy that promotes diversity. Our approach improves both sample quality and mode coverage on molecular generation tasks.
Wang, X., Lu, W., & Wang, S. arXiv preprint arXiv:2603.00454, 2026.