::: Amortizing Intractable Inference in Large Language Models

using amortized Bayesian inference to sample from these intractable posteriors.
fine-tuning LLMs via diversity-seeking reinforcement learning algorithms: generative flow networks (GFlowNets).
empirical evaluation

presents a new technique for fine-tuning LLMs, to perform amortized inference in probabilistic models