::: Amortizing Intractable Inference in Large Language Models
- using amortized Bayesian inference to sample from these intractable posteriors.
- fine-tuning LLMs via diversity-seeking reinforcement learning algorithms: generative flow networks (GFlowNets).
- empirical evaluation
presents a new technique for fine-tuning LLMs, to perform amortized inference in probabilistic models