Uncertainty-Guided Optimization on Large Language Model Search Trees

Grosse, Julia; Wu, Ruotian; Rashid, Ahmad; Hennig, Philipp; Poupart, Pascal; Kristiadi, Agustinus

Computer Science > Machine Learning

arXiv:2407.03951v1 (cs)

[Submitted on 4 Jul 2024 (this version), latest version 9 Oct 2024 (v2)]

Title:Uncertainty-Guided Optimization on Large Language Model Search Trees

Authors:Julia Grosse, Ruotian Wu, Ahmad Rashid, Philipp Hennig, Pascal Poupart, Agustinus Kristiadi

View PDF HTML (experimental)

Abstract:Beam search is a standard tree search algorithm when it comes to finding sequences of maximum likelihood, for example, in the decoding processes of large language models. However, it is myopic since it does not take the whole path from the root to a leaf into account. Moreover, it is agnostic to prior knowledge available about the process: For example, it does not consider that the objective being maximized is a likelihood and thereby has specific properties, like being bound in the unit interval. Taking a probabilistic approach, we define a prior belief over the LLMs' transition probabilities and obtain a posterior belief over the most promising paths in each iteration. These beliefs are helpful to define a non-myopic Bayesian-optimization-like acquisition function that allows for a more data-efficient exploration scheme than standard beam search. We discuss how to select the prior and demonstrate in on- and off-model experiments with recent large language models, including Llama-2-7b, that our method achieves higher efficiency than beam search: Our method achieves the same or a higher likelihood while expanding fewer nodes than beam search.

Comments:	10 pages
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2407.03951 [cs.LG]
	(or arXiv:2407.03951v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.03951

Submission history

From: Julia Grosse [view email]
[v1] Thu, 4 Jul 2024 14:08:50 UTC (2,191 KB)
[v2] Wed, 9 Oct 2024 08:16:18 UTC (2,280 KB)

Computer Science > Machine Learning

Title:Uncertainty-Guided Optimization on Large Language Model Search Trees

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Uncertainty-Guided Optimization on Large Language Model Search Trees

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators