Tuning Language Models by Proxy

Submitted on 16 Jan 2024
The approach is intuitive yet effective: it tunes a smaller LM, then applies the difference between the predictions of the small tuned and untuned LMs to shift the original predictions of the larger base model in the direction of tuning, while retaining the benefits of larger-scale pretraining.
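The prediction shift described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes the "predictions" are per-token logits over a vocabulary shared by all three models, and that the difference is added to the base model's logits at each decoding step before the softmax. The function names and the toy logit values are hypothetical.

```python
import math


def softmax(logits):
    """Convert a list of logits into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def proxy_tuned_distribution(base_logits, small_tuned_logits, small_untuned_logits):
    """Shift the base model's logits by (small tuned - small untuned).

    Assumption: all three lists score the same vocabulary in the same order.
    """
    shifted = [
        b + (t - u)
        for b, t, u in zip(base_logits, small_tuned_logits, small_untuned_logits)
    ]
    return softmax(shifted)


# Toy next-token logits over a 4-token vocabulary (made-up numbers).
base = [2.0, 1.0, 0.5, 0.0]           # large untuned base model
small_untuned = [1.5, 1.0, 0.5, 0.0]  # small model before tuning
small_tuned = [0.5, 2.5, 0.5, 0.0]    # small model after tuning

probs = proxy_tuned_distribution(base, small_tuned, small_untuned)
```

In this toy case the base model alone prefers token 0, while tuning moved the small model toward token 1; after the shift, the combined distribution also prefers token 1. Only the small model is ever tuned, and the large model is used purely for inference.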
