Tuning Language Models by Proxy

#machine-learning/large-model

Tuning Language Models by Proxy

Submitted on 16 Jan 2024 Tuning Language Models by Proxy
intuitive yet effective

Proxy-tuning: A method to customize large language models without accessing their internal weights, by using smaller tuned models as experts and anti-experts.
Premise: same dictionary

tunes a smaller LM, then applies the difference between the predictions of the small tuned and untuned LMs to shift the original predictions of the base model in the direction of tuning, while retaining the benefits of larger-scale pretraining.