Sohyun An
SFT
A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design
Supervised fine-tuning (SFT) typically maximizes the likelihood of every token in a demonstrated trajectory. However, an observed token …
Tong Xie, Yuanhao Ban, Yunqi Hong, Sohyun An, Yihang Chen, Cho-Jui Hsieh