Bonus: We DPO'ed Fuyu-Heavy on the public UltraFeedback dataset and got competitive results on MT-Bench and AlpacaEval 1.0! The resulting model also powers the chat demo you see here, underlining the balance between image and language modeling that Fuyu-Heavy achieves.
@code_monet What else can you reveal about the model architecture?