Why is MSE loss preferable over e.g. cosine distance?

rdisipio · February 7, 2022, 7:48pm

Hi,

Why is MSE loss preferable over e.g. cosine distance when it comes to training a downstream classifier?

Regards,
Riccardo

jamesbriggs · February 8, 2022, 1:49pm

Hey Riccardo, it isn’t necessarily preferable. If you can share more detail on the training context and the type of input data, we may be able to help more.