Hinge Loss
Last updated
Last updated
Definition: The correct score should be larger than a margin (1, and the 1 here actually can be arbitrary) than wrong scores
Min 0, Max infinity
At the initial time, every class scores should be similar and the expected loss is C-1 ( each wrong class has loss 1, and there are C - 1 wrong classes)
Squared hinge loss is different from hinge loss, squared loss focusing more on the bad cases, a small difference may have huge loss value at the end.