Newton Losses: Using Curvature Information for Learning with Differentiable Algorithms

16 Sept 2023 (modified: 11 Feb 2024)Submitted to ICLR 2024EveryoneRevisionsBibTeX
Primary Area: neurosymbolic & hybrid AI systems (physics-informed, logic & formal reasoning, etc.)
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.
Keywords: differentiable, differentiable algorithms, second-order, newton, algorithmic losses, sorting, ranking, differentiable sorting, differentiable ranking, sort, rank, shortest-path, dijkstra
Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2024/AuthorGuide.
Abstract: In many supervised learning problems, model predictions are compared to ground truth labels using simple convex losses such as softmax cross-entropy and squared error. In weakly-supervised learning, more complex losses involving problem-specific algorithmic procedures and knowledge, e.g., differentiable shortest-path computations or sorting algorithms, are common. These losses can be hard to optimize as they are non-convex in the model output and may exhibit vanishing and exploding gradients. To alleviate this issue, we present Newton Losses, a method for boosting the performance of non-convex and hard to optimize losses by locally approximating an existing loss function with a quadratic (incorporating second-order information). As Newton Losses only replaces the loss function, the method allows training the neural network with gradient descent and is computationally efficient. We apply Newton Losses to eight differentiable algorithm methods for the multi-digit MNIST sorting benchmark and the Warcraft shortest-path benchmark.
Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors' identity.
No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.
Submission Number: 555
Loading