The training phase is the most crucial stage during the machine learning process. In the case of labeled data and supervised learning, machine learning entails minimizing the loss function under various constraints. We provide an innovative model for learning with numerous data sets, resulting from the application of multicriteria optimization techniques to existing deep learning algorithms. Data fitting is formulated as a multicriteria model in which each criterion measures the data fitting error on a specific data set. This is an optimization model involving a vector-valued function, and it has to be analyzed using the notion of Pareto efficiency. We present stability results for efficient solutions in the presence of input and output data perturbations. The multiple data set environment comes into play to eliminate the bias caused by the selection of a specific training set. To apply this concept, we present a scalarization strategy as well as numerical experiments in digit classification using MNIST data.

Enhancing deep learning algorithm accuracy and stability using multicriteria optimization: an application to distributed learning with MNIST digits

Rocca M.
2022-01-01

Abstract

The training phase is the most crucial stage during the machine learning process. In the case of labeled data and supervised learning, machine learning entails minimizing the loss function under various constraints. We provide an innovative model for learning with numerous data sets, resulting from the application of multicriteria optimization techniques to existing deep learning algorithms. Data fitting is formulated as a multicriteria model in which each criterion measures the data fitting error on a specific data set. This is an optimization model involving a vector-valued function, and it has to be analyzed using the notion of Pareto efficiency. We present stability results for efficient solutions in the presence of input and output data perturbations. The multiple data set environment comes into play to eliminate the bias caused by the selection of a specific training set. To apply this concept, we present a scalarization strategy as well as numerical experiments in digit classification using MNIST data.
2022
2022
Artificial intelligence; Deep learning; Machine learning; Multicriteria optimization; Classification; MINST data
La Torre, D.; Liuzzi, D.; Repetto, M.; Rocca, M.
File in questo prodotto:
File Dimensione Formato  
annals-LT-DL-MR-MR.pdf

non disponibili

Tipologia: Versione Editoriale (PDF)
Licenza: Copyright dell'editore
Dimensione 1.11 MB
Formato Adobe PDF
1.11 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11383/2140111
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact