Enhancing deep learning algorithm accuracy and stability using multicriteria optimization: an application to distributed learning with MNIST digits

IRIS - Institutional Research Information System
IRIS è il sistema di gestione integrata dei dati della ricerca (persone, progetti, pubblicazioni, attività) adottato dall'Università degli Studi dell’Insubria.

IRInSubria - Institutional Repository Insubria
IRInSubria raccoglie, conserva, documenta e dissemina le informazioni sulla produzione scientifica dell'Università degli Studi dell’Insubria anche ai fini della valutazione della ricerca.

The training phase is the most crucial stage during the machine learning process. In the case of labeled data and supervised learning, machine learning entails minimizing the loss function under various constraints. We provide an innovative model for learning with numerous data sets, resulting from the application of multicriteria optimization techniques to existing deep learning algorithms. Data fitting is formulated as a multicriteria model in which each criterion measures the data fitting error on a specific data set. This is an optimization model involving a vector-valued function, and it has to be analyzed using the notion of Pareto efficiency. We present stability results for efficient solutions in the presence of input and output data perturbations. The multiple data set environment comes into play to eliminate the bias caused by the selection of a specific training set. To apply this concept, we present a scalarization strategy as well as numerical experiments in digit classification using MNIST data.

Enhancing deep learning algorithm accuracy and stability using multicriteria optimization: an application to distributed learning with MNIST digits

La Torre D.;Liuzzi D.;Repetto M.;Rocca M.

2022-01-01

Abstract

The training phase is the most crucial stage during the machine learning process. In the case of labeled data and supervised learning, machine learning entails minimizing the loss function under various constraints. We provide an innovative model for learning with numerous data sets, resulting from the application of multicriteria optimization techniques to existing deep learning algorithms. Data fitting is formulated as a multicriteria model in which each criterion measures the data fitting error on a specific data set. This is an optimization model involving a vector-valued function, and it has to be analyzed using the notion of Pareto efficiency. We present stability results for efficient solutions in the presence of input and output data perturbations. The multiple data set environment comes into play to eliminate the bias caused by the selection of a specific training set. To apply this concept, we present a scalarization strategy as well as numerical experiments in digit classification using MNIST data.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Anno di pubblicazione online
	
				2022
			
	Rivista
	
				ANNALS OF OPERATIONS RESEARCH
			
	DOI
	
				https://dx.doi.org/10.1007/s10479-022-04833-x
			
	Codice Web of Science
	
				WOS:000825065000002
			
	Codice Scopus
	
				2-s2.0-85133888808
			
	Parole chiave
	
				Artificial intelligence; Deep learning; Machine learning; Multicriteria optimization; Classification; MINST data
			
	Tutti gli autori
	
						La Torre, D.; Liuzzi, D.; Repetto, M.; Rocca, M.
					
	Appare nelle tipologie:
	
				Articolo su Rivista

File in questo prodotto:

File	Dimensione	Formato
annals-LT-DL-MR-MR.pdf non disponibili Tipologia: Versione Editoriale (PDF) Licenza: Copyright dell'editore Dimensione 1.11 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.11 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11383/2140111

Citazioni

ND

3

1

social impact