Open-Ethical AI: Advancements in Open-Source Human-Centric Neural Language Models

IRIS - Institutional Research Information System
IRIS è il sistema di gestione integrata dei dati della ricerca (persone, progetti, pubblicazioni, attività) adottato dall'Università degli Studi dell’Insubria.

IRInSubria - Institutional Repository Insubria
IRInSubria raccoglie, conserva, documenta e dissemina le informazioni sulla produzione scientifica dell'Università degli Studi dell’Insubria anche ai fini della valutazione della ricerca.

This survey summarises the most recent methods for building and assessing helpful, honest, and harmless neural language models, considering small, medium, and large-size models. Pointers to open-source resources that help to align pre-trained models are given, including methods that use parameter-efficient techniques, specialized prompting frameworks, adapter modules, case-specific knowledge injection, and adversarially robust training techniques. Special care is given to evidencing recent progress on value alignment, commonsense reasoning, factuality enhancement, and abstract reasoning of language models. Most reviewed works in this survey publicly shared their code and related data and were accepted in world-leading Machine Learning venues. This work aims at helping researchers and practitioners accelerate their entrance into the field of human-centric neural language models, which might be a cornerstone of the contemporary and near-future industrial and societal revolution.

Open-Ethical AI: Advancements in Open-Source Human-Centric Neural Language Models

Sabrina Sicari^Primo;Jesus F. Cevallos M.^Secondo;Alessandra Rizzardi^Penultimo;Alberto Coen-Porisini^Ultimo

2024-01-01

Abstract

This survey summarises the most recent methods for building and assessing helpful, honest, and harmless neural language models, considering small, medium, and large-size models. Pointers to open-source resources that help to align pre-trained models are given, including methods that use parameter-efficient techniques, specialized prompting frameworks, adapter modules, case-specific knowledge injection, and adversarially robust training techniques. Special care is given to evidencing recent progress on value alignment, commonsense reasoning, factuality enhancement, and abstract reasoning of language models. Most reviewed works in this survey publicly shared their code and related data and were accepted in world-leading Machine Learning venues. This work aims at helping researchers and practitioners accelerate their entrance into the field of human-centric neural language models, which might be a cornerstone of the contemporary and near-future industrial and societal revolution.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Anno di pubblicazione online
	
				2024
			
	Rivista
	
				ACM COMPUTING SURVEYS
			
	Url
	
				https://dl.acm.org/doi/full/10.1145/3703454
			
	DOI
	
				https://dx.doi.org/10.1145/3703454
			
	Codice Web of Science
	
				WOS:001402535800005
			
	Codice Scopus
	
				2-s2.0-85213442073
			
	Parole chiave
	
				Neural language models, open-source, large-language models, humancentric AI
			
	Tutti gli autori
	
						Sicari, Sabrina; F. Cevallos M., Jesus; Rizzardi, Alessandra; Coen-Porisini, Alberto
					
	Appare nelle tipologie:
	
				Articolo su Rivista

File in questo prodotto:

File	Dimensione	Formato
2024_SurveyEthics.pdf accesso aperto Tipologia: Versione Editoriale (PDF) Licenza: Creative commons Dimensione 1.83 MB Formato Adobe PDF Visualizza/Apri	1.83 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11383/2185892

Attenzione

L'Ateneo sottopone a validazione solo i file PDF allegati

Citazioni

ND

8

3

social impact