When you doubt, abstain: From misclassification to epoché in automatic text categorisation

IRIS - Institutional Research Information System
IRIS è il sistema di gestione integrata dei dati della ricerca (persone, progetti, pubblicazioni, attività) adottato dall'Università degli Studi dell’Insubria.

IRInSubria - Institutional Repository Insubria
IRInSubria raccoglie, conserva, documenta e dissemina le informazioni sulla produzione scientifica dell'Università degli Studi dell’Insubria anche ai fini della valutazione della ricerca.

This paper describes how natural language processing and ontologies are exploited for automatic text categorisation. The approach introduced is part of the MANENT system, an infrastructure for integrating, structuring and searching Digital Libraries. The procedure of structural information extraction, and of the automatic classification of the records according to natural language understanding and theWordNet Domains taxonomy is discussed. A comparison between two versions of the classification algorithm is conducted and the improvements of the new approach are articulated. In particular, using semantic connections between words refines the classification results while reducing misclassification to non classification. © 2011 IEEE.

When you doubt, abstain: From misclassification to epoché in automatic text categorisation

Locoro A.;Grignani D.;Mascardi V.

2011-01-01

Abstract

This paper describes how natural language processing and ontologies are exploited for automatic text categorisation. The approach introduced is part of the MANENT system, an infrastructure for integrating, structuring and searching Digital Libraries. The procedure of structural information extraction, and of the automatic classification of the records according to natural language understanding and theWordNet Domains taxonomy is discussed. A comparison between two versions of the classification algorithm is conducted and the improvements of the new approach are articulated. In particular, using semantic connections between words refines the classification results while reducing misclassification to non classification. © 2011 IEEE.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2011
			
	Titolo del volume
	
				Proceedings - 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2011
			
	ISBN
	
				9781457713736
			
	Titolo del congresso
	
				2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2011
			
	Luogo del Congresso
	
				Lyon, fra
			
	Data del Congresso
	
				2011
			
	Appare nelle tipologie:
	
				Relazione (in Volume)

File in questo prodotto:

File	Dimensione	Formato
30_nlpoe_06040842.pdf non disponibili Tipologia: Versione Editoriale (PDF) Licenza: DRM non definito Dimensione 212.79 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	212.79 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11383/2119328

Citazioni

ND

2

ND

social impact