Beyond Reanalysis: Critical Issues in Data Reuse for Solid Tumor Proteomics

IRIS - Institutional Research Information System
IRIS è il sistema di gestione integrata dei dati della ricerca (persone, progetti, pubblicazioni, attività) adottato dall'Università degli Studi dell’Insubria.

IRInSubria - Institutional Repository Insubria
IRInSubria raccoglie, conserva, documenta e dissemina le informazioni sulla produzione scientifica dell'Università degli Studi dell’Insubria anche ai fini della valutazione della ricerca.

Proteomics represents a fundamental layer for understanding the molecular complexity of solid tumors by quantifying protein abundance and capturing proteoforms and posttranslational modifications undetected in genomics or transcriptomics analyses. As mass spectrometry-based technologies and public proteomics repositories have expanded, opportunities for large-scale data reuse have grown accordingly. Nevertheless, data availability has not been translated into straightforward reuse: differences in experimental design, acquisition strategies, quantification workflows and metadata quality still limit the reproducibility and cross-study comparability. In this review, proteomics data reuse is defined as the systematic reanalysis and integration of publicly available datasets to support precision oncology applications such as biomarker assessment and antibody–drug conjugate target prioritization. We discuss reuse as an end-to-end analytical process, focusing on data analysis workflows, harmonization strategies, and the impact of heterogeneous experimental and analytical choices on interoperability. The increased application of artificial intelligence in proteomics data integration and reuse is also addressed, highlighting its analytical potential while underscoring the risks of overinterpretation when biological context and data structure are not adequately considered. Using colorectal and prostate cancer as representative examples, we illustrate how proteomics data reuse can support biological discovery and translational research, while critically examining the factors that limit robustness and clinical relevance.

Beyond Reanalysis: Critical Issues in Data Reuse for Solid Tumor Proteomics

Franzetti, Federica^Co-primo;Giugni, Nicole^Co-primo;Airoldi, Manuel;Bondi, Heather;Alberio, Tiziana^Penultimo;Fasano, Mauro^Ultimo

2026-01-01

Abstract

Proteomics represents a fundamental layer for understanding the molecular complexity of solid tumors by quantifying protein abundance and capturing proteoforms and posttranslational modifications undetected in genomics or transcriptomics analyses. As mass spectrometry-based technologies and public proteomics repositories have expanded, opportunities for large-scale data reuse have grown accordingly. Nevertheless, data availability has not been translated into straightforward reuse: differences in experimental design, acquisition strategies, quantification workflows and metadata quality still limit the reproducibility and cross-study comparability. In this review, proteomics data reuse is defined as the systematic reanalysis and integration of publicly available datasets to support precision oncology applications such as biomarker assessment and antibody–drug conjugate target prioritization. We discuss reuse as an end-to-end analytical process, focusing on data analysis workflows, harmonization strategies, and the impact of heterogeneous experimental and analytical choices on interoperability. The increased application of artificial intelligence in proteomics data integration and reuse is also addressed, highlighting its analytical potential while underscoring the risks of overinterpretation when biological context and data structure are not adequately considered. Using colorectal and prostate cancer as representative examples, we illustrate how proteomics data reuse can support biological discovery and translational research, while critically examining the factors that limit robustness and clinical relevance.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2026
			
	Anno di pubblicazione online
	
				2026
			
	Rivista
	
				PROTEOMES
			
	Url
	
				https://www.mdpi.com/2227-7382/14/2/16
			
	DOI
	
				https://dx.doi.org/10.3390/proteomes14020016
			
	Codice PUBMED
	
				42029488
			
	Parole chiave
	
				proteomics data reuse; data harmonization; solid tumors; data standards; precision oncology; public repositories; proteoforms
			
	Tutti gli autori
	
						Franzetti, Federica; Giugni, Nicole; Airoldi, Manuel; Bondi, Heather; Alberio, Tiziana; Fasano, Mauro
					
	Appare nelle tipologie:
	
				Articolo su Rivista

File in questo prodotto:

File	Dimensione	Formato
proteomes-14-00016-v2.pdf accesso aperto Tipologia: Versione Editoriale (PDF) Licenza: Creative commons Dimensione 768.3 kB Formato Adobe PDF Visualizza/Apri	768.3 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11383/2212071

Attenzione

L'Ateneo sottopone a validazione solo i file PDF allegati

Citazioni

1

ND

ND

social impact