Penalizing Low-Rank Matrix Factorization: From theoretical connections to practical applications

Selicato L.
2025

Abstract

Low-rank (LR) factorization techniques aim to represent data in a low-dimensional space by identifying fundamental sources. Standard LR approaches often require additional constraints to account for real-world complexity, resulting in penalized low-rank matrix factorizations. These techniques incorporate penalties or regularization terms to improve robustness and adaptability to practical constraints, bridging theoretical research with real-world applications. This paper explores a nonnegativity-constrained low-rank decomposition technique, namely Nonnegative Matrix Factorization (NMF), and its constrained variants as powerful tools for analyzing nonnegative data. We cover theoretical foundations and practical implementations, review algorithms for standard NMF, and address challenges in setting hyperparameters for penalized variants. We emphasize applications in omics data analysis with a model that incorporates biological constraints to extract meaningful insights, and highlight applications in environmental data analysis.
Istituto di Ricerca Sulle Acque - IRSA - Sede Secondaria Bari
Low-rank matrix factorization
Penalty hyperparameter
Physical data model

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/557410