In this work we address the problem of predicting proteinprotein interactions. Its solution can give greater insight in the study of complex diseases, like cancer, and provides valuable information in the study of active small molecules for new drugs, limiting the number of molecules to be tested in laboratory. We model the problem as a binary classification task, using a suitable coding of the amino acid sequences. We apply k-Nearest Neighbors classification algorithm to the classes of interacting and noninteracting proteins. Results show that it is possible to achieve high prediction accuracy in cross validation. A case study is analyzed to show it is possible to reconstruct a real network of thousands interacting proteins with high accuracy on standard hardware.

Predicting protein-protein interactions with k-Nearest Neighbors classification algorithm

Mario Rosario Guarracino;
2010

Abstract

In this work we address the problem of predicting proteinprotein interactions. Its solution can give greater insight in the study of complex diseases, like cancer, and provides valuable information in the study of active small molecules for new drugs, limiting the number of molecules to be tested in laboratory. We model the problem as a binary classification task, using a suitable coding of the amino acid sequences. We apply k-Nearest Neighbors classification algorithm to the classes of interacting and noninteracting proteins. Results show that it is possible to achieve high prediction accuracy in cross validation. A case study is analyzed to show it is possible to reconstruct a real network of thousands interacting proteins with high accuracy on standard hardware.
2010
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Inglese
Francesco Masulli; Leif E. Peterson; Roberto Tagliaferri
Computational Intelligence Methods for Bioinformatics and Biostatistics
6th International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB 2009)
139
150
978-3-642-14570-4
Springer
Springer-Verlag
New York
Berlin Heidelberg
STATI UNITI D'AMERICA
GERMANIA
2009
Genova
Protein-protein interaction prediction
conjoint-triad method
k-Nearest Neighbors
binary classification
1
none
Mario Rosario Guarracino ; Adriano Nebbia
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/138222
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 7
social impact