CNR Institutional Research Information System

In this work we address the problem of predicting proteinprotein interactions. Its solution can give greater insight in the study of complex diseases, like cancer, and provides valuable information in the study of active small molecules for new drugs, limiting the number of molecules to be tested in laboratory. We model the problem as a binary classification task, using a suitable coding of the amino acid sequences. We apply k-Nearest Neighbors classification algorithm to the classes of interacting and noninteracting proteins. Results show that it is possible to achieve high prediction accuracy in cross validation. A case study is analyzed to show it is possible to reconstruct a real network of thousands interacting proteins with high accuracy on standard hardware.

Predicting protein-protein interactions with k-Nearest Neighbors classification algorithm

Mario Rosario Guarracino;Adriano Nebbia

2010

Abstract

In this work we address the problem of predicting proteinprotein interactions. Its solution can give greater insight in the study of complex diseases, like cancer, and provides valuable information in the study of active small molecules for new drugs, limiting the number of molecules to be tested in laboratory. We model the problem as a binary classification task, using a suitable coding of the amino acid sequences. We apply k-Nearest Neighbors classification algorithm to the classes of interacting and noninteracting proteins. Results show that it is possible to achieve high prediction accuracy in cross validation. A case study is analyzed to show it is possible to reconstruct a real network of thousands interacting proteins with high accuracy on standard hardware.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2010
			
	Strutture organizzative
	
				Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
			
	Lingua/e
	
				Inglese
			
	Supervisori e coordinatori esterni
	
				Francesco Masulli; Leif E. Peterson; Roberto Tagliaferri
			
	Titolo del Volume
	
				Computational Intelligence Methods for Bioinformatics and Biostatistics
			
	Titolo del convegno
	
				6th International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB 2009)
			
	Da pagina
	
				139
			
	A pagina
	
				150
			
	Codice ISBN
	
				978-3-642-14570-4
			
	Codice DOI
	
				https://dx.doi.org/10.1007/978-3-642-14571-1_10
			
	Nome Editore
	
				Springer
Springer-Verlag
			
	Città Editore
	
				New York
Berlin Heidelberg
			
	Nazione Editore
	
				STATI UNITI D'AMERICA
GERMANIA
			
	Periodo del Convegno
	
				2009
			
	Luogo del Convegno
	
				Genova
			
	Parole chiave
	
				Protein-protein interaction prediction
conjoint-triad method
k-Nearest Neighbors
binary classification
			
	Codice Scopus
	
				2-s2.0-77955795053
			
	Codice Web of Science
	
				WOS:000285734400010
			
	Numero autori
	
				1
			
	Fulltext
	
				none
			
	Tutti gli autori
	
						Mario Rosario Guarracino ; Adriano Nebbia
					
	Tipologia Login Miur
	
				273
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				04 Contributo in convegno::04.01 Contributo in Atti di convegno
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/138222

Citazioni

ND

0

7

social impact