This paper introduces the POESIA internet filtering system, which is open-source, and which combines standard filtering methods, such as positive/negative URL lists, with more advanced techniques, such as image processing and NLP-enhanced text filtering. The description here focusses on components providing textual content filtering for three European languages (English, Italian and Spanish), employing NLP methods to enhance performance. We address also the acquisition of language data needed to develop these filters, and the evaluation of the system and its components.

NLP-enhanced Content filtering within the POESIA Project

Marchi S;Montemagni S;
2004

Abstract

This paper introduces the POESIA internet filtering system, which is open-source, and which combines standard filtering methods, such as positive/negative URL lists, with more advanced techniques, such as image processing and NLP-enhanced text filtering. The description here focusses on components providing textual content filtering for three European languages (English, Italian and Spanish), employing NLP methods to enhance performance. We address also the acquisition of language data needed to develop these filters, and the evaluation of the system and its components.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Hepple M it
dc.authority.people N Ireson it
dc.authority.people Allegrini P it
dc.authority.people Marchi S it
dc.authority.people Montemagni S it
dc.authority.people Gómez Hidalgo JM it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 17:50:48 -
dc.date.available 2024/02/19 17:50:48 -
dc.date.issued 2004 -
dc.description.abstracteng This paper introduces the POESIA internet filtering system, which is open-source, and which combines standard filtering methods, such as positive/negative URL lists, with more advanced techniques, such as image processing and NLP-enhanced text filtering. The description here focusses on components providing textual content filtering for three European languages (English, Italian and Spanish), employing NLP methods to enhance performance. We address also the acquisition of language data needed to develop these filters, and the evaluation of the system and its components. -
dc.description.affiliations Hepple M.N.Ireson: University of Sheffield Allegrini P.: ILC-CNR Gómez Hidalgo J.M.: Universidad Europea de Madrid -
dc.description.allpeople Hepple, M; N, Ireson; Allegrini, P; Marchi, S; Montemagni, S; Gómez Hidalgo, Jm -
dc.description.allpeopleoriginal Hepple M.; N. Ireson; Allegrini P.; Marchi S.; Montemagni S.; Gómez Hidalgo J.M. -
dc.description.fulltext none en
dc.description.numberofauthors 6 -
dc.identifier.isbn 2-9517408-1-6 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/64230 -
dc.identifier.url https://www.aclweb.org/anthology/L04-1507/ -
dc.language.iso eng -
dc.relation.alleditors Maria Teresa Lino, Maria Francisca Xavier, Fátima Ferreira, Rute Costa, Raquel Silva, -
dc.relation.conferencedate 26-28 May 2004 -
dc.relation.conferencename LREC 2004: Fourth International Conference on Language Resources and Evaluation -
dc.relation.conferenceplace Lisbona -
dc.relation.firstpage 1967 -
dc.relation.ispartofbook Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004) -
dc.relation.lastpage 1970 -
dc.relation.numberofpages 4 -
dc.subject.keywords Image processing -
dc.subject.keywords Natural language processing systems -
dc.subject.keywords Open systems -
dc.subject.singlekeyword Image processing *
dc.subject.singlekeyword Natural language processing systems *
dc.subject.singlekeyword Open systems *
dc.title NLP-enhanced Content filtering within the POESIA Project en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 84609 -
iris.orcid.lastModifiedDate 2024/04/04 23:31:47 *
iris.orcid.lastModifiedMillisecond 1712266307113 *
iris.scopus.extIssued 2004 -
iris.scopus.extTitle NLP-enhanced content filtering within the POESIA project -
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/64230
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact