This paper presents both the general model and a case study of the Computational and Collaborative Philology Library (CoPhiLib), an ongoing initiative underway at the Institute for Computational Linguistics (ILC) of the National Research Council (CNR), Pisa, Italy. The library, designed and organized as a reusable, abstract and open-source software component, aims at solving the needs of multi-lingual and cross-lingual analysis by exposing common Application Programming Interfaces (APIs). The core modules, coded by the Java programming language, constitute the groundwork of a Web platform designed to deal with textual scholarly needs. The Web application, implemented according to the Java Enterprise specifications, focuses on multi-layered analysis for the study of literary documents and related multimedia sources. This ambitious challenge seeks to obtain the management of textual resources, on the one hand by abstracting from current language, on the other hand by decoupling from the specific requirements of single projects. This goal is achieved thanks to methodologies declared by the "agile process", and by putting into effect suitable use case modeling, design patterns, and component-based architectures. The reusability and flexibility of the system have been tested on an Arabic case study: the system allows users to choose the morphological engine (such as AraMorph or Al-Khalil), along with linguistic granularity (i.e. with or without declension). Finally, the application enables the construction of annotated resources for further statistical engines (training set).

Towards a flexible open-source software library for multi-layered scholarly textual studies: An Arabic case study dealing with semi-automatic language processing

Del Grosso A M
;
Nahli O
2014

Abstract

This paper presents both the general model and a case study of the Computational and Collaborative Philology Library (CoPhiLib), an ongoing initiative underway at the Institute for Computational Linguistics (ILC) of the National Research Council (CNR), Pisa, Italy. The library, designed and organized as a reusable, abstract and open-source software component, aims at solving the needs of multi-lingual and cross-lingual analysis by exposing common Application Programming Interfaces (APIs). The core modules, coded by the Java programming language, constitute the groundwork of a Web platform designed to deal with textual scholarly needs. The Web application, implemented according to the Java Enterprise specifications, focuses on multi-layered analysis for the study of literary documents and related multimedia sources. This ambitious challenge seeks to obtain the management of textual resources, on the one hand by abstracting from current language, on the other hand by decoupling from the specific requirements of single projects. This goal is achieved thanks to methodologies declared by the "agile process", and by putting into effect suitable use case modeling, design patterns, and component-based architectures. The reusability and flexibility of the system have been tested on an Arabic case study: the system allows users to choose the morphological engine (such as AraMorph or Al-Khalil), along with linguistic granularity (i.e. with or without declension). Finally, the application enables the construction of annotated resources for further statistical engines (training set).
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Del Grosso A M en
dc.authority.people Nahli O en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/20 12:34:32 -
dc.date.available 2024/02/20 12:34:32 -
dc.date.firstsubmission 2024/07/02 15:13:06 *
dc.date.issued 2014 -
dc.date.submission 2024/07/02 15:13:06 *
dc.description.abstracteng This paper presents both the general model and a case study of the Computational and Collaborative Philology Library (CoPhiLib), an ongoing initiative underway at the Institute for Computational Linguistics (ILC) of the National Research Council (CNR), Pisa, Italy. The library, designed and organized as a reusable, abstract and open-source software component, aims at solving the needs of multi-lingual and cross-lingual analysis by exposing common Application Programming Interfaces (APIs). The core modules, coded by the Java programming language, constitute the groundwork of a Web platform designed to deal with textual scholarly needs. The Web application, implemented according to the Java Enterprise specifications, focuses on multi-layered analysis for the study of literary documents and related multimedia sources. This ambitious challenge seeks to obtain the management of textual resources, on the one hand by abstracting from current language, on the other hand by decoupling from the specific requirements of single projects. This goal is achieved thanks to methodologies declared by the "agile process", and by putting into effect suitable use case modeling, design patterns, and component-based architectures. The reusability and flexibility of the system have been tested on an Arabic case study: the system allows users to choose the morphological engine (such as AraMorph or Al-Khalil), along with linguistic granularity (i.e. with or without declension). Finally, the application enables the construction of annotated resources for further statistical engines (training set). -
dc.description.affiliations Institute for Computational Linguistics - National Research Council (ILC-CNR, Pisa) -
dc.description.allpeople DEL GROSSO, ANGELO MARIO; Nahli, O -
dc.description.allpeopleoriginal Del Grosso A. M.; Nahli O. en
dc.description.fulltext none en
dc.description.numberofauthors 2 -
dc.identifier.doi 10.1109/CIST.2014.7016633 en
dc.identifier.isbn 978-1-4799-5978-5 en
dc.identifier.scopus 2-s2.0-84938086244 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/221676 -
dc.identifier.url http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=7016633&queryText=del+grosso+philology en
dc.language.iso eng en
dc.miur.last.status.update 2024-07-02T13:19:26Z *
dc.publisher.country USA en
dc.publisher.name IEEE en
dc.publisher.place New York en
dc.relation.alleditors M. El Mohajir, M. Al Achhab, M. Chahhou, A. Mounir, B. El Mohajir, V. Pirrelli, A. Zarghili, M. ElFar en
dc.relation.conferencedate 20-22/10/2014 en
dc.relation.conferencename Third IEEE International Colloquium in Information Science and Technology (CIST) en
dc.relation.conferenceplace Tetuan, Morocco en
dc.relation.firstpage 285 en
dc.relation.ispartofbook Third IEEE International Colloquium in Information Science and Technology (CIST) en
dc.relation.lastpage 290 en
dc.relation.medium ELETTRONICO en
dc.relation.numberofpages 6 en
dc.subject.keywords Design -
dc.subject.keywords Information Engineering -
dc.subject.keywords Design Patterns -
dc.subject.keywords Text Processing -
dc.subject.keywords Arabic Natural Language Processing -
dc.subject.singlekeyword Design *
dc.subject.singlekeyword Information Engineering *
dc.subject.singlekeyword Design Patterns *
dc.subject.singlekeyword Text Processing *
dc.subject.singlekeyword Arabic Natural Language Processing *
dc.title Towards a flexible open-source software library for multi-layered scholarly textual studies: An Arabic case study dealing with semi-automatic language processing en
dc.type.circulation Internazionale en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Comitato scientifico en
dc.ugov.descaux1 295187 -
iris.isi.extIssued 2014 -
iris.isi.extTitle Towards a flexible open-source software library for multi-layered scholarly textual studies An Arabic case study dealing with semi-automatic language processing -
iris.orcid.lastModifiedDate 2024/11/17 19:33:52 *
iris.orcid.lastModifiedMillisecond 1731868432838 *
iris.scopus.extIssued 2014 -
iris.scopus.extTitle Towards a flexible open-source software library for multi-layered scholarly textual studies: An Arabic case study dealing with semi-automatic language processing -
iris.scopus.ideLinkStatusDate 2024/11/17 19:33:52 *
iris.scopus.ideLinkStatusMillisecond 1731868432870 *
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost repository *
iris.unpaywall.bestoaversion submittedVersion *
iris.unpaywall.doi 10.1109/cist.2014.7016633 *
iris.unpaywall.hosttype repository *
iris.unpaywall.isoa true *
iris.unpaywall.landingpage http://hdl.handle.net/11573/855233 *
iris.unpaywall.license other-oa *
iris.unpaywall.metadataCallLastModified 12/06/2025 06:29:55 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1749702595203 -
iris.unpaywall.oastatus green *
iris.unpaywall.pdfurl http://hdl.handle.net/11573/855233 *
scopus.category 1711 *
scopus.category 1706 *
scopus.category 1803 *
scopus.category 1802 *
scopus.contributor.affiliation Consiglio Nazionale Delle Ricerche (CNR) -
scopus.contributor.affiliation Consiglio Nazionale Delle Ricerche (CNR) -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 56319538600 -
scopus.contributor.auid 56741333300 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Angelo Mario -
scopus.contributor.name Ouafae -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale A. Zampolli (ILC); -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale A. Zampolli (ILC); -
scopus.contributor.surname Del Grosso -
scopus.contributor.surname Nahli -
scopus.date.issued 2015 *
scopus.description.abstracteng This paper presents both the general model and a case study of the Computational and Collaborative Philology Library (CoPhiLib), an ongoing initiative underway at the Institute for Computational Linguistics (ILC) of the National Research Council (CNR), Pisa, Italy. The library, designed and organized as a reusable, abstract and open-source software component, aims at solving the needs of multi-lingual and cross-lingual analysis by exposing common Application Programming Interfaces (APIs). The core modules, coded by the Java programming language, constitute the groundwork of a Web platform designed to deal with textual scholarly needs. The Web application, implemented according to the Java Enterprise specifications, focuses on multi-layered analysis for the study of literary documents and related multimedia sources. This ambitious challenge seeks to obtain the management of textual resources, on the one hand by abstracting from current language, on the other hand by decoupling from the specific requirements of single projects. This goal is achieved thanks to methodologies declared by the 'agile process', and by putting into effect suitable use case modeling, design patterns, and component-based architectures. The reusability and flexibility of the system have been tested on an Arabic case study: the system allows users to choose the morphological engine (such as AraMorph or Al-Khalil), along with linguistic granularity (i.e. with or without declension). Finally, the application enables the construction of annotated resources for further statistical engines (training set). *
scopus.description.allpeopleoriginal Del Grosso A.M.; Nahli O. *
scopus.differences scopus.publisher.name *
scopus.differences scopus.subject.keywords *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.description.abstracteng *
scopus.differences scopus.relation.issue *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.identifier.isbn *
scopus.differences scopus.date.issued *
scopus.differences scopus.relation.conferenceplace *
scopus.differences scopus.relation.volume *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.doi 10.1109/CIST.2014.7016633 *
scopus.identifier.eissn 2327-1884 *
scopus.identifier.isbn 9781479959792 *
scopus.identifier.pui 605411574 *
scopus.identifier.scopus 2-s2.0-84938086244 *
scopus.journal.sourceid 21100400809 *
scopus.language.iso eng *
scopus.publisher.name Institute of Electrical and Electronics Engineers Inc. *
scopus.relation.article 7016633 *
scopus.relation.conferencedate 2014 *
scopus.relation.conferencename 3rd IEEE International Colloquium in Information Science and Technology, CIST 2014 *
scopus.relation.conferenceplace mar *
scopus.relation.firstpage 285 *
scopus.relation.issue January *
scopus.relation.lastpage 290 *
scopus.relation.volume 2015- *
scopus.subject.keywords API Design; Arabic Natural Language Processing; Design Patterns; Information Engineering; Text Processing; *
scopus.title Towards a flexible open-source software library for multi-layered scholarly textual studies: An Arabic case study dealing with semi-automatic language processing *
scopus.titleeng Towards a flexible open-source software library for multi-layered scholarly textual studies: An Arabic case study dealing with semi-automatic language processing *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/221676
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact