This paper presents both the general model and a case study of the Computational and Collaborative Philology Library (CoPhiLib), an ongoing initiative underway at the Institute for Computational Linguistics (ILC) of the National Research Council (CNR), Pisa, Italy. The library, designed and organized as a reusable, abstract and open-source software component, aims at solving the needs of multi-lingual and cross- lingual analysis by exposing common Application Programming Interfaces (APIs). The core modules, coded by the Java programming language, constitute the groundwork of a Web platform designed to deal with textual scholarly needs. The Web application, implemented according to the Java Enterprise specifications, focuses on multi-layered analysis for the study of literary documents and related multimedia sources. This ambitious challenge seeks to obtain the management of textual resources, on the one hand by abstracting from current language, on the other hand by decoupling from the specific requirements of single projects. This goal is achieved thanks to methodologies declared by the "agile process", and by putting into effect suitable use case modeling, design patterns, and component- based architectures. The reusability and flexibility of the system have been tested on an Arabic case study: the system allows users to choose the morphological engine (such as AraMorph or Al- Khalil), along with linguistic granularity (i.e. with or without declension). Finally, the application enables the construction of annotated resources for further statistical engines (training set).

Towards a flexible open-source software library for multi-layered scholarly textual studies - An Arabic use-case dealing with semi-automatic language processing

Angelo Mario Del Grosso;Ouafae Nahli
2014

Abstract

This paper presents both the general model and a case study of the Computational and Collaborative Philology Library (CoPhiLib), an ongoing initiative underway at the Institute for Computational Linguistics (ILC) of the National Research Council (CNR), Pisa, Italy. The library, designed and organized as a reusable, abstract and open-source software component, aims at solving the needs of multi-lingual and cross- lingual analysis by exposing common Application Programming Interfaces (APIs). The core modules, coded by the Java programming language, constitute the groundwork of a Web platform designed to deal with textual scholarly needs. The Web application, implemented according to the Java Enterprise specifications, focuses on multi-layered analysis for the study of literary documents and related multimedia sources. This ambitious challenge seeks to obtain the management of textual resources, on the one hand by abstracting from current language, on the other hand by decoupling from the specific requirements of single projects. This goal is achieved thanks to methodologies declared by the "agile process", and by putting into effect suitable use case modeling, design patterns, and component- based architectures. The reusability and flexibility of the system have been tested on an Arabic case study: the system allows users to choose the morphological engine (such as AraMorph or Al- Khalil), along with linguistic granularity (i.e. with or without declension). Finally, the application enables the construction of annotated resources for further statistical engines (training set).
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Angelo Mario Del Grosso it
dc.authority.people Ouafae Nahli it
dc.collection.id.s 010b2614-196f-4b19-86fc-88182f427232 *
dc.collection.name 04.03 Poster in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/20 19:22:51 -
dc.date.available 2024/02/20 19:22:51 -
dc.date.issued 2014 -
dc.description.abstracteng This paper presents both the general model and a case study of the Computational and Collaborative Philology Library (CoPhiLib), an ongoing initiative underway at the Institute for Computational Linguistics (ILC) of the National Research Council (CNR), Pisa, Italy. The library, designed and organized as a reusable, abstract and open-source software component, aims at solving the needs of multi-lingual and cross- lingual analysis by exposing common Application Programming Interfaces (APIs). The core modules, coded by the Java programming language, constitute the groundwork of a Web platform designed to deal with textual scholarly needs. The Web application, implemented according to the Java Enterprise specifications, focuses on multi-layered analysis for the study of literary documents and related multimedia sources. This ambitious challenge seeks to obtain the management of textual resources, on the one hand by abstracting from current language, on the other hand by decoupling from the specific requirements of single projects. This goal is achieved thanks to methodologies declared by the "agile process", and by putting into effect suitable use case modeling, design patterns, and component- based architectures. The reusability and flexibility of the system have been tested on an Arabic case study: the system allows users to choose the morphological engine (such as AraMorph or Al- Khalil), along with linguistic granularity (i.e. with or without declension). Finally, the application enables the construction of annotated resources for further statistical engines (training set). -
dc.description.affiliations Istituto di Linguistica Computazionale del Consiglio Nazionale delle Ricerche, Istituto di Linguistica Computazionale del Consiglio Nazionale delle Ricerche -
dc.description.allpeople DEL GROSSO, ANGELO MARIO; Nahli, Ouafae -
dc.description.allpeopleoriginal Angelo Mario Del Grosso, Ouafae Nahli -
dc.description.fulltext none en
dc.description.numberofauthors 2 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/228544 -
dc.language.iso eng -
dc.relation.conferencedate 20-22/10/ 2014 -
dc.relation.conferencename ANLP IEEE CIST14 -
dc.relation.conferenceplace Tetuan, Morocco -
dc.subject.keywords Computational and collaborative philology -
dc.subject.keywords API -
dc.subject.singlekeyword Computational and collaborative philology *
dc.subject.singlekeyword API *
dc.title Towards a flexible open-source software library for multi-layered scholarly textual studies - An Arabic use-case dealing with semi-automatic language processing en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.03 Poster in Atti di convegno it
dc.type.miur 275 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 288053 -
iris.orcid.lastModifiedDate 2024/04/04 10:04:56 *
iris.orcid.lastModifiedMillisecond 1712217896241 *
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.03 Poster in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/228544
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact