This work provides a comprehensive and systematic description of the installation, configuration, and deployment of the CLARIN-DSpace 7 platform, carried out in the context of upgrading the ILC4CLARIN repository to its latest version. CLARIN-DSpace 7 is a customized distribution of DSpace tailored to the needs of the European CLARIN infrastructure for managing language resources and technologies. The document presents a detailed technical guide covering the entire process, from system environment preparation to the configuration of core components, including the Spring Boot backend, Angular frontend, PostgreSQL database, Solr indexing engine, Tomcat application server, and Shibboleth-based federated authentication services. It also addresses key aspects such as SSL certificate management, Apache web server configuration, integration with AAI systems, and user interface customization. Particular attention is given to migration procedures from previous DSpace versions (5.x), highlighting challenges and solutions to ensure data integrity and service continuity. The work serves as a practical and replicable reference for the implementation of interoperable institutional repositories compliant with CLARIN standards and FAIR principles, supporting the management and dissemination of linguistic data and NLP tools in the scientific domain.
Installing and configuring CLARIN-DSPACE 7
Riccardo Del GrattaSecondo
Supervision
;Michele MalliaPrimo
Writing – Original Draft Preparation
2026
Abstract
This work provides a comprehensive and systematic description of the installation, configuration, and deployment of the CLARIN-DSpace 7 platform, carried out in the context of upgrading the ILC4CLARIN repository to its latest version. CLARIN-DSpace 7 is a customized distribution of DSpace tailored to the needs of the European CLARIN infrastructure for managing language resources and technologies. The document presents a detailed technical guide covering the entire process, from system environment preparation to the configuration of core components, including the Spring Boot backend, Angular frontend, PostgreSQL database, Solr indexing engine, Tomcat application server, and Shibboleth-based federated authentication services. It also addresses key aspects such as SSL certificate management, Apache web server configuration, integration with AAI systems, and user interface customization. Particular attention is given to migration procedures from previous DSpace versions (5.x), highlighting challenges and solutions to ensure data integrity and service continuity. The work serves as a practical and replicable reference for the implementation of interoperable institutional repositories compliant with CLARIN standards and FAIR principles, supporting the management and dissemination of linguistic data and NLP tools in the scientific domain.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


