<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/CINECAstyle.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-06-17T16:08:30Z</responseDate><request verb="GetRecord" identifier="oai:iris.cnr.it:20.500.14243/570501" metadataPrefix="oai_dc">https://iris.cnr.it/oai/request</request><GetRecord><record><header><identifier>oai:iris.cnr.it:20.500.14243/570501</identifier><datestamp>2026-03-04T01:27:52Z</datestamp><setSpec>com_20.500.14243_22</setSpec><setSpec>com_20.500.14243_21</setSpec><setSpec>col_20.500.14243_23</setSpec><setSpec>ou_ou239</setSpec></header><metadata><oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:doc="http://www.lyncode.com/xoai" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/elements/1.1/" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
<dc:title>Efficient multi-task learning with instance selection for biomedical NLP</dc:title>
<dc:creator>Bonfigli A.</dc:creator>
<dc:creator>Bacco L.</dc:creator>
<dc:creator>Pecchia L.</dc:creator>
<dc:creator>Merone M.</dc:creator>
<dc:creator>Dell'Orletta F.</dc:creator>
<dc:contributor>Bonfigli, A.</dc:contributor>
<dc:contributor> Bacco, L.</dc:contributor>
<dc:contributor> Pecchia, L.</dc:contributor>
<dc:contributor> Merone, M.</dc:contributor>
<dc:contributor> Dell'Orletta, F.</dc:contributor>
<dc:subject>Biomedical NLP</dc:subject>
<dc:subject>BLUE benchmark</dc:subject>
<dc:subject>Computational efficiency</dc:subject>
<dc:subject>Instance selection</dc:subject>
<dc:subject>Multi-task learning</dc:subject>
<dc:description>Background: Biomedical natural language processing (NLP) increasingly relies on large language models and extensive datasets, presenting significant computational challenges. Methods: We propose Blue5, a multi-task model based on SciFive that incorporates instance selection (IS) to enable efficient, multi-task learning (MTL) on biomedical data. We adapt the E2SC-IS framework for the biomedical domain, integrating a calibrated SVM classifier to reduce computational costs. Results: Our approach achieves an average data reduction of 26.6% across the several tasks of the BLUE (Biomedical Language Understanding Evaluation) Benchmark, while maintaining performance comparable with state-of-the-art models. The multi-task SVM configuration emerges as the most effective, demonstrating the power of combining IS with MTL for biomedical NLP. As a result of the unified framework, Blue5 effectively selects the most informative instances across tasks, ensuring model generalization while efficiently handling multiple NLP tasks. Conclusion: Our work offers a practical solution to address growing computational demands, enabling more scalable and accessible applications of advanced NLP techniques in biomedical research and healthcare.</dc:description>
<dc:date>2025</dc:date>
<dc:type>info:eu-repo/semantics/article</dc:type>
<dc:identifier>https://hdl.handle.net/20.500.14243/570501</dc:identifier>
<dc:identifier>10.1016/j.compbiomed.2025.110050</dc:identifier>
<dc:identifier>info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-105001252768</dc:identifier>
<dc:language>eng</dc:language>
<dc:relation>volume:190</dc:relation>
<dc:relation>journal:COMPUTERS IN BIOLOGY AND MEDICINE</dc:relation>
<dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
<dc:rights>license:Creative commons</dc:rights>
<dc:rights>license uri:http://creativecommons.org/licenses/by-nc-nd/4.0/</dc:rights>
</oai_dc:dc></metadata></record></GetRecord></OAI-PMH>