Tenders or technical terms contain a large quantity of both technical, legal, managerial information mixed in a nested and complex net of relationships. Extracting technical and design information from a document whose aim is both legal and technical, and that is written using several specific jargons, is not a trivial task: the purpose of the research is to try to detect, extract, split and assign information from the text of a tender in an automatic way. It means being able to understand technical and legal terms and organize them in multiple ways: according to product structure, internal organisational structure, etc. The focus is in providing a handy tool that could speed up and facilitate human analysis and allow tackling also the process of transforming customer's requirements into design specifications. The approach chosen to overcome the various issues is to support state-of-the-art Computational Linguistic tools with a wide Knowledge Base. The latter has been constructed both manually and automatically and comprises not only keywords but also concepts, relationships and regular expressions. The implementation of the methodology has been carried out during a project for AnsaldoBreda S.p.A. (now Hitachi Rail Europe). A case study about the tender for a high-speed train has been included to show the functioning and output of the entire software system. (C) 2020 Elsevier B.V. All rights reserved.
Text mining tool for translating terms of contract into technical specifications: Development and application in the railway sector
Dell'Orletta F;
2021
Abstract
Tenders or technical terms contain a large quantity of both technical, legal, managerial information mixed in a nested and complex net of relationships. Extracting technical and design information from a document whose aim is both legal and technical, and that is written using several specific jargons, is not a trivial task: the purpose of the research is to try to detect, extract, split and assign information from the text of a tender in an automatic way. It means being able to understand technical and legal terms and organize them in multiple ways: according to product structure, internal organisational structure, etc. The focus is in providing a handy tool that could speed up and facilitate human analysis and allow tackling also the process of transforming customer's requirements into design specifications. The approach chosen to overcome the various issues is to support state-of-the-art Computational Linguistic tools with a wide Knowledge Base. The latter has been constructed both manually and automatically and comprises not only keywords but also concepts, relationships and regular expressions. The implementation of the methodology has been carried out during a project for AnsaldoBreda S.p.A. (now Hitachi Rail Europe). A case study about the tender for a high-speed train has been included to show the functioning and output of the entire software system. (C) 2020 Elsevier B.V. All rights reserved.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.ancejournal | COMPUTERS IN INDUSTRY | en |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Fantoni G | en |
| dc.authority.people | Coli E | en |
| dc.authority.people | Chiarello F | en |
| dc.authority.people | Apreda R | en |
| dc.authority.people | Dell'Orletta F | en |
| dc.authority.people | Pratelli G | en |
| dc.collection.id.s | b3f88f24-048a-4e43-8ab1-6697b90e068e | * |
| dc.collection.name | 01.01 Articolo in rivista | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.date.accessioned | 2024/02/21 05:22:27 | - |
| dc.date.available | 2024/02/21 05:22:27 | - |
| dc.date.firstsubmission | 2024/12/20 13:10:59 | * |
| dc.date.issued | 2021 | - |
| dc.date.submission | 2025/01/24 15:16:33 | * |
| dc.description.abstracteng | Tenders or technical terms contain a large quantity of both technical, legal, managerial information mixed in a nested and complex net of relationships. Extracting technical and design information from a document whose aim is both legal and technical, and that is written using several specific jargons, is not a trivial task: the purpose of the research is to try to detect, extract, split and assign information from the text of a tender in an automatic way. It means being able to understand technical and legal terms and organize them in multiple ways: according to product structure, internal organisational structure, etc. The focus is in providing a handy tool that could speed up and facilitate human analysis and allow tackling also the process of transforming customer's requirements into design specifications. The approach chosen to overcome the various issues is to support state-of-the-art Computational Linguistic tools with a wide Knowledge Base. The latter has been constructed both manually and automatically and comprises not only keywords but also concepts, relationships and regular expressions. The implementation of the methodology has been carried out during a project for AnsaldoBreda S.p.A. (now Hitachi Rail Europe). A case study about the tender for a high-speed train has been included to show the functioning and output of the entire software system. (C) 2020 Elsevier B.V. All rights reserved. | - |
| dc.description.affiliations | Univesrità di Pisa; Università di Pisa; Erre Quadro Srl; ILC-CNR; Hitachi Rail SpA | - |
| dc.description.allpeople | Fantoni, G; Coli, E; Chiarello, F; Apreda, R; Dell'Orletta, F; Pratelli, G | - |
| dc.description.allpeopleoriginal | Fantoni G., Coli E., Chiarello F., Apreda R., Dell'Orletta F., Pratelli G. | en |
| dc.description.fulltext | open | en |
| dc.description.numberofauthors | 6 | - |
| dc.identifier.doi | 10.1016/j.compind.2020.103357 | en |
| dc.identifier.isi | WOS:000601332500007 | - |
| dc.identifier.scopus | 2-s2.0-85097351785 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/401396 | - |
| dc.language.iso | eng | en |
| dc.miur.last.status.update | 2024-12-20T12:11:19Z | * |
| dc.relation.numberofpages | 17 | en |
| dc.relation.volume | 124 | en |
| dc.subject.keywordseng | Contract terms | - |
| dc.subject.keywordseng | Technical requirements | - |
| dc.subject.keywordseng | Natural language processing | - |
| dc.subject.singlekeyword | Contract terms | * |
| dc.subject.singlekeyword | Technical requirements | * |
| dc.subject.singlekeyword | Natural language processing | * |
| dc.title | Text mining tool for translating terms of contract into technical specifications: Development and application in the railway sector | en |
| dc.type.circulation | Internazionale | en |
| dc.type.driver | info:eu-repo/semantics/article | - |
| dc.type.full | 01 Contributo su Rivista::01.01 Articolo in rivista | it |
| dc.type.impactfactor | si | en |
| dc.type.miur | 262 | - |
| dc.type.referee | Esperti anonimi | en |
| dc.ugov.descaux1 | 450809 | - |
| iris.isi.extIssued | 2021 | - |
| iris.isi.extTitle | Text mining tool for translating terms of contract into technical specifications: Development and application in the railway sector | - |
| iris.mediafilter.data | 2025/04/08 04:47:39 | * |
| iris.orcid.lastModifiedDate | 2025/04/11 01:21:22 | * |
| iris.orcid.lastModifiedMillisecond | 1744327282229 | * |
| iris.scopus.extIssued | 2021 | - |
| iris.scopus.extTitle | Text mining tool for translating terms of contract into technical specifications: Development and application in the railway sector | - |
| iris.sitodocente.maxattempts | 1 | - |
| iris.unpaywall.bestoahost | repository | * |
| iris.unpaywall.bestoaversion | submittedVersion | * |
| iris.unpaywall.doi | 10.1016/j.compind.2020.103357 | * |
| iris.unpaywall.hosttype | repository | * |
| iris.unpaywall.isoa | true | * |
| iris.unpaywall.journalisindoaj | false | * |
| iris.unpaywall.landingpage | https://www.sciencedirect.com/science/article/abs/pii/S0166361520305911?via=ihub | * |
| iris.unpaywall.license | cc-by-nc-nd | * |
| iris.unpaywall.metadataCallLastModified | 28/04/2026 04:33:51 | - |
| iris.unpaywall.metadataCallLastModifiedMillisecond | 1777343631928 | - |
| iris.unpaywall.oastatus | green | * |
| isi.authority.ancejournal | COMPUTERS IN INDUSTRY###0166-3615 | * |
| isi.category | EV | * |
| isi.contributor.affiliation | University of Pisa | - |
| isi.contributor.affiliation | University of Pisa | - |
| isi.contributor.affiliation | University of Pisa | - |
| isi.contributor.affiliation | Erre Quadro Srl | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | Hitachi Rail SpA | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.name | G. | - |
| isi.contributor.name | E. | - |
| isi.contributor.name | F. | - |
| isi.contributor.name | R. | - |
| isi.contributor.name | F. | - |
| isi.contributor.name | G. | - |
| isi.contributor.researcherId | GWZ-8445-2022 | - |
| isi.contributor.researcherId | ITV-2617-2023 | - |
| isi.contributor.researcherId | DVB-0474-2022 | - |
| isi.contributor.researcherId | CDK-4389-2022 | - |
| isi.contributor.researcherId | AAX-1864-2020 | - |
| isi.contributor.researcherId | DNC-0893-2022 | - |
| isi.contributor.subaffiliation | Dept Civil & Ind Engn | - |
| isi.contributor.subaffiliation | Dept Informat Engn | - |
| isi.contributor.subaffiliation | Dept Civil & Ind Engn | - |
| isi.contributor.subaffiliation | - | |
| isi.contributor.subaffiliation | Inst Linguist Computaz Antonio Zampolli ILC | - |
| isi.contributor.subaffiliation | - | |
| isi.contributor.surname | Fantoni | - |
| isi.contributor.surname | Coli | - |
| isi.contributor.surname | Chiarello | - |
| isi.contributor.surname | Apreda | - |
| isi.contributor.surname | Dell'Orletta | - |
| isi.contributor.surname | Pratelli | - |
| isi.date.issued | 2021 | * |
| isi.description.abstracteng | Tenders or technical terms contain a large quantity of both technical, legal, managerial information mixed in a nested and complex net of relationships. Extracting technical and design information from a document whose aim is both legal and technical, and that is written using several specific jargons, is not a trivial task: the purpose of the research is to try to detect, extract, split and assign information from the text of a tender in an automatic way. It means being able to understand technical and legal terms and organize them in multiple ways: according to product structure, internal organisational structure, etc. The focus is in providing a handy tool that could speed up and facilitate human analysis and allow tackling also the process of transforming customer's requirements into design specifications. The approach chosen to overcome the various issues is to support state-of-the-art Computational Linguistic tools with a wide Knowledge Base. The latter has been constructed both manually and automatically and comprises not only keywords but also concepts, relationships and regular expressions. The implementation of the methodology has been carried out during a project for AnsaldoBreda S.p.A. (now Hitachi Rail Europe). A case study about the tender for a high-speed train has been included to show the functioning and output of the entire software system. (C) 2020 Elsevier B.V. All rights reserved. | * |
| isi.description.allpeopleoriginal | Fantoni, G; Coli, E; Chiarello, F; Apreda, R; Dell'Orletta, F; Pratelli, G; | * |
| isi.document.sourcetype | WOS.SCI | * |
| isi.document.type | Article | * |
| isi.document.types | Article | * |
| isi.identifier.doi | 10.1016/j.compind.2020.103357 | * |
| isi.identifier.eissn | 1872-6194 | * |
| isi.identifier.isi | WOS:000601332500007 | * |
| isi.journal.journaltitle | COMPUTERS IN INDUSTRY | * |
| isi.journal.journaltitleabbrev | COMPUT IND | * |
| isi.language.original | English | * |
| isi.publisher.place | RADARWEG 29, 1043 NX AMSTERDAM, NETHERLANDS | * |
| isi.relation.volume | 124 | * |
| isi.title | Text mining tool for translating terms of contract into technical specifications: Development and application in the railway sector | * |
| scopus.authority.ancejournal | COMPUTERS IN INDUSTRY###0166-3615 | * |
| scopus.category | 1700 | * |
| scopus.category | 2200 | * |
| scopus.contributor.affiliation | University of Pisa | - |
| scopus.contributor.affiliation | University of Pisa | - |
| scopus.contributor.affiliation | University of Pisa | - |
| scopus.contributor.affiliation | Erre Quadro s.r.l. | - |
| scopus.contributor.affiliation | Istituto di Linguistica Computazionale “Antonio Zampolli” ILC – CNR | - |
| scopus.contributor.affiliation | Formerly with AnsaldoBreda S.p.A. (now Hitachi Rail S.p.A.) | - |
| scopus.contributor.afid | 60028868 | - |
| scopus.contributor.afid | 60028868 | - |
| scopus.contributor.afid | 60028868 | - |
| scopus.contributor.afid | 113670748 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 125513429 | - |
| scopus.contributor.auid | 8515018000 | - |
| scopus.contributor.auid | 57219034457 | - |
| scopus.contributor.auid | 57215031142 | - |
| scopus.contributor.auid | 12752898300 | - |
| scopus.contributor.auid | 57540567000 | - |
| scopus.contributor.auid | 57220550119 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.dptid | 113243569 | - |
| scopus.contributor.dptid | 113917116 | - |
| scopus.contributor.dptid | 113243569 | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.name | G. | - |
| scopus.contributor.name | E. | - |
| scopus.contributor.name | F. | - |
| scopus.contributor.name | R. | - |
| scopus.contributor.name | F. | - |
| scopus.contributor.name | G. | - |
| scopus.contributor.subaffiliation | Department of Civil and Industrial Engineering; | - |
| scopus.contributor.subaffiliation | Department of Information Engineering; | - |
| scopus.contributor.subaffiliation | Department of Civil and Industrial Engineering; | - |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.surname | Fantoni | - |
| scopus.contributor.surname | Coli | - |
| scopus.contributor.surname | Chiarello | - |
| scopus.contributor.surname | Apreda | - |
| scopus.contributor.surname | Dell'Orletta | - |
| scopus.contributor.surname | Pratelli | - |
| scopus.date.issued | 2021 | * |
| scopus.description.abstracteng | Tenders or technical terms contain a large quantity of both technical, legal, managerial information mixed in a nested and complex net of relationships. Extracting technical and design information from a document whose aim is both legal and technical, and that is written using several specific jargons, is not a trivial task: the purpose of the research is to try to detect, extract, split and assign information from the text of a tender in an automatic way. It means being able to understand technical and legal terms and organize them in multiple ways: according to product structure, internal organisational structure, etc. The focus is in providing a handy tool that could speed up and facilitate human analysis and allow tackling also the process of transforming customer's requirements into design specifications. The approach chosen to overcome the various issues is to support state-of-the-art Computational Linguistic tools with a wide Knowledge Base. The latter has been constructed both manually and automatically and comprises not only keywords but also concepts, relationships and regular expressions. The implementation of the methodology has been carried out during a project for AnsaldoBreda S.p.A. (now Hitachi Rail Europe). A case study about the tender for a high-speed train has been included to show the functioning and output of the entire software system. | * |
| scopus.description.allpeopleoriginal | Fantoni G.; Coli E.; Chiarello F.; Apreda R.; Dell'Orletta F.; Pratelli G. | * |
| scopus.differences | scopus.subject.keywords | * |
| scopus.differences | scopus.description.allpeopleoriginal | * |
| scopus.differences | scopus.description.abstracteng | * |
| scopus.document.type | ar | * |
| scopus.document.types | ar | * |
| scopus.identifier.doi | 10.1016/j.compind.2020.103357 | * |
| scopus.identifier.pui | 2010104460 | * |
| scopus.identifier.scopus | 2-s2.0-85097351785 | * |
| scopus.journal.sourceid | 19080 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | Elsevier B.V. | * |
| scopus.relation.article | 103357 | * |
| scopus.relation.volume | 124 | * |
| scopus.subject.keywords | Computational science; Contract terms; Natural language processing; Technical requirements; Tendering; Text mining; | * |
| scopus.title | Text mining tool for translating terms of contract into technical specifications: Development and application in the railway sector | * |
| scopus.titleeng | Text mining tool for translating terms of contract into technical specifications: Development and application in the railway sector | * |
| Appare nelle tipologie: | 01.01 Articolo in rivista | |
| File | Dimensione | Formato | |
|---|---|---|---|
|
1-s2.0-S0166361520305911-main.pdf
accesso aperto
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
4.65 MB
Formato
Adobe PDF
|
4.65 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


