Bilingual reference corpora for translators and translation studies
Peters C; Picchi E
1996
Abstract
A recent trend in corpus linguistics is the construction of bilingual (or multilingual) reference corpora. So far, most studies on bilingual text archives have been on parallel corpora, i.e. sets of translationally equivalent texts. Such corpora provide valuable in-depth evidence of how texts written in one language can be rendered in another according to contextual factors, such as style, register, domain, etc., and are thus important sources of data when studying the translation process. However, a criticism made of the results of analyses based on these corpora is that a translation is always influenced by the source text and, therefore, the target language text(s) can never be considered a true representation of that language in use. It has been suggested that a more reliable source for certain studies, e.g. how a particular concept is expressed independently in different languages, is a comparable corpus. Comparable archives are sets of texts from pairs or multiples of languages that concern a given domain and can be contrasted because of their common features. However, developing procedures to query data from this kind of corpus is far more problematic than interrogating parallel archives, as it is difficult to automatically identify equivalent expressions in the different languages. We have studied a solution to this problem. We will present procedures that have been developed at the Istituto di Linguistica Computazionale, Pisa, to construct and query both parallel and comparable bilingual text corpora and to extract significant data for translation purposes and contrastive textual studies, giving examples of the different types of results that can be obtained. The procedures form part of an integrated mono- and bilingual text and lexical database system, implemented in a prototype version at the Institute.

| DC field | Value | Language |
|---|---|---|
| dc.authority.orgunit | Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI | - |
| dc.authority.people | Peters C | it |
| dc.authority.people | Picchi E | it |
| dc.collection.id.s | 33fc2b58-b895-438b-9d2a-2c5bc86a83a6 | * |
| dc.collection.name | 04.04 Presentazione/Comunicazione non pubblicata in atti di convegno | * |
| dc.contributor.appartenenza | Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.appartenenza.mi | 973 | * |
| dc.date.accessioned | 2024/02/20 04:38:22 | - |
| dc.date.available | 2024/02/20 04:38:22 | - |
| dc.date.issued | 1996 | - |
| dc.description.abstracteng | A recent trend in corpus linguistics is the construction of bilingual (or multilingual) reference corpora. So far, most studies on bilingual text archives have been on parallel corpora, i.e. sets of translationally equivalent texts. Such corpora provide valuable in-depth evidence of how texts written in one language can be rendered in another according to contextual factors, such as style, register, domain, etc., and are thus important sources of data when studying the translation process. However, a criticism made of the results of analyses based on these corpora is that a translation is always influenced by the source text and, therefore, the target language text(s) can never be considered a true representation of that language in use. It has been suggested that a more reliable source for certain studies, e.g. how a particular concept is expressed independently in different languages, is a comparable corpus. Comparable archives are sets of texts from pairs or multiples of languages that concern a given domain and can be contrasted because of their common features. However, developing procedures to query data from this kind of corpus is far more problematic than interrogating parallel archives, as it is difficult to automatically identify equivalent expressions in the different languages. We have studied a solution to this problem. We will present procedures that have been developed at the Istituto di Linguistica Computazionale, Pisa, to construct and query both parallel and comparable bilingual text corpora and to extract significant data for translation purposes and contrastive textual studies, giving examples of the different types of results that can be obtained. The procedures form part of an integrated mono- and bilingual text and lexical database system, implemented in a prototype version at the Institute. | - |
| dc.description.affiliations | CNR-IEI, Pisa, Italy; CNR-ILC, Pisa, Italy | - |
| dc.description.allpeople | Peters, C; Picchi, E | - |
| dc.description.allpeopleoriginal | Peters C.; Picchi E. | - |
| dc.description.fulltext | restricted | en |
| dc.description.note | Abstract/poster at international conferences - PuMa code: /cnr.iei/1996-A3-004 | - |
| dc.description.numberofauthors | 2 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/390684 | - |
| dc.language.iso | eng | - |
| dc.relation.conferencedate | 9-11 May 1996 | - |
| dc.relation.conferencename | International Translation Studies Conference Unity in Diversity? | - |
| dc.relation.conferenceplace | Dublin, Ireland | - |
| dc.subject.keywords | Textual Databases | - |
| dc.subject.keywords | Bilingual Reference Corpora | - |
| dc.subject.keywords | Contrastive Textology | - |
| dc.subject.singlekeyword | Textual Databases | * |
| dc.subject.singlekeyword | Bilingual Reference Corpora | * |
| dc.subject.singlekeyword | Contrastive Textology | * |
| dc.title | Bilingual reference corpora for translators and translation studies | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.04 Presentazione/Comunicazione non pubblicata in atti di convegno | it |
| dc.type.miur | -2.0 | - |
| dc.ugov.descaux1 | 409974 | - |
| iris.mediafilter.data | 2025/04/18 03:11:41 | * |
| iris.orcid.lastModifiedDate | 2024/04/04 10:01:26 | * |
| iris.orcid.lastModifiedMillisecond | 1712217686767 | * |
| iris.sitodocente.maxattempts | 1 | - |
| Appears in types: | 04.04 Presentazione/Comunicazione non pubblicata (convegno, evento, webinar...) | |

| File | Size | Format |
|---|---|---|
| prod_409974-doc_144244.pdf (authorized users only). Description: Bilingual reference corpora for translators and translation studies. Type: Publisher's version (PDF) | 242.71 kB | Adobe PDF |


