Bilingual reference corpora for translators and translation studies

Peters C.; Picchi E.
1996

Abstract

A recent trend in corpus linguistics is the construction of bilingual (or multilingual) reference corpora. So far, most studies on bilingual text archives have focused on parallel corpora, i.e. sets of translationally equivalent texts. Such corpora provide valuable in-depth evidence of how texts written in one language can be rendered in another according to contextual factors such as style, register, and domain, and are thus important sources of data when studying the translation process. However, a criticism made of the results of analyses based on these corpora is that a translation is always influenced by the source text and, therefore, the target language text(s) can never be considered a true representation of that language in use. It has been suggested that a more reliable source for certain studies, e.g. how a particular concept is expressed independently in different languages, is a comparable corpus. Comparable archives are sets of texts from pairs or multiples of languages that concern a given domain and can be contrasted because of their common features. However, developing procedures to query data from this kind of corpus is far more problematic than interrogating parallel archives, as it is difficult to identify equivalent expressions in the different languages automatically. We have studied a solution to this problem. We will present procedures that have been developed at the Istituto di Linguistica Computazionale, Pisa, to construct and query both parallel and comparable bilingual text corpora and to extract significant data for translation purposes and contrastive textual studies, giving examples of the different types of results that can be obtained. The procedures form part of an integrated mono- and bilingual text and lexical database system, implemented in a prototype version at the Institute.
DC field Value Language
dc.authority.orgunit Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI -
dc.authority.people Peters C it
dc.authority.people Picchi E it
dc.collection.id.s 33fc2b58-b895-438b-9d2a-2c5bc86a83a6 *
dc.collection.name 04.04 Presentation/communication not published in conference proceedings *
dc.contributor.appartenenza Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.appartenenza.mi 973 *
dc.date.accessioned 2024/02/20 04:38:22 -
dc.date.available 2024/02/20 04:38:22 -
dc.date.issued 1996 -
dc.description.affiliations CNR-IEI, Pisa, Italy; CNR-ILC, Pisa, Italy -
dc.description.allpeople Peters, C; Picchi, E -
dc.description.allpeopleoriginal Peters C.; Picchi E. -
dc.description.fulltext restricted en
dc.description.note Abstract/poster in convegni internazionali - Codice PuMa: /cnr.iei/1996-A3-004 -
dc.description.numberofauthors 2 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/390684 -
dc.language.iso eng -
dc.relation.conferencedate 9-11 May 1996 -
dc.relation.conferencename International Translation Studies Conference "Unity in Diversity?" -
dc.relation.conferenceplace Dublin, Ireland -
dc.subject.keywords Textual Databases -
dc.subject.keywords Bilingual Reference Corpora -
dc.subject.keywords Contrastive Textology -
dc.title Bilingual reference corpora for translators and translation studies en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Conference contribution::04.04 Presentation/communication not published in conference proceedings it
dc.type.miur -2.0 -
dc.ugov.descaux1 409974 -