CNR Institutional Research Information System

A recent trend in corpus linguistics is the construction of bilingual (or multilingual) reference corpora. So far most studies on bilingual text archives have been on parallel corpora, i. e.sets of translationally equivalent texts. Such corpora provide valuable indepth evidence of how texts written in one language can be rendered in another according to contextual factors, such as style, register, domain, etc., and are thus important sources of data when studying the translation processo However, a criticism made of the results of analyses based on these corpora is that a translation is alwaysinfluenced by the source text and, therefore, the target language text(s) can never be considered a true representation of that language in usage. It has been suggested that a more reliable source for certain studies, e.g. how a particular concept is expressed independently in different languages, is a comparable corpus. Comparable archives are sets of texts from pairs or multiples of languages that concem a given domain and can be contrasted because of their common features. However, the development of procedures to query data from this kind of corpus is far more problematic than when interrogating parallel archives as it is difficult to identify automatically equivalent expressions in the different languages. We have studied a solution to this problem. We will present procedures that have been developed at the Istituto di Linguistica Computazionale, Pisa, to construct and query both parallel and comparable bilingual text corpora and to extract significant data for translation purposes and contrastivetextual studies, giving examples of the different types of results that can be obtained. The procedures form part of an integrated mono- and bilingual text and lexical database system, implemented in a prototypal version at the Institute.

Bilingual reference corpora for translators and translation studies

Peters C;Picchi E

1996

Abstract

A recent trend in corpus linguistics is the construction of bilingual (or multilingual) reference corpora. So far most studies on bilingual text archives have been on parallel corpora, i. e.sets of translationally equivalent texts. Such corpora provide valuable indepth evidence of how texts written in one language can be rendered in another according to contextual factors, such as style, register, domain, etc., and are thus important sources of data when studying the translation processo However, a criticism made of the results of analyses based on these corpora is that a translation is alwaysinfluenced by the source text and, therefore, the target language text(s) can never be considered a true representation of that language in usage. It has been suggested that a more reliable source for certain studies, e.g. how a particular concept is expressed independently in different languages, is a comparable corpus. Comparable archives are sets of texts from pairs or multiples of languages that concem a given domain and can be contrasted because of their common features. However, the development of procedures to query data from this kind of corpus is far more problematic than when interrogating parallel archives as it is difficult to identify automatically equivalent expressions in the different languages. We have studied a solution to this problem. We will present procedures that have been developed at the Istituto di Linguistica Computazionale, Pisa, to construct and query both parallel and comparable bilingual text corpora and to extract significant data for translation purposes and contrastivetextual studies, giving examples of the different types of results that can be obtained. The procedures form part of an integrated mono- and bilingual text and lexical database system, implemented in a prototypal version at the Institute.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.orgunit	Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI	-
dc.authority.people	Peters C	it
dc.authority.people	Picchi E	it
dc.collection.id.s	33fc2b58-b895-438b-9d2a-2c5bc86a83a6	*
dc.collection.name	04.04 Presentazione/Comunicazione non pubblicata in atti di convegno	*
dc.contributor.appartenenza	Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	918	*
dc.contributor.appartenenza.mi	973	*
dc.date.accessioned	2024/02/20 04:38:22	-
dc.date.available	2024/02/20 04:38:22	-
dc.date.issued	1996	-
dc.description.abstracteng	A recent trend in corpus linguistics is the construction of bilingual (or multilingual) reference corpora. So far most studies on bilingual text archives have been on parallel corpora, i. e.sets of translationally equivalent texts. Such corpora provide valuable indepth evidence of how texts written in one language can be rendered in another according to contextual factors, such as style, register, domain, etc., and are thus important sources of data when studying the translation processo However, a criticism made of the results of analyses based on these corpora is that a translation is alwaysinfluenced by the source text and, therefore, the target language text(s) can never be considered a true representation of that language in usage. It has been suggested that a more reliable source for certain studies, e.g. how a particular concept is expressed independently in different languages, is a comparable corpus. Comparable archives are sets of texts from pairs or multiples of languages that concem a given domain and can be contrasted because of their common features. However, the development of procedures to query data from this kind of corpus is far more problematic than when interrogating parallel archives as it is difficult to identify automatically equivalent expressions in the different languages. We have studied a solution to this problem. We will present procedures that have been developed at the Istituto di Linguistica Computazionale, Pisa, to construct and query both parallel and comparable bilingual text corpora and to extract significant data for translation purposes and contrastivetextual studies, giving examples of the different types of results that can be obtained. The procedures form part of an integrated mono- and bilingual text and lexical database system, implemented in a prototypal version at the Institute.	-
dc.description.affiliations	CNR-IEI, Pisa, Italy; CNR-ILC, Pisa, Italy	-
dc.description.allpeople	Peters, C; Picchi, E	-
dc.description.allpeopleoriginal	Peters C.; Picchi E.	-
dc.description.fulltext	restricted	en
dc.description.note	Abstract/poster in convegni internazionali - Codice PuMa: /cnr.iei/1996-A3-004	-
dc.description.numberofauthors	2	-
dc.identifier.uri	https://hdl.handle.net/20.500.14243/390684	-
dc.language.iso	eng	-
dc.relation.conferencedate	9-11 may 1996	-
dc.relation.conferencename	International Translation Studies Conference Unity in Diversity?	-
dc.relation.conferenceplace	Dublin, Irlanda	-
dc.subject.keywords	Textual Databases	-
dc.subject.keywords	Bilingual Reference Corpora	-
dc.subject.keywords	Contrastive Textology	-
dc.subject.singlekeyword	Textual Databases	*
dc.subject.singlekeyword	Bilingual Reference Corpora	*
dc.subject.singlekeyword	Contrastive Textology	*
dc.title	Bilingual reference corpora for translators and translation studies	en
dc.type.driver	info:eu-repo/semantics/conferenceObject	-
dc.type.full	04 Contributo in convegno::04.04 Presentazione/Comunicazione non pubblicata in atti di convegno	it
dc.type.miur	-2.0	-
dc.ugov.descaux1	409974	-
iris.mediafilter.data	2025/04/18 03:11:41	*
iris.orcid.lastModifiedDate	2024/04/04 10:01:26	*
iris.orcid.lastModifiedMillisecond	1712217686767	*
iris.sitodocente.maxattempts	1	-
Appare nelle tipologie:	04.04 Presentazione/Comunicazione non pubblicata (convegno, evento, webinar...)

File in questo prodotto:

File	Dimensione	Formato
prod_409974-doc_144244.pdf solo utenti autorizzati Descrizione: Bilingual reference corpora for translators and translation studies Tipologia: Versione Editoriale (PDF) Dimensione 242.71 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	242.71 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/390684

Citazioni

ND

ND

ND

social impact