We discuss the importance of bilingual and multilingual text corpora in many types of cross language investigations and illustrate the differences between parallel and comparable text archives. The advantages of comparable over parallel data for certain kinds of contrastive linguistic studies arc outlined. A prototype version of a system for querying comparable text archives is then described and examples nf the first results arc given. The system will form part of an integrated works talion for mono- and bilingual lexical and text database management and interrogation under development at the Istituto di Linguistica Computazionole, Pisa.
Capturing the comparable : a system for querying comparable text corpora
Peters C;Picchi E
1995
Abstract
We discuss the importance of bilingual and multilingual text corpora in many types of cross language investigations and illustrate the differences between parallel and comparable text archives. The advantages of comparable over parallel data for certain kinds of contrastive linguistic studies arc outlined. A prototype version of a system for querying comparable text archives is then described and examples nf the first results arc given. The system will form part of an integrated works talion for mono- and bilingual lexical and text database management and interrogation under development at the Istituto di Linguistica Computazionole, Pisa.File | Dimensione | Formato | |
---|---|---|---|
prod_409987-doc_144254.pdf
solo utenti autorizzati
Descrizione: Capturing the comparable : a system for querying comparable text corpora
Tipologia:
Versione Editoriale (PDF)
Dimensione
1.23 MB
Formato
Adobe PDF
|
1.23 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.