Un corpus dell'italiano scritto contemporaneo dalla parte del ricevente (A corpus of contemporary written Italian from the receiver's perspective)

Cristina Burani; Lucia Marconi
1995

Abstract

In this paper we describe the criteria we adopted for selecting a corpus of 3,000,000 words from contemporary written Italian texts. The corpus will serve as the basis for a frequency dictionary, which should have two main characteristics: (i) representativeness of the Italian texts that are actually read, rather than of all possible written texts, and (ii) usefulness for psycholinguistic research.
Istituto di linguistica computazionale "Antonio Zampolli" - ILC
Istituto di Scienze e Tecnologie della Cognizione - ISTC
8879751603
Corpora
Frequency
Frequency Dictionary
Psycholinguistics

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/211265