text production process involves interrelated choices
→ multi-dimensional approach seems a good fit (Biber 1991; Biber & Conrad 2009)
wri, spo, web
web-mul
): aggregated according to author and time
spo: one speaker within conversation
Originally: 140+ features, final list: 122, e.g.:
Type-based features – inventories of pronouns, prepositions, conjunctions (relativized using zTTR, Cvrček & Chlumská 2015)
Lexical richness – Yule’s K, thematic concentration (Popescu et al. 2007), unigrams & bigrams (zTTR)
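As an illustration of one lexical richness feature, here is a minimal sketch of Yule's K in its standard textbook form, K = 10^4 · (Σ m² · V_m − N) / N², where N is the number of tokens and V_m is the number of types occurring exactly m times. The function name `yules_k` and the pre-tokenized list input are assumptions made for this example; it is not the project's actual feature-extraction code.

```python
from collections import Counter


def yules_k(tokens):
    """Yule's K for a list of tokens (hypothetical helper, illustration only)."""
    n = len(tokens)
    if n == 0:
        raise ValueError("Yule's K is undefined for an empty text")
    # Frequency of each type, e.g. {"a": 2, "b": 1}
    type_freqs = Counter(tokens)
    # Frequency spectrum: m -> V_m, the number of types occurring exactly m times
    spectrum = Counter(type_freqs.values())
    # Sum over m of m^2 * V_m
    s2 = sum(m * m * v_m for m, v_m in spectrum.items())
    return 1e4 * (s2 - n) / (n * n)


# Usage: higher K means more repetition, i.e. lower lexical richness
print(yules_k("a a b c".split()))
```

A text in which every token is a distinct type gives K = 0, and K grows as a few types account for more of the tokens. Unlike a raw type-token ratio, K is usually treated as fairly stable across text lengths.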
spo
data?)
This research was supported by the ERDF project Language Variation in the CNC, no. CZ.02.1.01/0.0/0.0/16_013/0001758.
It builds upon work made possible by the Czech National Corpus project (LM2015044) funded by the Ministry of Education, Youth and Sports of the Czech Republic within the framework of Large Research, Development and Innovation Infrastructures.