GR corpus (Dimitropoulou et al.,2010) to the around 45,000 broadcasts in the SUBTLEX-UK corpus (van Heuven et al.,2014). Similarly, the final numbers of words (tokens) included in various corpora differ, ranging from two million words in SUBTLEX-AL (Avdyli & Cuetos,2013) to more t...
N TheLancasterCorpusofMandarinChinese(LC MC),basedonacorpusof73 millioncharacters(50millionwords;seehttp://.lancs.ac.uk/fass/projects/corpus/ LC M C/,checkedonSepte mber24,2009).ThisisthecorpusunderlyingAfrequencydictionaryofmandarinChinese:Corevocabularyforlearners[6]. N TheAcademiaSinicaBalancedCo...
(SUBTLEX-PT-BR; Tang, 2012), Albanian (SUBTLEX-AL; Avdyli & Cuetos, 2013), British English (SUBTLEX-UK; van Heuven, Mandera, Keuleers, & Brysbaert, 2014), European Portuguese (SUBTLEX-PT; Soares et al., 2015), and Polish (SUBTLEX-PL; Mandera, Keuleers, Wodniecka, & Brys...