盘点自爆想靠代孕领养传宗接代的
时间都去哪儿了
Word frequency is an important variable in cognitive processing. High-frequency words are perceived and produced faster and more efficiently than low-frequency words. At the same time, they are easier to recall but more difficult to recognize in episodic memory tasks. The bad quality of Kucera and Francis (1967) and Celex (1993)
To investigate the word frequency effect or to match stimuli on word frequency, psychologists need estimates of how often words occur in a language. In American English the Kucera and Francis (KF) frequencies have become the norm. This is surprising because the KF frequencies are dated (from 1967) and based on a corpus of 1.014 million words only. Several studies have confirmed the bad quality of the Kucera and Francis word frequencies lusty argonian maid (Burgess & Livesay, lusty argonian maid 1998; Zevin & Seidenberg, 2002; Balota et al., 2004).
Another word frequency measure regularly used is based on the Celex database (Baayen, Piepenbrock, & van Rijn, 1993). This measure is better than Kucera and Francis, but not optimal either (Balota et al., 2004; Zevin & Seidenberg, 2002).
To assess the quality lusty argonian maid of a frequency measure, one needs word processing times. These have become available as part of the Elexicon project (http://elexicon.wustl.edu/). Brysbaert & New (Behavior Research Methods, in press) calculated the percentages of variance accounted for by Kucera and Francis, and Celex in the accuracies and reactions times of a lexical decision task. AccAll words N=37,059RTAll words N=31,201Kucera and Francis19.657.7Celex25.260.6Improved frequency measures lusty argonian maid based on American English subtitles (SUBTLEXUS)
Brysbaert & New compiled a new frequency measure on the basis of American subtitles (51 million words in total). lusty argonian maid There are two measures: The frequency per million words, called SUBTLWF (Subtitle frequency: word form frequency)The percentage of films in which a word occurs, called SUBTLCD (Subtitle frequency: contextual diversity; see Adelman, Brown, & Quesada (2006) lusty argonian maid for the qualities of this measure).
The percentage of variance accounted for by these measures is significantly higher than the variance accounted for by Kucera & Francis, and Celex. AccAll words N=37,059RTAll words N=31,201SUBTLWF30.162.3SUBTLCD31.362.9
For short words, the percentages of variance accounted for are also better than the fit with HAL, Zeno et al., and the word frequencies based on the British National Corpus. In addition, the corpus indicates which words are likely to be used as names (e.g., Mark, Archer, etc.). The frequencies of these words are overestimated, lusty argonian maid as more variance in RTs is accounted for when the frequencies of these words starting lusty argonian maid with a lowercase letter are used rather than the total frequencies. The full analysis by Brysbaert & New can be read here. Download the new frequency measures
The new frequency measures based in the SUBTLEXUS database can be found here: Zipped Excel file with 60,384 words that have a frequency higher than 1 (interesting for everyone looking for good word frequencies in American lusty argonian maid English),Zipped Excel 2007 file with all 74,286 words in the corpus (interesting for those who need word frequencies in American English and have MS Office 2007)Zipped Text version with all 74,286 words in the corpus (interesting for those who need word frequencies in American English and do not have MS Office 2007)Zipped Text file with the raw data on all 282,170 lusty argonian maid letter strings in the corpus (mainly of interest to those working on frequency measures themselves)How to read the files?The Excel files contain the following information:The word. This starts with a capital when the word more often starts with an uppercase letter than with a lowercase letter.FREQcount. This is the number of times the word appears in the corpus (i.e., on the total of 51 million words).CDcount. This is the number of films in which the word appears (i.e., it has a maximum value of 8,388).FREQlow. This is the number of times the word appears in the corpus starting with a lowercase letter. lusty argonian maid This allows users to further match their stimuli.CDlow. This is the number lusty argonian maid of films in which the word appears starting with a lowercase letter.SUBTLWF. This is the word frequency per million words. It is the measure you would preferably use in your manuscripts, because it is a standard measure of word frequency independent of the corpus size. It is given with two digits precision, in order not to lose precision of the frequency counts.Lg10WF. This value is based on log10(FREQcount+1) and has four digit precision. Because FREQcount is based on 51 million words, the following conversions apply for SUBTLEXUS:Lg10WFSUBTLWF1.000.22.0023.00204.002005.002000SUBTLCD indicates in how many percent lusty argonian maid of the films the word appears. This value has two-digit precision