中国英语学习者语料库CLEC收集了包括中学生、大学英语4级和6级、专业英语低年级和高年级在内的5种学生的语料一百多万词,并对言语失误进行标注。其目的就是观察各类学生的英语特征和言语失误的情况,希望通过定量和定性的方法对中国学习者英语作出较为精确的描写,为我国学生的英语教学提供有用的反馈信息。表1CLEC语料分布类型词次ST2208088ST3209043ST4212855ST5214510ST6226106总计1070602言语失误标注原则1.简单合理,易于系统操作。参与标注的人比较多,分类表过于繁复,就难于掌握。我们采取两级分类,第一级有11类:词形(fm)、动词短语(vp)、名词短语(np)、代词(pr)、形容词短语(aj)、副词(ad)、介词短语(pp)、连词(cj)、词汇(wd)、搭配(cc)、句子(sn)。每一类里再用数目字细分。如[cc]为词语搭配不当,[cc1]表示名词和名词的搭配,[cc2]表示名词和动词的搭配,[cc3]表示动词和名词的搭配,等等。2.分类表的类别要适中。过粗容易统一,但信息太少,不利于分析学习者的失误/过细难以统一,容易把同一种失误归到不同类别。目前我们采取的办法是对常见的失误从细(如vp和np都有9小类),对少见的失误从粗(如cj只有两小类)。现在的分类表有61个失误码,是属于中等规模的分类表。提供足够的失误信息(失误本身、失误类型和失误发生范围)。例如Inthepast,peopleare[vp6,4-]kindtoeachother…,失误用方括号表示,放在失误之后。[vp6]为vp(动词)第6种(时态)失误,4-为失误发生的范围,-表示失误的位置,4表示失误前有4个词。要联系这4个词,才能判断are这个词用错了。开放性。容许研究者根据需要对失误类型进行补充或进一步再分出细类。例如[sn8]为句子结构有缺陷,研究者可以对这种失误再分为若干细类来研究。这需要把sn8的失误全部检索出来,然后定出第三级的分类范畴,如sn81,sn82,等等。5.对语体或失误的来由暂不作标注,因为这需要标注者较多的主观判断,更难以统一。言语失误分类表(总数:61)词形动词短语名词短语代词码类型码类型码类型码类型fm1Spellingvp1patternnp1patternpr1Referencefm2wordbuildingvp2setphrasenp2setphrasepr2anticipatoryitfm3capitalizationvp3agreementnp3agreementpr3Agreementvp4finite/non-finitenp4casepr4Casevp5non-finitenp5countabilitypr5wh-vp6tensenp6numberpr6Indefinitevp7voicenp7articlevp8moodnp8quantifiersvp9modal/auxiliarynp9otherdeterminers形容词短语副词介词短语连词码类型码类型码类型码类型aj1patternad1orderpp1patterncj1patternaj2setphrasead2modificationpp2setphrasecj2setphraseaj3degreead3degreeaj4-ed/-ingconfusionaj5predicative/attributive词语搭配句子码类型码类型码类型wd1ordercc1noun/nounsn1run-onsentencewd2partofspeechcc2noun/verbsn2sentencefragmentwd3substitutioncc3verb/nounsn3danglingmodifierwd4absencecc4adj/nounsn4illogicalcomparisonwd5redundancycc5verb/advsn5topicprominencewd6repetitioncc6adv/adjsn6Coordinationwd7ambiguitysn7Subordinationsn8structuraldeficiencysn9Punctuation标注说明码分类类别说明fm1wordSpelling(拼写)spelling,coinage,abbreviation,apostrophefm2wordwordbuilding(构词)derivation,inflection,compounding,plurality(noun),irregularity(verb),3rdpersonsingularform(verb),syllabification,hyphenation,worddivisionorfusionfm3wordCapitalization(大小写)lowerinitialletterforupperinitialletterorviceversavp1vbphrPattern(及物性型式)errorintransitivity(viasvtorviceversa),transitiveverbpattern/grammatical(cfOxfordadvancedlearner’sdictionaryofcurrentEnglisheditedbyA.S.Hornby)vp2vbphrsetphrase(固定词组)phrasalverbandverbalphrase:errorinformorusevp3vbphrAgreement(主谓一致性)numberagreementwithitssubject(nounorpronoun)vp4vbphrfinite/non-finite(定式)finiteverbfornon-finiteverborviceversavp5vbphrnon-finite(不定式)infinitiveerror:formanduse/infinitiveforparticipleorviceversa/-edparticiplefor-ingparticipleorviceversavp6vbphrTense(时态)errorintenseusewithinasentence/thesequenceoftensesbetweensentencesvp7vbphrvoice(语态)errorintheuseofvoice:activeforpassiveorviceversavp8vbphrMood(语气)errorintheuseofmood:imperative,subjunctive/improperstructureofconditionalsentencesvp9vbphrmodal/auxiliary(情态)misuseofmodal/auxiliaryverbs/wrongformofmodalverb(orauxiliaryverb)andverbcombination(e.gtenseform,voiceform,etc)np1nnphrPattern(名词型式)Errorincombinationwithotherwords/grammaticalnp2nnphrsetphrase(固定词组)omissionorreplacementofafixedelementthatgoesafteracertainnounnp3nnphrAgreement(主谓一致性)numberagreementofanounwithitsdeterminerorawordthatreferstoitnp4nnphrCase(格)possessivecaseerror:formorusenp5nnphrCountability(可数性)uncountablenounusedascountablenounnp6nnphrNumber(数)countablenounusedwithnodetermineror-s/aor-swithpluralnounnp7nnphrArticle(冠词)a/anconfusionordefinite/indefiniteconfusionnp8nnphrQuantifiers(数量词)misuseorconfusionbetweenmany/much,(a)few/(a)little,some/any,etcnp9nnphrotherdeterminers(其他限定词)misuseorconfusionofdemonstratives,wh-determiners,numerals,etc.pr1pronReference(指称)incorrect/ambiguouspronounreference/anaphoricpr2pronanticipatoryit(先行it)improperorwronguseofanticipatoryit/itreplacedbyademonstrative,etcpr3pronAgreement(主谓一致性)numberagreementwithanounitreferstopr4pronCase(格)caseerrorofanypersonalpronounpr5pronwh-(wh-代词)misuseorconfusionofinterrogative,relativeandconjunctivepronounspr6pronIndefinite(不定式)misuseorconfusionofindefinitepronounssuchasall/both,few/little,some/any,either/neither,etcaj1adjPattern(形容词型式)errorinthecombinationwithotherwords/grammaticalaj2adjsetphrase(固定词组)errorintheidiomaticuseofanadjectivalphrase/omissionorreplacementofafixedelementthatgoesafteracertainadjectiveaj3adjDegree(级)adjectivedegreeerror:formanduseaj4adj-ed/-ingconfusion(-ed/-ing混淆)-edadjectivefor-ingadjectiveorviceversaaj5adjpredicative/attributive(谓语/定语)predicativeadjectiveusedasattributiveadjectivead1advOrder(词序)improperadverbplacement/wrongpositionad2advModification(修饰语)adjectivemodifierusedasverbmodifier/otherkindsofconfusionad3advDegree(级)adverbdegreeerror:formandusepp1prepPattern(介词型式)unacceptablecombinationwithotherwords/grammaticalpp2prepsetphrase(固定词组)errorintheformationoruseofanidiomaticprepositionalphrasecj1conjPattern(连词型式)unacceptablecombinationwithotherwords/grammaticalcj2conjsetphrase(固定词组)errorintheformationoruseofaphrasefunctioningasaconjunctionwd1wordOrder(词序)misplacementofanywordotherthananadverbwd2wordpartofspeec