language testing (pma)3-4(2)


Session 3 Reliability and Validity

When making a test, there are two basic factors to consider: validity and reliability. In this chapter, we will discuss what these are and how they must be considered.

1. Definition

Reliability is concerned with answering the question “How much of an individual’s test performance is due to measurement error, or to factors other than the language ability we want to measure?” and with minimizing the effects of these factors on test scores. Validity is concerned with the question “How much of an individual’s test performance is due to the language abilities we want to measure?” and with maximizing the effects of these abilities on test scores (Bachman, 1990: 160-161).

Validity

Validity can be defined as the degree to which a test actually tests what it is intended to test. If the purpose of a test is to test ability to communicate in English, then it is valid if it does actually test ability to communicate. If what it is testing is actually knowledge of grammar, then it is not a valid test of ability to communicate.

This definition has two very important aspects. The first is that validity is a matter of degree. Tests are not simply valid or invalid. There are degrees of validity, and some tests are more valid than others. The second is that tests are only valid or invalid in terms of their intended use. If a test is intended to test reading ability, but it also tests writing, then it may not be valid for testing reading, though it may test reading and writing together.

Relationship between reliability and validity

In order for a test to be valid, it first needs to be reliable. Investigation of reliability and validity can be viewed as complementary aspects of identifying, estimating, and interpreting different sources of variance in test scores.

[Figure: Factors affecting performance of examinees. Communicative language ability contributes meaningful variance (directly related to the purpose of the test) to the TEST SCORE; test method facets, personal attributes, and random factors each contribute measurement error (error variance). (Bachman, 1990: 165; Brown, 1996: 189)]

Variance is a measure of variability, as is the standard deviation (SD).

Variance:

$$s^2 = \frac{\sum (x - \bar{x})^2}{N - 1}$$

Standard deviation:

$$s = \sqrt{\frac{\sum (x - \bar{x})^2}{N - 1}}$$

Z-score:

$$z = \frac{x - \bar{x}}{s}$$

Validity and reliability have a complicated relationship. If a test is valid, it must also be reliable. A test that gives different results at different times cannot be valid. However, it is possible for a test to be reliable without being valid. That is, a test can give the same result time after time but not be measuring what it was intended to measure.

Reliability is concerned with determining how much of the variance in test scores is reliable variance, while validity is concerned with determining what abilities contribute to this reliable variance.

Reliability: sources of error in test scores, that is, the test scores themselves.
Validity: test performance and factors outside the test itself.

Campbell (1959: 83) places the two on a continuum:

Reliability: agreement between similar measures of the same trait (e.g. correlation between scores on parallel tests).
Validity: agreement between different measures of the same trait (e.g. correlation between scores on a multiple-choice test of grammar and ratings of grammar on an oral interview).

[Figure series: Balance between reliability and validity, with the labels Validity, Conditions, and Reliability. Captioned panels: prescientific period, psychometric period, integrative-sociolinguistic period.]

Although it is essential to consider both reliability and validity in the development and use of language tests, the distinction between them may not always be clear.

2. Reliability

Classical true score theory

$$x = x_t + x_e$$

Here x is the observed score, x_t is the true score, and x_e is the error score.

$r_{xx'} = .91$: the scores are 91% consistent, or reliable, with 9% measurement error.

Three approaches to estimating reliability within the classical true score model

Test-retest reliability: how consistent test scores are over time. The interval between the two tests should be neither too short nor too long.
Equivalence: the extent to which scores on alternate forms of a test are equivalent.

Test-Retest Reliability

Determining test-retest reliability is not a simple matter. There are various ways of trying to measure it, but each of them has potential problems.

Test-retest. One way of measuring reliability is to give the same test twice to the same group of students. However, if a test is given twice, particularly if there is not much time between the two tests, the students might do better the second time due to a practice effect. On the other hand, if there is a longer time between the two tests, the practice effect is less likely to matter, but the students' English proficiency may have improved in the meantime.

Parallel groups. Another way to determine reliability is to have two parallel groups take the same test. The problem is determining whether the two groups are truly parallel.

Parallel tests. Reliability can also be measured by giving parallel tests, that is, two similar tests with the same type and number of items, the same instructions, etc. The problem with this approach is determining whether the two tests are actually parallel.

Parallel tests

The true score on one test is equal to the true score on the other.
The error variances for the two tests are equal.

In practice, we virtually never have strictly parallel tests, we treat two tests para ...
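
As a rough illustration of the variance, standard deviation, and z-score definitions given earlier, and of how a test-retest reliability coefficient might be estimated, the sketch below computes each quantity in plain Python. It is not taken from the original slides. The score lists and function names are hypothetical, and using the Pearson correlation between two sittings as the reliability estimate is an assumption based on the usual classical true score practice.

```python
import math


def sample_variance(scores):
    """Variance with an N - 1 denominator, matching the formula above."""
    n = len(scores)
    mean = sum(scores) / n
    return sum((x - mean) ** 2 for x in scores) / (n - 1)


def standard_deviation(scores):
    """Square root of the sample variance."""
    return math.sqrt(sample_variance(scores))


def z_scores(scores):
    """Distance of each score from the mean, in standard deviation units."""
    mean = sum(scores) / len(scores)
    s = standard_deviation(scores)
    return [(x - mean) / s for x in scores]


def pearson_r(first, second):
    """Pearson correlation, computed as the sum of z-score products over N - 1."""
    if len(first) != len(second):
        raise ValueError("both score lists must cover the same examinees")
    products = (a * b for a, b in zip(z_scores(first), z_scores(second)))
    return sum(products) / (len(first) - 1)


if __name__ == "__main__":
    # Hypothetical scores for the same six students on two sittings of one test.
    first_sitting = [52, 60, 65, 71, 78, 84]
    second_sitting = [55, 58, 68, 70, 80, 83]

    r = pearson_r(first_sitting, second_sitting)
    print(f"SD of first sitting: {standard_deviation(first_sitting):.2f}")
    print(f"Estimated test-retest reliability: {r:.2f}")
```

A coefficient near 1 would suggest that the two sittings rank the students consistently. As noted above, practice effects or real gains in proficiency between sittings can distort such an estimate.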
