Evaluating Capture-Recapture Models with Two Inspe

整理文档很辛苦,赏杯茶钱您下走!

免费阅读已结束,点击下载阅读编辑剩下 ...

阅读已结束,您可以下载文档离线阅读编辑

资源描述

V13–17/05/991EvaluatingCapture-RecaptureModelswithTwoInspectorsKhaledElEmamOliverLaitenbergerNationalResearchCouncil,CanadaInstituteforInformationTechnologyBuildingM-50,MontrealRoadOttawa,OntarioCanadaK1AOR6Khaled.El-Emam@iit.nrc.caFraunhoferInstituteforExperimentalSoftwareEngineeringSauerwiesen6D-67661KaiserslauternGermany+49(0)6301707251laiten@iese.fhg.deAbstractCapture-recapture(CR)modelshavebeenproposedasanobjectivemethodforcontrollingsoftwareinspections.CRmodelswereoriginallydevelopedtoestimatethesizeofanimalpopulations.Theyhavealsobeenusedtoestimatethenumberofdefectsinaninspectedartifact.Armedwiththisestimate,onecandecidewhethertheartifactrequiresareinspectiontoensurethataminimalinspectioneffectivenesslevelhasbeenattained.LittleevaluativeresearchhasbeenperformedthusfarontheutilityofCRmodelsforinspectionswithtwoinspectors.Furthermore,thesestudieshavefocusedontherelativeerrorofthedefectcontentestimatesexclusively.InthispaperwereportonanextensiveMonteCarlosimulationthatevaluatedsixcapture-recapturemodelsfortwoinspectorsassumingacodeinspectionscontext.Inadditiontorelativeerror,weevaluatetheaccuracyofthereinspectiondecision.Thelatterismorecongruentwiththemannerinwhichthesemodelswouldbeusedinpractice.Ourresultsindicatethatthemostappropriatecapture-recapturemodelfortwoinspectorsisanestimatororiginallydevelopedbyChapmanthatallowsforinspectorswithdifferentcapabilities.Thiswillhavearelativelyhighdecisionaccuracyandwillperformbetterthanthedefaultdecisionofnoreinspections.Furthermore,weidentifytheconditionsunderwhichthisestimatorwillperformbest.1IntroductionArecentliteraturereviewfoundthat,onaverage,softwareinspectionsfindonly57%ofdefectsincodeanddesigndocuments[8].Giventhesubstantialdefectdetectioncostsavingsthatcanbeaccruedbyincreasingtheeffectiveness1ofinspections[8],contemporaryresearchhasfocusedonimprovedreadingtechniques(e.g.,see[33][3][19][41])andonreinspections(e.g.,see[24])formaximizingeffectiveness.Thefocusofthispaperisonmaximizinginspectioneffectivenessthroughreinspections.Reinspectionscanbeconsideredpartofthegeneralproblemofwhentostopinspections.Asisthecasewithtesting,oneneedsacriterionbywhichtodecidewhetheradocumentshouldbeinspectedanew,orwhetheritcanpasstothesubsequentphase.Mostorganizationshavenotinstitutionalizedproceduresfordecidingwhentostopsoftwareinspections.Thosethatdohaveutilized,forexample,historicalnormssothatiftoomanydefectsarefoundcomparedtothenormthenthisistakenasevidenceofapoordocument,whiletoofewaretakenasevidenceofapoorinspection[24].However,thisapproachassumesthatvariationsamongreviewsarelargerthanvariationsamongdocuments.Ifthisisnotthecasethenthiscanleadtoreinspectionsofhighqualitydocuments,andlowqualitydocumentsmayeasilypass.Toaddressthesepotentialproblems,onecanuseCapture-Recapture(CR)models.CRmodelswereinitiallydevelopedtoestimatethesizeofanimalpopulations(e.g.,see[38][51]).Inasoftwareengineeringcontext,theyhavebeenappliedincontrollingthetestingprocess[4][30][36][21][37],andmorerecentlytheyhavebeenusedincontrollingtheinspectionprocess[23][24].1Effectivenessisdefinedastheproportionofdefectsinadocumentthatwerefoundduringtheinspection.V13–17/05/992Whenappliedtosoftwareinspections,CRmodelscanbeusedtoestimatethenumberofdefectsintheinspecteddocument.Usingthisestimateandtheknownnumberofdefectsfound,thenumberofremainingdefectsintheinspecteddocumentcanbeestimated.Subsequently,armedwiththisinformation,theinspectionteamcanmakethedecisionastowhetherthedocumentshouldbereinspectedtoreduceitsdefectcontentbeforepassingitontothenextphaseofthelifecycle.ResearchersatBellLabsfirstappliedCRmodelsforrequirementsanddesigninspections[23][24][25].However,inthesestudiesthetruenumberofdefectswasunknownandthereforeanevaluationoftheirtrueefficacywasnotpossible.LaterworkconsistedofaMonteCarlosimulationtoevaluatetherobustnessofdifferentCRestimatorstoviolationsoftheirassumptions[50].ObjectiveempiricalevaluationofCRmodelsstartedwiththestudyofWohlinetal.[53].However,thisstudywasconductedwithnon-softwareengineeringdocuments.Subsequentworkusedsoftwareengineeringartifacts[10][12][35][44].Alloftheaboveworkutilizedmodelsthatwereoriginallydevelopedinwildliferesearch.OtherresearchersconsideredtheincorporationofBayesianmethodstoestimatedefectcontent[5],performedfurtherevaluationsofassumptionviolationswhenusingCRestimates[48],andevaluatedtheapplicabilityofCRmodelstoperspective-basedreading[12][49].Analternativeapproachwasproposedin[54],theDetectionProfileMethod(DPM).TheDPMisanintuitivelyappealingapproachthatcanbeeasilyexplainedgraphicallytononspecialists.AlaterstudysuggestedamethodforselectingbetweenaCRmodelandtheDPM[9],andthiswassubsequentlyfurtherevaluatedin[39].InadditiontotheexperiencesreportedbytheresearchersatBellLabs,theuseoftheDPMataninsurancecompanyinGermanywasreportedin[11],andtheapplicationofCRmodelsintelecommunicationsprojects[1].Therefore,thereisagrowingadoptionofdefectcontentestimationmodelsinindustrialpractice,andspecificallyCRmodels.LittleempiricalinvestigationoftheutilityofCRmodelsforinspectionswithtwoin

1 / 50
下载文档,编辑使用

©2015-2020 m.777doc.com 三七文档.

备案号:鲁ICP备2024069028号-1 客服联系 QQ:2149211541

×
保存成功