MUC3andMUC4DataSets(MUC3和MUC4数据集)数据摘要:Foreachevaluation,groundtruthhadtobeestablishedtodeterminethereliabilityoftheparticipatingsystems.Datasetsweretypicallypreparedbyhumanannotatorsfortraining,dryruntest,andformalruntestusage.ContacttheLDCforlicensingofthetextsandrequestthepublicdomainprepareddatasetsusedinMUCandtheMUCscoringsoftware.TheMUC3andMUC4DataSetsareprovidedcompletelyfreeofchargecourtesyofFBIS(FederalBroadcastInformationServices).中文关键词:自然语言处理,MUC3和MUC4,地面实况,可靠性,FBIS,英文关键词:NaturalLanguageProcessing,MUC3andMUC4,Groundtruth,Reliability,FBIS,数据格式:TEXT数据用途:NaturalLanguageProcessing数据详细介绍:MUC3andMUC4DataSetsAbstractForeachevaluation,groundtruthhadtobeestablishedtodeterminethereliabilityoftheparticipatingsystems.Datasetsweretypicallypreparedbyhumanannotatorsfortraining,dryruntest,andformalruntestusage.Thedatasetisnowbeingmadeavailablewhereverpossibleonthiswebsite.DataDescriptionThetextsusedforMUC6andMUC7arecopyrightedmaterialsandareonlyavailablethroughtheLinguisticDataConsortium(LDC)forasmallfee.Thetextsareavailableas:newswirearticlesforMUC-6(MUC-VITextCollection),andnewswirearticlesforMUC-7(NorthAmericanNewsTextCorpora).ContacttheLDCforlicensingofthetextsandrequestthepublicdomainprepareddatasetsusedinMUCandtheMUCscoringsoftware.TheMUC3andMUC4DataSetsareprovidedcompletelyfreeofchargecourtesyofFBIS(FederalBroadcastInformationServices).Reference数据预览:点此下载完整数据集