HUNAN UNIVERSITY

Graduation Project (Thesis)

Title: Automation of Data Collection and Data Mining
Student name: Wu Xiangbin (武祥斌)
Student ID: 20041610326
Major and class: Software Engineering, Class of 2004, Development Class 2
Advisor: Li Wei (李玮)
Department head (Dean): Lin Yaping (林亚平)

May 26, 2008

Hunan University Graduation Thesis, page 2. School of Software, Hunan University

Automation of Data Collection and Data Mining

Abstract

As society and the economy develop, financial markets have become extraordinarily large and complex, and funds, as a financial derivative product, hold a place in those markets. As the fund industry keeps growing, funds of many types have entered the market. A wealth-management product is meant to serve the public and help people manage their money, yet investors who face a large number of fund products often do not know how to choose among them. Our goal is therefore to help ordinary investors and financial institutions make that judgement and to guide them toward the funds that suit them.

Reaching this goal requires a large body of supporting data, so collecting data on these fund products is essential. Financial institutions around the world supply this data every day. The main function of our system is to collect it while keeping it complete and correct. The collection is carried out by programs and needs no manual intervention. The system is written in C# and makes extensive use of XML. This thesis analyzes the design characteristics of the three-tier architecture in some detail; with that architecture, the system offers a relatively complete set of functions, a good user interface, and good extensibility.

The thesis also describes its innovations and looks ahead to the direction and prospects of automated data collection and data mining.

Keywords: fund, XML, data mining

Automation of Data Collection and Data Mining

ABSTRACT

Author: Wu Xiangbin
Tutor: Li Wei

At present, with social and economic development, financial markets have become unusually large and complex, and the fund, as a financial derivative product, has a place in them. With the continuous development of the industry, various types of funds have entered the financial markets. Financial products exist to serve the public and help people manage their finances, but when investors face a large number of fund products, they do not know what to choose. Our goal is to help ordinary investors and financial institutions make a judgement and to guide them in choosing the funds that suit them.

To achieve these objectives we need a large amount of supporting data, so collecting data on these fund products is very important. Every day, financial institutions from all over the world provide us with this data. Our goal is to acquire it, and maintaining its integrity and accuracy is the main function of this system. We collect the data programmatically, without manual intervention. The system uses the C# language and a large amount of XML technology, and the thesis gives a fairly detailed analysis of the design features of the three-tier structure. The three-tier design yields a system with a relatively complete set of features, a good user interface, and good scalability.

This thesis also discusses its points of innovation and the direction and prospects of automated data collection and data mining.

Keywords: fund, XML, data mining

Contents

1 Introduction
  1.1 Overview of the Project
  1.2 Purpose and Significance of the Project
2 Technical Background
  2.1 The Concept of Web Services
  2.2 Advantages of .NET Web Services
  2.3 XML
  2.4 System Architecture
    2.4.1 The Traditional Two-Tier Structure
    2.4.2 Introduction to the Three-Tier Structure
    2.4.3 Deploying a Three-Tier Architecture with ASP.NET
    2.4.4 IIS
    2.4.5 Principles for Establishing the Architecture
  2.5 Data Mining
    2.5.1 What Is Data Mining
    2.5.2 What Data Mining Can Do
    2.5.3 Implementing Data Mining
3 System Functional Design
  3.1 Overview
  3.2 Downloader Module
    3.2.1 Main Processing Flow
    3.2.2 Class Diagram
    3.2.3 Implementation
  3.3 Parser Module
    3.3.1 Main File-Processing Flow
    3.3.2 Class Diagram
    3.3.3 Implementation
  3.4 Importer Module