Data Mining: Concepts and Techniques (Second Edition)

Data Mining: Concepts and Techniques
The Second Edition

Jiawei Han
Micheline Kamber
University of Illinois at Urbana-Champaign

Morgan Kaufmann Publishers
340 Pine Street, Sixth Floor, San Francisco, CA 94104-3205, USA

© 2006 Academic Press
All rights reserved
Printed in the United States of America

No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means (electronic, mechanical, photocopying, recording, or otherwise) without the prior written permission of the publisher.

To Y. Dora and Lawrence for your love and encouragement
J.H.

To Erik, Kevan, Kian, and Mikael for your love and inspiration
M.K.

[...] scientific data, medical data, demographic data, financial data, and marketing data. People have no time to look at this data. Human attention has become a precious resource. So, we must find ways to automatically analyze the data, to automatically classify it, to automatically summarize it, to automatically discover and characterize trends in it, and to automatically flag anomalies. This is one of the most active and exciting areas of the database research community. Researchers in areas such as statistics, visualization, artificial intelligence, and machine learning are contributing to this field. The breadth of the field makes it difficult to grasp its extraordinary progress over the last few years.

Jiawei Han and Micheline Kamber have done a wonderful job of organizing and presenting data mining in this very readable textbook. They begin by giving quick introductions to database and data mining concepts with particular emphasis on data analysis. They review the current product offerings by presenting a general framework that covers them all. They then cover in a chapter-by-chapter tour the concepts and techniques that underlie classification, prediction, association, and clustering. These topics are presented with examples, a tour of the best algorithms for each problem class, and pragmatic rules of thumb about when to apply each technique. I found this presentation style to be very readable, and I certainly learned a lot from reading the book.

Jiawei Han and Micheline Kamber have been leading contributors to data mining research. This is the text they use with their students to bring them up to speed on the field. The field is evolving very rapidly, but this book is a quick way to learn the basic ideas, and to understand where the field is today. I found it very informative and stimulating, and I expect you will too.

[...] On What Kind of Data?
1.3.1 Relational Databases
1.3.2 Data Warehouses
1.3.3 Transactional Databases
1.3.4 Advanced Database Systems and Advanced Database Applications
1.4 Data Mining Functionalities: What Kinds of Patterns Can Be Mined?
1.4.1 Concept/Class Description: Characterization and Discrimination
1.4.2 Mining Frequent Patterns, Associations, and Correlations
1.4.3 Classification and Prediction
1.4.4 Cluster Analysis
1.4.5 Outlier Analysis
1.4.6 Evolution Analysis
1.5 Are All of the Patterns Interesting?
1.6 Classification of Data Mining Systems
1.7 Data Mining Task Primitives
1.8 Integration of a Data Mining System with a Database or Data Warehouse System
1.9 Major Issues in Data Mining
1.10 Summary
1.11 Exercises
1.12 Bibliographic Notes
2 Data Preprocessing
2.1 Why Preprocess the Data?
2.2 Descriptive Data Summarization
2.2.1 Measuring the Central Tendency