1AbouttheInstructorName:GongZhiguoOffice:N415Phone:3974465E-Mail:zggong@umac.mo2FromFileProcessingtoDBMSFileofcurrentaccountsFileofsavingaccountsFileofcustomersProgram4customerinformationProgram1depositwithdrawProgram2transferProgram3printingstmtDBMSBANKDATABASEProgram4customerinformationProgram1depositwithdrawProgram2transferProgram3printingstmt3DDBS=Database+NetworkingThetechnologyofcomputernetworks,promotesamodeofworkthatgoesagainstallcentralizationeffortsandfacilitatesdistributedcomputingDistributeddatabasesystemtechnologyistheunionofwhatappeartobediametricallyopposedapproachestodataprocessing:DatabaseSystem,ComputerNetworktechnologiesAdatabasesystemaimsatintegratingtheoperationaldataofanenterprise,andtoprovideacentralizedandcontrolledaccesstothatdata4DistributedComputingSystemAdistributedcomputingsystemconsistsofanumberofautonomousprocessingelements(notnecessarilyhomogeneous)interconnectedbyacomputernetworkcooperateinperformingtheirassignedtasksWhatisdistributed?ProcessingLogicFunctionDataControlAllthesearenecessaryandimportantfordistributeddatabasetechnology5DistributedDBMSEnvironmentSite1Site2Site4Site3Site5Site6CommunicationNetwork6DistributedDatabaseSystemAdistributeddatabaseisacollectionofmultiple,logicallyinterrelateddatabasesdistributedoveracomputernetwork;storesdataonmultiplecomputers(nodes)overthenetworkandpermitsaccessfromanynodetothejointdataAdistributeddatabasemanagementsystem(DDBMS)isasoftwaresystemthatpermitsthemanagementofthedistributeddatabasesandmakesthedistributiontransparenttotheusers.7WhatisnotaDistributedDatabaseSystem?ADDBSisnota``collectionoffiles''thatcanbeindividuallystoredateachnodeofacomputernetworkfilesarenotlogicallyrelatednoaccessviacommoninterface8CentralizedDBMSonaNetworkdataresidesonlyatonenodethedatabasemanagementisnodifferentfromcentralizedDBMSremoteprocessing,singleservermultipleclientsSite1Site2Site4Site3Site5Site6CommunicationNetwork9MultiprocessorSystemsSharedMemory(SMP)SharedDiskSharedNothing(network)Sequent,SGI,SunVMScluster,SysplexTandem,Teradata,SP2CLIENTSMemoryProcessorsCLIENTSCLIENTS10DistributedDatabaseSystemTechnologyThekeyisintegration,notcentralizationDistributeddatabasetechnologyattemptstoachieveintegrationwithoutcentralizationDatabaseTechnologyComputersNetworksDistributedDatabaseSystemsIntegrationIntegrationWithoutCentralizationDistributedComputing11ExampleMultinationalmanufacturingcompany:headquartersinMacaumanufacturingplantsinNanningandKunmingwarehousesinZhongshanandDongguanR&DfacilitiesinBeijingDataandInformation:employeerecords(workinglocation)projects(R&D)engineeringdata(manufacturingplants,R&D)inventory(manufacturing,warehouse)12PromisesofDistributedDBMStransparentmanagementofdistributed,fragmented,andreplicateddataimprovedreliabilityandavailabilitythroughdistributedtransactionsimprovedperformancehighersystemextendibility13TransparencyTransparencyreferstoseparationofthehigher-levelsemanticsofasystemfromlower-levelimplementationdetails.FromdataindependenceincentralizedDBMStofragmentationtransparencyinDDBMS.IssuesWhoshouldprovidetransparency?Whatisthestateoftheartintheindustry?14ImprovedReliabilityDistributedDBMScanusereplicatedcomponentstoeliminatesinglepointfailure.Theuserscanstillaccesspartofthedistributeddatabasewith“propercare”eventhoughsomeofthedataisunreachable.Distributedtransactionsfacilitatemaintenanceofconsistentdatabasestateevenwhenfailuresoccur.15ImprovedPerformanceSinceeachsitehandlesonlyaportionofadatabase,thecontentionforCPUandI/Oresourcesisnotthatsevere.Datalocalizationreducescommunicationoverheads.Inherentparallelismofdistributedsystemsmaybeexploitedinter-queryparallelismintra-queryparallelismPerformancemodelsarenotsufficientlydeveloped.16EasierSystemExpansionAbilitytoaddnewsites,data,andusersovertimewithoutmajorrestructuring.Hugecentralizeddatabasesystems(mainframes)arehistory(almost!).PCrevolution(CompaqbuyingDigital,1998)willmakenaturaldistributedprocessingenvironments.Newapplications(suchas,supplychain)arenaturallydistributed-centralizedsystemswilljustnotwork.17DisadvantagesofDDBSsLackofExperienceNooperatingtruedistributeddatabasesystemsinexistenceComplexityDDBSproblemsareinherentlymorecomplexthancentralizedDBMSonesCostMorehardware,softwareandpeoplecostsDistributionofcontrolProblemsofsynchronizationandcoordinationtomaintaindataconsistencySecurityDatabasesecurity+networksecurityDifficulttoconvertNotoolstoconvertcentralizedDBMSstoDDBSs18ComplicatingFactorsDatamaybereplicatedinadistributedenvironment,consequentlytheDDBSisresponsibleforchoosingoneofthestoredcopiesoftherequesteddataforaccessincaseofretrievalsmakingsurethattheeffectofanupdateisreflectedoneachandeverycopyofthatdataitemIfthereissite/linkfailurewhileanupdateisbeingexecuted,theDDBSmustmakesurethattheeffectswillbereflectedonthedataresidingatthefailingorunreachablesitesassoonasthesystemrecoversfromthefailure19ComplicatingFactorsMaintainingconsistencyofdistributed/replicateddata.Sinceeachsitecannothave