Python Foreign-Language Literature


... a python Environment for Tree Exploration

Reviewed by Jaime Huerta-Cepas (1, corresponding author), Joaquín Dopazo (2), and Toni Gabaldón (1, corresponding author)

Abstract

Many bioinformatics analyses, ranging from gene clustering to phylogenetics, produce hierarchical trees as their main result. These are used to represent the relationships among different biological entities, thus facilitating their analysis and interpretation. A number of standalone programs are available that focus on tree visualization or that perform specific analyses on them. However, such applications are rarely suitable for large-scale surveys, in which a higher level of automation is required. Currently, many genome-wide analyses rely on tree-like data representation and hence there is a growing need for scalable tools to handle tree structures at large scale.

Keywords: Python, spiking neurons, simulation, integrate and fire, teaching, neural networks, computational neuroscience, software

Background

Here we present the Environment for Tree Exploration (ETE), a python programming toolkit that assists in the automated manipulation, analysis and visualization of hierarchical trees. ETE libraries provide a broad set of tree handling options as well as specific methods to analyze phylogenetic and clustering trees. Among other features, ETE allows for the independent analysis of tree partitions, has support for the extended newick format, provides an integrated node annotation system and allows trees to be linked to external data such as multiple sequence alignments or numerical arrays. In addition, ETE implements a number of built-in analytical tools, including phylogeny-based orthology prediction and cluster validation techniques. Finally, ETE's programmable tree drawing engine can be used to automate the graphical rendering of trees with customized node-specific visualizations.

Conclusions

ETE provides a complete set of methods to manipulate tree data structures that extends current functionality in other bioinformatic toolkits of a more general purpose. ETE is free software and can be downloaded from [...]

[...] (OTU) and internal nodes as their corresponding ancestors. In such phylogenetic trees, branch lengths are usually proportional to the evolutionary distance among sequences. Other applications, such as the analysis of gene expression, use hierarchical clustering analysis to group genes or experimental conditions according to the similarity of their expression patterns. Likewise, trees are used by many protein classification methods and for the analysis of phylogenetic profiles. Thus, the analysis of tree data structures is a common task in many areas of bioinformatics and there is a need for analytical and visualization tools.

In this respect, a number of bioinformatic programs do exist that assist in the exploration of hierarchical trees. Most of them, however, consist of standalone applications that are focused on visualization and, occasionally, on performing specific tests. Some well-known examples are TreeView [1], a widely used program for inspecting phylogenetic trees; Cluster Treeview [2], an application for visualizing microarray clustering results; ATV [3], a Java program used to explore phylogenies which also provides some editing options; MEGA [4], an evolutionary genetics analysis suite that includes a built-in tree viewer; and many other recent applications [5-8]. While all these programs are very useful for managing single trees, they can hardly be automated or adapted to specific needs. Thus, when the analysis of hundreds or thousands of trees is required, the use of standalone programs becomes restrictive, because a much higher level of automation is required.

In such cases, programming toolkits represent a more adequate framework, since they provide tools and methods to handle data at a lower level. Using toolkits, bioinformaticians can easily create their own analysis pipelines and program custom tasks over large collections of data [9]. Several generic bioinformatic toolkits do exist that cover a wide range of programming languages and scopes, with BioPerl [10] and BioPython [11] being the most extensively developed. Together with a broad range of other features, these toolkits allow a certain level of interaction with tree data structures. However, only basic actions are currently supported. Alternatively, the PyCogent [12] and P4 ~pf/p4.html python toolkits can be used to extend this functionality, although they are mostly focused on phylogenetic reconstruction. R [13], a general purpose statistical framework, does include several packages to perform statistical tests on clustering and phylogenetic trees. Nevertheless, these packages are focused on performing specific analyses rather than on providing tree handling and manipulation features. Finally, in contrast to the great number of standalone tree viewers, programming toolkits offer few, if any, graphical rendering possibilities. An intermediate alternative between standalone viewers and programmatic tree rendering is that of th
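
The case made above for programmatic handling of many trees can be illustrated with a small sketch. It uses only the Python standard library and is not ETE's API: the `Node` class and the `parse_newick`, `total_branch_length` and `summarize` functions are hypothetical names introduced here for illustration. The parser assumes basic Newick input (node names and branch lengths only), not the extended newick format mentioned in the text.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Node:
    """Hypothetical minimal tree node (illustration only, not ETE's class)."""
    name: str = ""
    dist: float = 0.0  # length of the branch leading to this node
    children: List["Node"] = field(default_factory=list)

    def leaves(self) -> List["Node"]:
        """Return all terminal nodes below (or at) this node."""
        if not self.children:
            return [self]
        found: List["Node"] = []
        for child in self.children:
            found.extend(child.leaves())
        return found


def parse_newick(text: str) -> Node:
    """Parse a basic Newick string (names and branch lengths only)."""
    text = text.strip()
    if not text.endswith(";"):
        raise ValueError("Newick string must end with ';'")
    pos = 0

    def parse_node() -> Node:
        nonlocal pos
        node = Node()
        if text[pos] == "(":
            pos += 1  # consume '('
            while True:
                node.children.append(parse_node())
                if text[pos] == ",":
                    pos += 1  # another sibling follows
                elif text[pos] == ")":
                    pos += 1  # end of this group of children
                    break
                else:
                    raise ValueError(f"unexpected character at position {pos}")
        start = pos
        while text[pos] not in ",():;":
            pos += 1
        node.name = text[start:pos]
        if text[pos] == ":":
            pos += 1  # consume ':'
            start = pos
            while text[pos] not in ",();":
                pos += 1
            node.dist = float(text[start:pos])
        return node

    root = parse_node()
    if text[pos] != ";" or pos != len(text) - 1:
        raise ValueError(f"unexpected content at position {pos}")
    return root


def total_branch_length(node: Node) -> float:
    """Sum the branch lengths of a node and all of its descendants."""
    return node.dist + sum(total_branch_length(c) for c in node.children)


def summarize(newick_strings: List[str]) -> List[Tuple[List[str], float]]:
    """Return (sorted leaf names, total branch length) for each tree."""
    summary = []
    for text in newick_strings:
        root = parse_newick(text)
        names = sorted(leaf.name for leaf in root.leaves())
        summary.append((names, total_branch_length(root)))
    return summary


if __name__ == "__main__":
    trees = ["((A:1,B:2):0.5,C:3);", "(D:1,(E:1,F:1):2);"]
    for names, length in summarize(trees):
        print(names, length)
```

The same loop works unchanged whether the list holds two trees or several thousand, which is the kind of batch processing that a standalone viewer cannot easily provide.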
