毕业设计(论文)中文题目:微博舆情管理平台数据分析系统的设计与实现英文题目:MicroBlogPublicOpinionManagementPlatform:TheDesignandImplementationoftheDataAnalysisSystem学院:专业:学生姓名:学号:指导教师:年月日北京交通大学毕业设计(论文)中文摘要随着网络技术应用的普及和发展,舆情的传播方式和传播速度都发生了根本性变化,网络舆情对人类的社会状态产生了全方位的影响,微博舆情则是网络舆情的重要组成部分,它的特点有:直接性,突发性,偏差性,丰富性和互动性。本文以微博消息为研究对象,研究了微博消息传播的特点与模型,通过对抓取数据的分析发现了微博传播的单向性,便捷性,背对脸等特点,还有微博意见领袖在微博传播中的重要作用,微博热点的产生规律。根据对数据分析的结果提出了趋势分析的算法。利用空间向量模型完成对微博内容的结构数据化,利用K-means算法完成对微博消息的聚类分析,找到所要分析的某类微博内容,进而在这类微博中找出微博消息意见领袖,提出微博意见领袖影响力评估算法,WeiboRank算法,并结合算法完成了微博消息预警模块的实现,初步实现了微博舆情管理平台的数据预警分析功能。关键词:微博舆情文本聚类趋势分析北京交通大学毕业设计(论文)AbstractAlongwiththeuniversalapplicationandrapiddevelopmentofnetworktechnology,theapproachesthatthenet-mediatedpublicsentimentspreadhavebeenfundamentallychanged.Thenet-mediatedpublicsentimenthasexertedhugeinfluenceonthewaythatthesocietyoperates.Astheoneofthemostsignificantpartsofthenet-mediatedpublicsentiment,thepublicsentimentwhichisproducedandspreadbythemicrobloghasseveralimportantcharacters,suchasdirectness,immediacy,deviation,variability,interactivity.Takingthemicroblogmessagesasourinvestigatingsubject,thispaperaimedtodoresearchonthecharacteristicsandmodelsofdeliveringmessagesbetweenmicroblogusers,Throughtheanalysisofthecapturedatafoundunidirectional,micro-blogcommunicationconvenience,backonthefaceandothercharacteristics,andraisedaneffectivealgorithmtosortthesekindsofmessages.Usingthespatialvectormodel,theK-meansalgorithmdidclusteranalysisonmicroblogmessages,andfoundouttheopinionleadersamongtremendousmessages.Then,aninfluentialestimationalgorithmofthemicroblogopinionleaderswasraised,WeiboRankalgorithm.Togetherwiththeestimationalgorithm,wealsoachievedtheearlywarningpartandsomebasicdatawarninganalysisfunctionsonthewholemicroblog-mediatedpublicsentimentplatform.Keywords:microblog-mediatedpublicsentiment,textclustering,trendanalysis北京交通大学毕业设计(论文)目录一、概述.....................................................................................................11.1课题背景与研究意义........................................................................11.1.1课题背景..................................................................................11.1.2研究现状..................................................................................31.1.3研究意义..................................................................................31.2论文结构.............................................................................................4二、微博消息传播模型...................................................................................42.1微博消息传播的特点.........................................................................42.2微博用户状态.....................................................................................62.3微博意见领袖.....................................................................................72.4微博传播模型.....................................................................................9三、微博舆情管理平台的设计与实现.........................................................123.1微博舆情管理平台的总体流程.......................................................123.2数据分析系统设计流程...................................................................13四、微博舆情管理平台的实现.....................................................................144.1样本选取与数据来源.......................................................................144.2微博数据转化...................................................................................154.3微博文本聚类...................................................................................174.3.1文本聚类定义........................................................................174.3.2机器学习................................................................................184.3.3K-means算法..........................................................................194.4微博意见领袖重要性评估...............................................................214.4.1PageRank算法.......................................................................214.4.2WeiboRank算法....................................................................224.4.3算法对比...............................................................................234.5微博舆情预警模块...........................................................................254.5.1微博舆情预警........................................................................25北京交通大学毕业设计(论文)4.5.2趋势分析模块........................................................................264.6趋势分析结果比较...........................................................................29五、结论与展望.............................................................................................315.1系统不足...........................................................................................315.2未来展望...........................................................................................325.2.1改进预期................................................................................325.2.2新增功能................................................................................325.3结束语.........................................................................................