|
| 1 | +## 项目名称:基于Spark2.x新闻网大数据实时分析可视化系统项目 |
| 2 | + |
| 3 | +### 项目简介 |
| 4 | + |
| 5 | +**目标** |
| 6 | + |
| 7 | +1、完成大数据项目的架构设计,安装部署,架构继承与开发、用户可视化交互设计 |
| 8 | + |
| 9 | +2、完成实时在线数据分析 |
| 10 | + |
| 11 | +3、完成离线数据分析 |
| 12 | + |
| 13 | +**具体功能** |
| 14 | + |
| 15 | +1、捕获用户浏览日志信息 |
| 16 | + |
| 17 | +2、实时分析前20名流量最高的新闻话题 |
| 18 | + |
| 19 | +3、实时统计当前线上已曝光的新闻话题 |
| 20 | + |
| 21 | +4、统计哪个时段用户浏览量最高 |
| 22 | + |
| 23 | +5、报表 |
| 24 | + |
| 25 | +**所用组件** |
| 26 | +Hadoop2.x、Zookeeper、Flume、Hive、Hbase、Kafka、Spark2.x、SparkStreaming、MySQL、Hue、J2EE、websoket、Echarts |
| 27 | + |
| 28 | +### 开发工具 |
| 29 | + |
| 30 | +虚拟机: VMware、centos |
| 31 | + |
| 32 | +虚拟机SSH: SecureCRT(在windows上链接多个虚拟机) |
| 33 | + |
| 34 | +程序编辑器:IDEA |
| 35 | + |
| 36 | +查看各种数据:notepad++(安装NppFTP插件,修改虚拟机中配置文件,好用的一批) |
| 37 | + |
| 38 | +**所有软件下载地址:** |
| 39 | + |
| 40 | +链接:https://pan.baidu.com/s/18wrxmczkzgoNE2WTZwjPSA |
| 41 | +提取码:73q8 |
| 42 | + |
| 43 | + |
| 44 | +### 项目架构 |
| 45 | + |
| 46 | + |
| 47 | + |
| 48 | +### 集群资源规划 |
| 49 | + |
| 50 | +利用VMware虚拟机+centos完成,基本要求笔记本电脑内存在8G以上。 |
| 51 | +最低要去克隆出3台虚拟机,每台给2G内存。 |
| 52 | + |
| 53 | + |
| 54 | +### 项目实现步骤 |
| 55 | + |
| 56 | +[1、第一章:项目需求分析与设计][1] |
| 57 | + |
| 58 | +[2、第二章:linux环境准备与设置][2] |
| 59 | + |
| 60 | +[3、第三章:Hadoop2.X分布式集群部署][3] |
| 61 | + |
| 62 | +[4、第四章:Zookeeper分布式集群部署][4] |
| 63 | + |
| 64 | +[5、第五章:hadoop的高可用配置(HA)][5] |
| 65 | + |
| 66 | +[6、第六章:hadoop的HA下的高可用HBase部署][6] |
| 67 | + |
| 68 | +[7、第七章:Kafka简介和分布式部署][7] |
| 69 | + |
| 70 | +[8、第八章:Flume简介和分布式部署][8] |
| 71 | + |
| 72 | +[9、第九章:Flume源码修改与HBase+Kafka集成][9] |
| 73 | + |
| 74 | +[10、第十章:Flume+HBase+Kafka集成全流程测试][10] |
| 75 | + |
| 76 | +[11、第十一章:mysql、Hive安装与集成][11] |
| 77 | + |
| 78 | +[12、第十二章:Hive与Hbase集成][12] |
| 79 | + |
| 80 | +[13、第十三章:Cloudera HUE大数据可视化分析][13] |
| 81 | + |
| 82 | +[14、第十四章:Spark2.X集群安装与spark on yarn部署][14] |
| 83 | + |
| 84 | +[15、第十五章:基于IDEA环境下的Spark2.X程序开发][15] |
| 85 | + |
| 86 | +[16、第十六章:Spark Streaming实时数据处理][16] |
| 87 | + |
| 88 | +### 项目配套视频 |
| 89 | + |
| 90 | +链接:https://pan.baidu.com/s/1Q-XGRjRwyVa0UFSzfbjFdQ |
| 91 | + |
| 92 | +提取码:qart |
| 93 | + |
| 94 | +### 群内有更多相关电子书籍和1000G网盘资料 |
| 95 | +欢迎加入大数据交流群。 |
| 96 | +QQ群号码:528040253 |
| 97 | + |
| 98 | + |
| 99 | + |
| 100 | + |
| 101 | + |
| 102 | + |
| 103 | + [1]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/1%E3%80%81%E9%A1%B9%E7%9B%AE%E9%9C%80%E6%B1%82.md |
| 104 | + [2]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/2%E3%80%81linux%E9%85%8D%E7%BD%AE.md |
| 105 | + [3]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/3%E3%80%81hadoop%E9%83%A8%E7%BD%B2.md |
| 106 | + [4]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/4%E3%80%81zk%E9%83%A8%E7%BD%B2.md |
| 107 | + [5]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/5%E3%80%81ha%E5%AE%9E%E7%8E%B0.md |
| 108 | + [6]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/6%E3%80%81hbase%E9%83%A8%E7%BD%B2.md |
| 109 | + [7]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/7%E3%80%81kafka%E9%83%A8%E7%BD%B2.md |
| 110 | + [8]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/8%E3%80%81flume%E9%83%A8%E7%BD%B2.md |
| 111 | + [9]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/9%E3%80%81flume-hbase-kfk%E9%85%8D%E7%BD%AE.md |
| 112 | + [10]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/10%E3%80%81flume-hbase-kfk%E8%81%94%E8%B0%83.md |
| 113 | + [11]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/11%E3%80%81mysql-hive.md |
| 114 | + [12]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/12%E3%80%81hive-hbase.md |
| 115 | + [13]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/13%E3%80%81hue.md |
| 116 | + [14]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/14%E3%80%81spark%20on%20yarn.md |
| 117 | + [15]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/15%E3%80%81spark-idea.md |
| 118 | + [16]: https://github.com/TALKDATA/JavaBigData/blob/master/news-bigdataproject/16%E3%80%81spark-streaming1.md |
0 commit comments