Recruitment information analysis platform based on big data
This project uses a Hadoop cluster (one master and two slaves) as the data-cleaning environment; the cluster runs Hadoop, ZooKeeper, Hive, HBase, and related software. Tomcat serves as the environment for the front-end pages. For data cleaning, MapReduce jobs written in Java perform the special-case cleaning, Hive HQL statements perform the finer-grained cleaning, and UDFs written in Java handle the special cases within that finer-grained cleaning. For the front-end display, servlets written in Java interact with HBase to fetch the data, which is then parsed with JavaScript. ECharts and UI frameworks such as layui are used to polish the presentation, and the final results are shown as web pages.
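As a rough illustration of the per-record cleaning that a MapReduce mapper might perform in this kind of pipeline, here is a minimal plain-Java sketch. It does not depend on Hadoop and is not the project's actual code: the class name `RecordCleaner`, the three-field layout (title, city, salary), and the comma/tab delimiters are all assumptions made for the example.

```java
import java.util.Locale;
import java.util.Optional;

/** Hypothetical cleaning step of the kind a mapper could apply to each raw input line. */
public class RecordCleaner {
    // Assumed record layout: title,city,salary (for example "10K-15K").
    // This layout is illustrative only and is not taken from the project.
    private static final int EXPECTED_FIELDS = 3;

    /**
     * Returns the cleaned, tab-separated record, or an empty Optional
     * when the line is malformed and should be dropped.
     */
    public static Optional<String> clean(String rawLine) {
        if (rawLine == null) {
            return Optional.empty();
        }
        // Limit -1 keeps trailing empty fields so they can be detected below.
        String[] fields = rawLine.split(",", -1);
        if (fields.length != EXPECTED_FIELDS) {
            return Optional.empty();
        }
        String[] cleaned = new String[EXPECTED_FIELDS];
        for (int i = 0; i < fields.length; i++) {
            String value = fields[i].trim();
            if (value.isEmpty()) {
                return Optional.empty(); // drop records with a missing field
            }
            cleaned[i] = value;
        }
        // Normalize the salary text so "10K-15K" and "10k-15k" compare equal.
        cleaned[2] = cleaned[2].toLowerCase(Locale.ROOT);
        return Optional.of(String.join("\t", cleaned));
    }

    public static void main(String[] args) {
        System.out.println(clean(" Java Engineer , Beijing ,10K-15K ").orElse("<dropped>"));
        System.out.println(clean("Data Analyst,,8k-12k").orElse("<dropped>"));
    }
}
```

In a real job, a mapper's `map` method would call something like `clean` on each input line and emit only the records that survive. Keeping the logic in a plain static method makes it easy to test without a running cluster.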
File description
- Java: back-end Java code implementing the servlets that interact with HBase
- shell: automation shell scripts
- web: web project (Eclipse)
- project File.pdf: project environment setup document
- imgs: project-related images
Project environment
Fully distributed Hadoop environment: one master and two slaves
Software version selection
- jdk-8u201-linux-x64
- apache-tomcat-7.0.94
- hadoop-2.6.0-cdh5.14.2
- MySQL-5.5.40-1.linux2.6.x86_64
- hive-1.1.0-cdh5.14.2
- zookeeper-3.4.5-cdh5.14.2
- hbase-1.2.0-cdh5.14.2
Visual presentation of the project