
bigdata's Introduction

1. Recruitment information analysis platform based on big data

This project uses a Hadoop cluster (one master and two slave nodes) as the data-cleaning environment; the cluster runs Hadoop, ZooKeeper, Hive, HBase, and related software. Tomcat hosts the front-end pages. Data cleaning has three stages: MapReduce jobs written in Java handle special-case cleaning, Hive HQL statements handle finer-grained cleaning, and UDFs written in Java handle special-case fine-grained cleaning. On the front end, servlets written in Java query HBase, and JavaScript parses the returned data. ECharts charts and layui are used to polish the presentation, and the results are displayed as web pages.
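As an illustration of the kind of per-record logic a Java MapReduce mapper or Hive UDF might wrap during cleaning, here is a minimal sketch in plain Java. It is not taken from the project's source: the class name `SalaryCleaner`, the method `parseMonthlyRange`, and the assumed raw salary format (`"10k-15k"`) are all hypothetical.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not from the project: normalizes a raw salary field.
final class SalaryCleaner {
    // Assumed raw format: "<digits>k-<digits>k", case-insensitive, optional spaces.
    private static final Pattern RANGE =
            Pattern.compile("^\\s*(\\d{1,4})\\s*[kK]\\s*-\\s*(\\d{1,4})\\s*[kK]\\s*$");

    private SalaryCleaner() {}

    /**
     * Returns {min, max} as whole currency units per month, or null when the
     * field does not match the assumed format, so the caller can drop the record.
     */
    static int[] parseMonthlyRange(String raw) {
        if (raw == null) {
            return null;
        }
        Matcher m = RANGE.matcher(raw);
        if (!m.matches()) {
            return null;
        }
        int min = Integer.parseInt(m.group(1)) * 1000;
        int max = Integer.parseInt(m.group(2)) * 1000;
        // Reject inverted ranges rather than guessing which bound is wrong.
        return (min > max) ? null : new int[] {min, max};
    }
}
```

In a real job, a mapper would presumably call a method like this on one column of each input line and skip the record when it returns null; keeping the logic free of Hadoop types makes it easy to unit-test and to reuse from a UDF.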

2. File description

  • Java: back-end code implementing the servlet-to-HBase interaction

  • shell: automation shell scripts

  • web: web project (Eclipse)

  • project File.pdf: project environment setup document

  • imgs: project-related images
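The Java back-end code hands data to JavaScript for charting. A minimal sketch of one way a servlet could serialize aggregated counts into the `[{"name": ..., "value": ...}]` array shape that an ECharts pie series accepts is shown below. The class `ChartJson` and method `toNameValueJson` are hypothetical, and the HBase read that would populate the map is omitted; the project's actual response format is not documented here.

```java
import java.util.Map;

// Hypothetical helper, not from the project: builds a JSON array for a chart.
final class ChartJson {
    private ChartJson() {}

    /** Serializes category counts as [{"name":"...","value":N}, ...] in map order. */
    static String toNameValueJson(Map<String, Long> counts) {
        StringBuilder sb = new StringBuilder("[");
        boolean first = true;
        for (Map.Entry<String, Long> e : counts.entrySet()) {
            if (!first) {
                sb.append(',');
            }
            first = false;
            sb.append("{\"name\":\"").append(escape(e.getKey()))
              .append("\",\"value\":").append(e.getValue()).append('}');
        }
        return sb.append(']').toString();
    }

    // Escapes quotes, backslashes, and control characters for a JSON string.
    private static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c == '"' || c == '\\') {
                sb.append('\\').append(c);
            } else if (c < 0x20) {
                sb.append(String.format("\\u%04x", (int) c));
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }
}
```

A servlet could write the returned string to the response with content type `application/json`; passing a `LinkedHashMap` keeps the categories in insertion order. A JSON library would be the usual choice in production code; the hand-rolled version only keeps the sketch dependency-free.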

3. Project environment

Fully distributed Hadoop environment: one master and two slave nodes

4. Software version selection

  • jdk-8u201-linux-x64
  • apache-tomcat-7.0.94
  • hadoop-2.6.0-cdh5.14.2
  • MySQL-5.5.40-1.linux2.6.x86_64
  • hive-1.1.0-cdh5.14.2
  • zookeeper-3.4.5-cdh5.14.2
  • hbase-1.2.0-cdh5.14.2

5. Visual presentation of the project


bigdata's People

Contributors

xizhengl
