Code Monkey home page Code Monkey logo

johnhus / recruit Goto Github PK

View Code? Open in Web Editor NEW

This project forked from frank-qlu/recruit

1.0 0.0 0.0 42.17 MB

recruit 招聘爬虫+数据分析 1.爬虫: 采用Scrapy 分布式爬虫技术,使用mongodb作为数据存储,爬取的网站Demo为51job,数据我目前爬了有几千条 2.数据处理: 采用pandas对爬取的数据进行清洗和处理 2.数据分析: 采用flask后端获取mongodb数据,前端使用bootstrap3.echarts以及D3的词云图,如果喜欢请star or Fork,预览详见

Home Page: http://www.xunguo.site

License: Apache License 2.0

JavaScript 0.09% Python 98.43% C 1.22% Fortran 0.03% Tcl 0.09% PowerShell 0.01% CSS 0.03% HTML 0.08% Smarty 0.01% Mako 0.01% Batchfile 0.01%

recruit's Introduction

recruit

招聘爬虫+数据分析 1.爬虫: 采用Scrapy 分布式爬虫技术,使用mongodb作为数据存储,爬取的网站Demo为51job,数据我目前爬了有几万条 2.数据处理: 采用pandas对爬取的数据进行清晰和处理 2.数据分析: 采用flask后端获取mongodb数据,前端使用bootstrap3.echarts以及D3的词云图

###注意:1. pymongo安装版本 <=3.0 建议 pip install pymongo==2.8### 2. 如果scrapy安装不上,在这上面查找https://www.lfd.uci.edu/~gohlke/pythonlibs/ 先安装对应版本 twisted ,再安装scrapy就没问题。 3.mongodb启动,进入安装mongodb的文件夹的bin目录下面,输入 mongod --dbpath= data文件夹路径

关于项目启动

  1. 爬虫:

    1.cd 目录 2. pip install pymongo==2.8 3. scrapy crawl zlzp

  2. 数据可视化

    1. 激活虚拟环境 cd venv/Scripts activate
    2. python zlzpView.py

version 1.0:


    首次更新项目

version 2.0(2019-06-17更新):


   1.优化界面,采用blueprint设计模式
   2.添加高级搜索(聚合查找)
   3.添加前后台,增添redis数据库
   3. 后台设置招聘信息过期时间
   4.后台用户管理

version 3.0(未来):


   1.采用flask-restful
   2.优化数据分析模块
   3. 设置兴趣标签,添加推荐系统,相似职位推荐

项目预览

###

recruit's People

Contributors

frank-qlu avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.