Code Monkey home page Code Monkey logo

crawling_cnki's Introduction

说明文档

环境

python环境 == python3.6.3    
系统环境 == win10_64

安装包

requirements.txt文件里面

原理

主要运用Python+Selenium+ChromeDriver进行爬虫
由于反爬虫太厉害,而且难度较大,所以用这种方法进行爬虫
注意事项:
    爬虫时候时间尽量再1分钟左右,否则cnki发生点击频繁而拒绝点击
    如果 > 20s CNKI系统会死
    如果还不行,记得driver.delete_cookies()每爬一个,清一次cookie

启动文件

启动文件, main.py

环境安装

安装 Python -下载并配置好环境变量,shell输入:python -V 出现对应版本号即安装成功!
安装pip(Python包管理工具)
安装selenium-pip install selenium 提示:Successfully installed selenium-即安装成功!
安装ChromeDriver-
下载ChromeDriver,注意版本需与浏览器版本对应,附:版本号对应描述(64位浏览器下载32位即可),下载后与chrome安装目录放在一起,然后配置至环境变量即可,配置好后shell输入:chromedriver 无错误即安装成功!
安装python IDE pyCharm

crawling_cnki's People

Contributors

renfanzi avatar

Stargazers

 avatar 人间理想 avatar Ankn Lin avatar  avatar Jason Rabbit avatar  avatar  avatar hello avatar Xiao Jie avatar  avatar

Watchers

James Cloos avatar  avatar

Forkers

wentong2org whigg

crawling_cnki's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.