Light

openube / manhuaguipycrawler Goto Github PK

View Code? Open in Web Editor NEW

This project forked from shalvenlaw/manhuaguipycrawler

0.0 2.0 0.0 155 KB

用Python爬取网站"漫画柜"的漫画

License: MIT License

Python 71.02% HTML 28.98%

manhuaguipycrawler's Introduction

ManhuaguiPyCrawler

用Python爬取漫画柜的漫画

开发环境

操作系统: Win10
编辑器: VS Code
编程语言: Python3
Python第三方库: requests, selenium, bs4, lxml
安装Chrome和对应的Chrome Driver, 并把chromedriver.exe的路径放在环境变量Path里, 后者具体可参考如何快速下载、安装和配置chromedriver?

实现原理

用Python+Selenium操作Chrome浏览器, 进行漫画网站的浏览, 按钮点击等, 从而将目标漫画一页一页地下载到本地.

优点

能有效地下载那些异步加载的图片(大多数漫画网站都是采取这种方式加载图片).

缺点

程序运行的时候会自动打开一个Chrome窗口, 因此会占用比较多的系统资源.

注意

漫画柜有时候会因为迷之因素而上不去, 记得挂上VPN.

TODO

下一步想分析生成图片资源url的JavaScript代码, 从而直接找到图片资源的真实地址, 这样不用开浏览器, 少占用资源, 也能加快下载速度.

manhuaguipycrawler's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.