The crawler from 0qiqi0

crawler's Introduction

这是一个基于node.js的爬虫项目

1、task文件夹下的代码实现：获取网页信息（百度小说）并将所爬内容存入mysql数据库的功能

数据库中有两张表①小说分类表category；②所有分类中的小说的汇总表：articles

2、准备路由，当访问根目录时候，使用async.parallel方法并行读取数据库中的两张表

3、将爬取的内容显示在网页上，使用jade模板

4、部署：

A 将工程通过git命令pull到部署好的阿里云服务器上

B 安装pm2模块，防止node进程down掉

C 安装nginx，配置。启动项目。

5、展望：丰富web页面，增加查询功能、点击文章加入贴吧功能、点击分类切换文章

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.

Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

TensorFlow

An Open Source Machine Learning Framework for Everyone

Django

The Web framework for perfectionists with deadlines.

Laravel

A PHP framework for web artisans

D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

web

Some thing interesting about web. New door for the world.

server

A server is a program made to process requests and deliver data to clients.

Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

Visualization

Some thing interesting about visualization, use data art

Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.

Microsoft

Open source projects and samples from Microsoft.

Google

Google ❤️ Open Source for everyone.

Alibaba

Alibaba Open Source for everyone

D3

Data-Driven Documents codes.

Tencent

China tencent open source team.

0qiqi0 / crawler Goto Github PK