webspider's Introduction

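Crawler Documentation

Crawler Features

The crawler uses jsoup + httpclient to fetch and extract stock information. Its main sources are Eastmoney (东方财富网), the Tonghuashun (同花顺) website, and the Sina Finance (新浪财经) API. The crawled data falls into two categories:

  • Basic information about listed companies
  • Real-time stock quotes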
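
As a rough illustration of the fetch step, the sketch below uses the JDK's built-in java.net.http client (Java 11+) as a stand-in for the httpclient library named above. The endpoint URL, the User-Agent value, and the names FetchSketch, buildRequest, and fetch are hypothetical placeholders and are not taken from the project.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;

class FetchSketch {
    // Hypothetical placeholder endpoint; the real source URLs are not given in this document.
    static final String BASE_URL = "https://quotes.example.com/list=";

    // Build a GET request for one stock code (for example "sh601006").
    static HttpRequest buildRequest(String stockCode) {
        return HttpRequest.newBuilder(URI.create(BASE_URL + stockCode))
                .header("User-Agent", "webspider-sketch/0.1")
                .timeout(Duration.ofSeconds(10))
                .GET()
                .build();
    }

    // Send the request and return the raw response body as text.
    static String fetch(String stockCode) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newHttpClient();
        HttpResponse<String> response =
                client.send(buildRequest(stockCode), HttpResponse.BodyHandlers.ofString());
        if (response.statusCode() != 200) {
            throw new IOException("Unexpected HTTP status: " + response.statusCode());
        }
        return response.body();
    }
}
```

The returned body would then go to a parser: jsoup for HTML pages, or plain string splitting for text-based interfaces.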

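Entities

CompanyInfo entity:

Field name | Description | Type
stockId | Stock ID of the listed company | int
perShareEarnings | Earnings per share | double
retainedProfits | Net profit | double
increaseInRetainedProfits | Net profit growth rate, year over year | double
increaseInNetProfit | Net profit growth rate, quarter over quarter | double
nonNetProfitDeduction | Net profit excluding non-recurring items | double
increaseInNonNetProfitDeduction | Growth rate of net profit excluding non-recurring items, year over year | double
grossRevenue | Total operating revenue | double
yearOnYearGrowthRateOfTotalRevenue | Total operating revenue growth rate, year over year | double
netAssertValuePerShare | Net asset value per share | double
rateOfReturnOnCommonStockholders | Return on equity | double
netAssertYieldDiluted | Return on equity, diluted | double
assetLiabilityRatio | Asset-liability ratio | double
capitalReversePerShare | Capital reserve per share | double
retainedEarnings | Undistributed profit per share | double
operationCashFlowPerShare | Operating cash flow per share | double
grossProfitMargin | Gross profit margin on sales | double
inventoryTurnoverRatio | Inventory turnover ratio | double
netProfitMarginOnSales | Net profit margin on sales | double
date | Release date | Date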
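
To show how such a field table might map onto a Java class, here is a minimal sketch covering only four of the fields. The class name CompanyInfoSketch and the getter/setter layout are assumptions; the actual CompanyInfo class in the project may be structured differently.

```java
import java.util.Date;

// Partial, illustrative entity: only a subset of the CompanyInfo fields is shown.
class CompanyInfoSketch {
    private int stockId;              // stock ID of the listed company
    private double perShareEarnings;  // earnings per share
    private double retainedProfits;   // net profit
    private Date date;                // release date

    int getStockId() { return stockId; }
    void setStockId(int stockId) { this.stockId = stockId; }

    double getPerShareEarnings() { return perShareEarnings; }
    void setPerShareEarnings(double perShareEarnings) { this.perShareEarnings = perShareEarnings; }

    double getRetainedProfits() { return retainedProfits; }
    void setRetainedProfits(double retainedProfits) { this.retainedProfits = retainedProfits; }

    Date getDate() { return date; }
    void setDate(Date date) { this.date = date; }
}
```

The remaining double fields would follow the same pattern.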

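MarketValue entity:

Field name | Description | Type
stockId | Stock ID of the listed company | int
stockName | Stock name of the listed company | String
open | Opening price | double
preClose | Previous closing price | double
current | Current price | double
highest | Highest price | double
lowest | Lowest price | double
bidOne | Bid price | double
bidAskOne | Ask price | double
settlement | Transaction price | double
settlementAmount | Trading volume | double
buyOne | Level 1 bid price | double
buyOneAmount | Level 1 bid volume | int
buyTwo | Level 2 bid price | double
buyTwoAmount | Level 2 bid volume | int
buyThree | Level 3 bid price | double
buyThreeAmount | Level 3 bid volume | int
buyFour | Level 4 bid price | double
buyFourAmount | Level 4 bid volume | int
buyFive | Level 5 bid price | double
buyFiveAmount | Level 5 bid volume | int
askOne | Level 1 ask price | double
askOneAmount | Level 1 ask volume | int
askTwo | Level 2 ask price | double
askTwoAmount | Level 2 ask volume | int
askThree | Level 3 ask price | double
askThreeAmount | Level 3 ask volume | int
askFour | Level 4 ask price | double
askFourAmount | Level 4 ask volume | int
date | Data date | Date
time | Data time | Date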
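
Real-time quote interfaces often return one comma-separated line per stock. The sketch below shows one way such a line could be turned into the first few MarketValue fields. It assumes the values arrive in the order name, open, previous close, current, highest, lowest. That ordering, the class name MarketValueSketch, and the parse method are illustrative assumptions rather than the project's confirmed format or code.

```java
// Partial, illustrative entity plus parser: only the leading MarketValue fields are handled.
class MarketValueSketch {
    int stockId;
    String stockName;
    double open;
    double preClose;
    double current;
    double highest;
    double lowest;

    // Parse a comma-separated quote line; the field order is an assumption (see lead-in).
    static MarketValueSketch parse(int stockId, String line) {
        String[] parts = line.split(",");
        if (parts.length < 6) {
            throw new IllegalArgumentException(
                    "Expected at least 6 fields but got " + parts.length);
        }
        MarketValueSketch mv = new MarketValueSketch();
        mv.stockId = stockId;
        mv.stockName = parts[0].trim();
        mv.open = Double.parseDouble(parts[1].trim());
        mv.preClose = Double.parseDouble(parts[2].trim());
        mv.current = Double.parseDouble(parts[3].trim());
        mv.highest = Double.parseDouble(parts[4].trim());
        mv.lowest = Double.parseDouble(parts[5].trim());
        return mv;
    }
}
```

For example, `MarketValueSketch.parse(601006, "SampleCo,27.55,27.25,26.91,27.55,26.20")` fills in the name and the five prices; a line with fewer than six fields is rejected with an IllegalArgumentException.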

webspider's People

Contributors

duanmuhan
