Code Monkey home page Code Monkey logo

puppeteer-facebook-video-hook's Introduction

core document.querySelectorAll('.g-tree li .haschild') document.querySelectorAll('.g-tree li .haschild:not([class*="open"])')

https://www.facebook.com/scshop.tw/videos/567889097048999/

TODO List

  • spider framework
  • spider webhook
  • spider web pages
  • spider data save db
  • database api server

后台功能

  • 任务管理,运行管理员添加任务:用户名,密码,视频的ID
  • 任务查看,可以查看当前任务进度

Api

  • /api/:channel/:videoId/comments 可以通过channel和videoId进行评论查询
  • /api/comments?liveUrl= 可以通过视频地址获取到所有讯息
  • /api/task 可以通过api进行提交采集任务
  • /api/config 配置和管理spider系统

Spider功能

  • 实时采集live视频,realtime采集
  • 支持多worker工作,提高采集效率
  • 支持提交liveUrl,在回放过程中,循环采集完所有历史的所有评论
  • worker管控功能和代理功能

遇到的问题

  • facebook 在拦截掉request修改postData后,发现后端不反数据,说明可能前端参数有签名,发现__dyn是动态参数,但是传入空值后就可以正常访问
  • facebook的json数据包含了一个for (;;);的字符串需要去除掉
  • facebook中每页最多能拿到50条数据,多了直接会报错不给数据

puppeteer-facebook-video-hook's People

Contributors

gtsigner avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.