scraping-xx Goto Github PK
Type: Organization
Type: Organization
Add CrossRef bibliographic metadata (or any XMP) to a PDF
Python PDF Parser
A more complete example of programming with PDFMiner, which continues where the default documentation stops
PDF Structure and Syntactic Analysis for Metadata Extraction and Tagging - https://code.google.com/p/pdfssa4met/
Convert pdf to text
Parsers and formatters for person names, street addresses, city/state/zip, phone numbers, etc.
A Chrome extension to annotate urls and send the annotated html markup to a server.
PhantomJS-based web performance metrics collector and monitoring tool
Headless WebKit with JavaScript API
PhantomJS integration module for NodeJS
Phantompy is a headless WebKit engine with full clean pythonic api build on top of Qt5 Webkit
Robot Framework Remote Test Library for PhantomJS
a pdf parser in php
PHP class to download website data from Google Webmaster Tools as CSV.
PHPElasticManager is a GUI admin system for managing your elasticsearch indexes.
phrasecount recommener
Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
in-browser css inspect/edit.
Compile Yahoo! Pipes to Javascript (Node.js)
Build Pivot Tables from CSV/JSON Data
A web-scraping framework written in Javascript, using PhantomJS and jQuery
pkgcloud is a standard library for node.js that abstracts away differences among multiple cloud providers.
A platform detection library that works on nearly all JavaScript platforms.
JavaScript source analysis and visualizer
Powerful and flexible web-based application builder
A Plomino plugin that provides fields and other form elements built using the Patterns javascript library.
The next generation CLI process manager for Node.js with native clusterization
URI normalization, c18n, escaping, and extraction
Fast, Nimble PDF Writer for Ruby
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.