apify Goto Github PK
Name: Apify
Type: Organization
Bio: We're making the web more programmable.
Twitter: apify
Location: The Interweb
Blog: https://apify.com/
Name: Apify
Type: Organization
Bio: We're making the web more programmable.
Twitter: apify
Location: The Interweb
Blog: https://apify.com/
Apify actor to crawl a list of URLs
Apify actor to upload crawler results to AWS S3.
Apify actor that crawls website and indexes selected web pages to Algolia index. It's used to power the search on https://help.apify.com
This Actor enables you to chat with your website like you do with your friends.
You can use this act to monitor any page's content and get a notification when content changes.
DEPRECATED: An actor that crawls websites and parses HTML pages using Cheerio library. Supports recursive crawling as well as URL lists.
DEPRECATED: An Apify actor that enables crawling of websites using headless Chrome and Puppeteer. The actor is highly customizable and supports recursive crawling of websites as well as lists of URLs.
Example of Apify actor using PHP
Example: Intercept requests from https connection using "Man in the middle" proxy solution.
Example Apify Actor written in Python
Example actor showcasing the secret input fields
Apify Actor to convert HTML string to pdf
Returns an image containing difference of two given images.
This Apify actor is used for integration tests.
The actor implements the legacy Apify Crawler product. It uses PhantomJS headless browser to recursively crawl websites and extract data from them using a piece of JavaScript code.
An example repository with multiple Apify Actors sharing code between each other.
Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.
Contains a boilerplate of an Apify actor to help you get started quickly build your own actors.
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
Example of Python Scrapy project. It scrapes book data from https://books.toscrape.com/.
Apify actor to run web spiders written in Python in the Scrapy library
Actor that runs Selenium based Mocha tests.
This project is the :house: home of Apify actor template projects to help users quickly get started.
This is the experimental version of Web Automation Agent. The agent uses natural language instructions to browse the web and extract data.
How to get clean web data for chatbots and LLMs slides and supporting materials.
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
An Implementation of Algolia to emulate its REST API
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.