@crawlee/linkedom
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Templates for Crawlee projects
An attestate crawler strategy to download and transform Ethereum block event logs
Puppeteer-based image search engine scraper.
A port of n0madic/twitter-scraper to Node.js.
Fork of https://github.com/sudheer-ranga/aliexpress-product-scraper
A simple module that checks the HTTP status code of every link in a given list.
The server crawler for Rabbit Server List
Crawler for Gerrit
gRPC, Tokio-based web crawler
Fast asynchronous Node.js module for crawling/scraping the web through worker_threads.
The unofficial HLTV Node.js API
A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Papercut is a scraping/crawling library for Node.js, written in TypeScript.
Crawler for IT events in China
Web data extraction can be effectively performed using CSS selectors.
A command-line Bing dictionary that fetches query results from Bing Dictionary with a crawler.
Recursive and multi-threaded broken link checker
Access Google Play by logging in and making requests as an Android device!
A simple ontology database built from a list of ontologies, with auto-download capabilities