realfish-yc
Real Fish Youtube Video Crawling Module
Real Fish Youtube Video Crawling Module
Real Fish Youtube Trend Video Crawling
Crawler (spider) of site web pages by domain name
Easily scrap the web for torrent and media files.
Easily crawl your public notion pages
A library to recursively retrieve and serialize Notion pages with customization for machine learning applications.
NodeCraw is a web crawling application that allows you to crawl specified URLs and extract information from web pages. It utilizes various modules and libraries to perform crawling and save the results.
PhantomJS sitemap generator
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.
A tool to get sitemaps from websites and crawl them
Gracefully handle timeout and network error with auto retry.
Providers are the core of applications, where the subtitles are collected. Each provider exports a unique strategy for gathering data. From legendastv's web scraping from opensubtitle API usage, you can collect subtitles from your favorite tv shows and mo
Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they are downloaded, asynchronously
StackSleuth in-house browser automation agent for debugging and user simulation
A Simple Job Manager
Fast asynchronous NodeJS module for crawling/scraping a web through worker_threads.
A set of shared utilities that can be used by crawlers
Datasco API SDK for Node.js to collect any data from any website