turbocrawl
The simple and fast crawling framework. So you can focus on scraping.
The simple and fast crawling framework. So you can focus on scraping.
Command Line Interface for Turbo Crawl
GNewsScraper is a TypeScript package that scrapes article data from Google News based on a keyword or phrase. It returns the results as an array of JSON objects, making it convenient to access and use the scraped information
Build web scraping agents using AI to auto-extract the data from websites
SyphonX is a tool that extracts data from HTML data, transforming it into JSON of any shape or size. It combines the power of CSS Selectors and jQuery, Regular Expressions, and Javascript into a declarative template format to elegantly solve the simplest