@hanivanrizky/nestjs-html-parser
A powerful NestJS HTML parsing service with XPath and CSS selector support, proxy configuration, random user agents, and rich response metadata including headers and status codes
A powerful NestJS HTML parsing service with XPath and CSS selector support, proxy configuration, random user agents, and rich response metadata including headers and status codes
Web data extraction can be effectively performed using CSS selectors.
Domain-oriented action request with path decoding, business object preloading, and action extracting
WordPress CSS & JS Dependency Extraction Webpack Plugin
Use LLMs to robustly extract and enrich structured data from HTML and markdown
Crittr is a high performance critical css extraction library without puppeteer itself.
Advanced plain object handling, manipulation and extraction
A component that accepts JSON objects to execute simple or complex extraction maneuvres.
An javascript implementation of the Rapid Automated Keyword Extraction (RAKE) algorithm. Forked from https://github.com/sleepycat/rapid-automated-keyword-extraction
A kit to extract link
A kit to extract location
A kit to extract math-expression
A kit to extract number
A kit to extract room location
Internal kit of reskit
A lightweight utility to extract postal codes from raw strings for supported European countries.
NodeJS SDK to interact with Scrape-it Cloud API
A JSX based prompt design library for structured extraction
A tool to turn any Git repository into a simple text digest of its codebase.
The purpose of the neume-network/core Extraction Worker (short: "EW") is to parallelize retrieving distributed information from various data sources by abstracting away the complexity of scaling processes accross a distributed system such as e.g. multiple