Epic-crawler NPM | npm.io

1.0.21 • Published 6 years ago

Install

Weekly downloads

3

License

MIT

Repository

Last release

6 years ago

Epic Crawler

A simple crawler for scraping important data from web pages.

Installation

$ npm i epic-crawler --save

Usage

const crawler = new epicCrawler;
crawler.init("https://google.com", {
    depth: 5,
}).then(() => {
    crawler.crawl().then((data) => {
        console.log(data);
    });
}).catch((data) => {
    console.log(data);
});

Options

Just three options are supported for now.

depth - 1 to 5 (Default 1) | Crawling Depth.
strict - boolean (Default True) | Set to False if you also want to collect links related to other websites.
cache - boolean (Default True) | Speeds up the crawl by saving data in the cache.

Methods

init: (url: string, { depth, strict, cache }?: options) => Promise - Initialize crawler.
blackList: (fingerPrintList: (string | RegExp)[]) => this - Black List Links.
clearCache: () => this - Clear previous crawled cache.
crawl: () => Promise - Start Crawling.

html epic crawler deep data retrive meta tags keywords collect links scraper

@types/image-size epic-link-crawler epic-sync-loops image-size text-cleaner

@everything-registry/sub-chunk-1591 @zalastax/nolb-epi epic-chat-bot-teacher @infinitebrahmanuniverse/nolb-epi

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago

6 years ago