0.1.6 • Published 10 years ago

spidee v0.1.6

License: MIT

Tiny web crawler

Install

npm install spidee

Usage

var spidee = require('spidee')(options);
spidee.crawl();

or pass the options through configure():

var spidee = require('spidee')();
spidee.configure(options).crawl();
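
The snippets above leave `options` undefined. As a minimal sketch, a concrete options object could look like the one below. The arguments passed to `onSuccess` and `onFailure` are not documented here, so the callbacks ignore them and only count calls. The `stats` object and the example URL are hypothetical, added for illustration.

```javascript
// Hypothetical counters for illustration; not part of spidee.
var stats = { ok: 0, failed: 0 };

var options = {
  url: 'http://example.com/',   // mandatory: starting point for crawling
  sleep: 150,                   // documented default
  timeout: 1500,                // documented default
  repeat: 1,                    // documented default
  ignoreRelative: false,        // documented default
  // Callback arguments are an unknown here, so they are deliberately unused.
  onSuccess: function () { stats.ok += 1; },
  onFailure: function () { stats.failed += 1; }
};

// With the package installed, either call style from above applies:
// require('spidee')(options).crawl();
// require('spidee')().configure(options).crawl();
```

Only `url` is mandatory. The other keys repeat their documented defaults and could be omitted.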

Options

| option | mandatory | default value | description |
|---|---|---|---|
| url | yes | | starting point for crawling |
| sleep | no | 150 | how long the spider waits between requests |
| timeout | no | 1500 | how long the spider waits for a response |
| repeat | no | 1 | how many requests the spider makes to each link, including the starting url; useful for cache testing |
| ignoreRelative | no | false | determines whether the spider ignores all relative links |
| onSuccess | no | | callback called on request success |
| onFailure | no | | callback called on request failure |
| shouldCrawl | no | function | function deciding whether the spider should follow a given url; useful for keeping the spider from leaving your site, which it does by default. The function MUST return a boolean |
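
As an illustration of `shouldCrawl`, the sketch below builds a predicate that keeps the crawl on the starting host. It assumes the spider calls `shouldCrawl` with the candidate link as a string, which is not confirmed above. `sameHostOnly` and `crawlOptions` are hypothetical names, and `URL` is the global class in current Node.js versions.

```javascript
// Hypothetical helper: returns a shouldCrawl predicate limited to one host.
function sameHostOnly(startUrl) {
  var startHost = new URL(startUrl).host;
  return function shouldCrawl(url) {
    try {
      // Resolve relative links against the starting URL, then compare hosts.
      return new URL(url, startUrl).host === startHost;
    } catch (e) {
      // Links that cannot be parsed are not followed.
      return false;
    }
  };
}

var crawlOptions = {
  url: 'http://example.com/',
  // Assumption: spidee passes the link as a string and uses the boolean result.
  shouldCrawl: sameHostOnly('http://example.com/')
};

// With the package installed:
// require('spidee')(crawlOptions).crawl();
```

The predicate always returns a strict boolean, in line with the requirement in the table.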
