1.0.3 • Published 8 years ago

wwweb v1.0.3

Weekly downloads
5
License
MIT
Repository
github
Last release
8 years ago

WWWEB

npm.io

An autonomous webcrawler for indexing robots.txt files.

Requirements

  • node.js ^6.0.0

Usage

wwweb -d <domain> [-s <interval>] -o <directory> [--rest <seconds>] [[-v] -v] [-t <timeout>]

Options

FlagAliasDescriptionInfo
--domain-dInitial domainrequired
--save-interval-sInterval in seconds for outputting reportsdefault: 30
--output-oName of the output directoryrequired
--help-hShow help
--rest-rSeconds to rest between requestsdefault: 0
--timeout-tMilliseconds before a request times outdefault: 15000
--verbose-vVerbose output of what is going on-vv for debug output
--no-colorDisable colorful output

Examples

Crawl from example.org and output files to the current working directory:

wwweb -d example.org -o .

Crawl from example.org, output files to ./reports/, output warning, wait eight seconds for files to load and save a report every minute.

wwweb -d example.org -o reports/ -v -t 8000 -s 60