create-spidergram v0.9.3

License: GPL
Repository: github
Last release: 1 year ago

Build a custom web analysis tool with Spidergram

Spidergram is a toolkit for crawling and analyzing complex web properties. create-spidergram is a quick and easy way to set up a new Spidergram project of your own.

Usage

  1. Ensure you're running Node.js 18 (check with node -v)
  2. Install ArangoDB via direct download or Homebrew. Alternatively, if you have Docker installed, you can use Spidergram's included docker-compose.yml file to spin up an ArangoDB container for testing and development.
  3. Create a new project directory, cd into it, and run npx create-spidergram. You'll be prompted for the project's name and your choice of project template.
  4. Run npm install
  5. Kick the tires with npm run crawl <url>, or dive right into customizing the project.
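
The steps above can be sketched as a pair of small shell helpers. This is only an illustration: the function names (node_major, node_version_ok, new_spidergram_project), the project directory, and the example URL are hypothetical and not part of Spidergram. Treating Node.js 18 as a minimum rather than an exact requirement is also an assumption. ArangoDB (step 2) is expected to be running already.

```shell
#!/bin/sh

# Extract the major version from a string such as "v18.17.1" (the format printed by node -v).
node_major() {
  version="${1#v}"        # drop the leading "v", if any
  echo "${version%%.*}"   # keep everything before the first dot
}

# Step 1: succeed if the given version string is Node.js 18 or later.
# Assumption: newer major versions are acceptable too.
node_version_ok() {
  [ "$(node_major "$1")" -ge 18 ]
}

# Steps 3 to 5: create the project directory, scaffold, install, and crawl.
# The body runs in a subshell, so the caller's working directory is unchanged.
#   $1 = project directory (hypothetical name), $2 = URL to crawl
new_spidergram_project() (
  mkdir -p "$1" && cd "$1" || exit 1
  npx create-spidergram &&   # prompts for the project name and template
    npm install &&
    npm run crawl "$2"
)
```

Nothing runs until the functions are called, for example:

```shell
node_version_ok "$(node -v)" && new_spidergram_project my-crawler https://example.com
```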

The Templates

  • Boilerplate is a simple script that fires up a Spidergram crawler, grabs the contents of one or more sites, and prints out a summary report of their URL structures.
  • Boilerplate (Typescript) is a TypeScript version of Boilerplate, with no other functional differences.
  • Crawl with Report (Typescript) demonstrates basic data extraction and report generation in plaintext and Excel formats.

Versions

  • 0.9.3 (1 year ago)
  • 0.9.0 (1 year ago)
  • 0.8.0 (1 year ago)
  • 0.7.2 (1 year ago)
  • 0.7.1 (1 year ago)
  • 0.7.0 (1 year ago)