0.1.1 • Published 11 years ago

snapshoter v0.1.1

Weekly downloads
8
License
-
Repository
-
Last release
11 years ago

Snapshooter

Simple 'javascript' crawler.

Version 0.1.1

About

Snapshooter is basicly a crawler, which will load a URL, wait the javascript to render, save it as plain HTML file and carry on until all hrefs are rendered.

Issues

Do not hesitate to open a feature request or a bug report.

https://github.com/serpentem/snapshooter/issues

Docs

Have fun. :)

npm install -g snapshooter

snapshooter [http://your_url] [output_folder]

A very tiny bit of integration is needed in order for it to effectively wait until all javascript opterations such as data loadings, template rendering etc.

Considering you have a Single Page Application I bet you have also some render method, and possibly another in and out too for handling transitions.

Well, the only matter here is to inform Snapshooter that the page has finish rendering. It's achieved by setting the property window.crawler.is_rendered.

window.crawler.is_rendered = true

Snapshooter will keep waiting for the page until this variable gets true and then the rendered DOM will be saved as a plain html file.

Download the repo and have fun, pull requests are more than welcome.

  git clone git://github.com/serpentem/snapshooter.git
  cd snapshooter
  npm link

Powered by