0.1.0 • Published 12 years ago

sinew-node v0.1.0

Welcome to Sinew-Node

Sinew-Node collects structured data from websites (screen scraping).

Sinew-Node is distributed as an npm package:

npm install sinew-node

Example

Here's an example that prints the first five post titles from Reddit's r/javascript listing:

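sinew = require '../lib/sinew-node'
# Fetch the page; the callback runs once it has loaded
sinew.get 'http://www.reddit.com/r/javascript/', ->
  # `@$` is the `$` selector function on the callback's `this`
  (@$ '#siteTable div.thing a.title').each (index)->
    # Print the HTML of the first five matching title links
    console.log @innerHTML if index < 5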

Features

  • Sinew-Node caches all HTTP requests on disk, which makes it possible to iterate quickly. Crawl once, then rerun the recipe as often as you like while you tune your CSS selectors and regular expressions, without fetching the pages again.
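
To make the caching idea concrete, here is a minimal sketch in plain JavaScript of how a disk-backed HTTP cache can work. It is not Sinew-Node's actual implementation: the names `cachePath` and `cachedGet`, the SHA-1 file naming, the default cache directory, and the `fetchBody` parameter are all assumptions made for illustration.

```javascript
// Hypothetical sketch of a disk-backed HTTP cache; not Sinew-Node's own code.
const crypto = require('crypto');
const fs = require('fs');
const os = require('os');
const path = require('path');

// Assumed default location for cached pages.
const DEFAULT_CACHE_DIR = path.join(os.tmpdir(), 'sinew-cache-sketch');

// Map a URL to a stable file name inside cacheDir.
function cachePath(cacheDir, url) {
  const digest = crypto.createHash('sha1').update(url).digest('hex');
  return path.join(cacheDir, digest + '.html');
}

// Return the body for `url`, calling `fetchBody` only on a cache miss.
// `fetchBody` is an assumed helper: any function (url) => Promise<string>.
async function cachedGet(url, fetchBody, cacheDir = DEFAULT_CACHE_DIR) {
  fs.mkdirSync(cacheDir, { recursive: true });
  const file = cachePath(cacheDir, url);
  if (fs.existsSync(file)) {
    // Cache hit: serve the stored copy and make no request.
    return fs.readFileSync(file, 'utf8');
  }
  // Cache miss: fetch once, store the body, then return it.
  const body = await fetchBody(url);
  fs.writeFileSync(file, body, 'utf8');
  return body;
}
```

With a cache like this, the first run of a recipe pays for the network round trip and every later run reads the same page from disk, so changing a selector and rerunning is nearly instant. Deleting the cache directory forces a fresh crawl.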