0.2.1 • Published 8 years ago

scrapm v0.2.1

Weekly downloads
5
License
MIT
Repository
github
Last release
8 years ago

scrapm

more convenient scraper for node.js

Usage

var scrapm = require('scrapm');

scrapm([options], callback);

options are request options:

  • url - (required) target url you want to scrape
  • blocked - default false, true if the server is blocking scraping by machine. it cheats the server by changhing 'User-Agent' in header.
  • encoding - default false, true if the encoding of the webpage is not utf-8.
  • sourceEncoding - default '', encoding of the webpage if options.encoding is true
  • targetEncoding - default '', encoding you want to get if options.encoding is true

callback is called with (err, $). You can manipulate DOM with $ as you want. $ is implemented with jsdom.

Dependencies

  • iconv
  • jsdom
  • request

License

MIT License