2.0.0 • Published 7 years ago

recrawler v2.0.0

Weekly downloads
4
License
MIT
Repository
github
Last release
7 years ago

recrawler NPM version NPM downloads Circle CI

Remote web content crawler done right.

Motivation

Sometimes I want to grab some nice images from a url like http://bbs.005.tv/thread-492392-1-1.html, so I made this little program to combine node-fetch and cheerio to make my attempt fulfilled.

Install

$ npm install --save recrawler

For Single Page Apps please head to recrawler-nightmare

Usage

const recrawler = require('recrawler')

recrawler('http://some-url.com/a/b/c')
	.then($ => {
		$('img.nice-images').each(function () {
			const url = $(this).attr('src')
			console.log(url)
		})
	})

API

recrawler(url, opts)

opts

cheerio

cheerio options. Except decodeEntities is false by default here.

License

MIT © EGOIST