2.5.2 • Published 6 years ago

scrapes v2.5.2

Weekly downloads
-
License
ISC
Repository
-
Last release
6 years ago

Scrapes

CircleCI codecov


Scrapes uses cheerio to parse the content of webpages and returns it.

Installation

To install locally run:

npm install --save scrapes
# Or
npm install -g scrapes

Usage

Scrapes can be used inside of node.js scripts and through the command line.

CLI

Documentation is provided for cli commands with the --help option

# Install locally
npm install -g scrapes
scrapes --help

# Use from npx
npx scrapes --help

scrapeSelection

Takes provided string and a query selector. Returns the text content of the elements that match the selector.

import { scrapeSelection } from "scrapes";

const markup = `<!doctype html><html><body><h1>A title</h1></body></html>`;

scrapeSelection(markup, "h1");
//=> 'A title'

scrapeLinks

Takes provided string and returns the content of href attributes on any anchor tags in an array.

import { scrapeLinks } from "scrapes";

const markup = `<!doctype html><html><body><a href="/example-path">Link</a></body></html>`;

scrapeLinks(markup);
//=> ['/example-path']

prefixLinks

Returns a list of items prefixed with a provided hostname. The prefixing should autoformat, ignoring existing hostnames where specified and adding the correct pathname syntax where it is / isn't provided.

prefixLinks("example.com", [
	"newsguidelines.html",
	"1",
	"mailto:email@example.com",
	"/3",
	"ftp://ftp.example.com"
]);
/*=>
 *	[
 *		"example.com/newsguidelines.html",
 *		"example.com/1",
 *		"mailto:email@example.com",
 *		"example.com/3",
 *		"ftp://ftp.example.com"
 * 	]
 */

Maintained by @ShaunBulbrook.

2.5.2

6 years ago

2.5.1

6 years ago

2.5.0

6 years ago

2.4.0

6 years ago

2.3.0

6 years ago

2.2.0

6 years ago

2.1.1

6 years ago

2.1.0

6 years ago

2.0.0

7 years ago

1.0.1

7 years ago

1.0.0

7 years ago