1.0.14 • Published 7 years ago

html-article-extractor v1.0.14

Weekly downloads
6
License
MIT
Repository
github
Last release
7 years ago

html-article-extractor

A web page content extractor for News websites

installation

npm install html-article-extractor

usage

var htmlArticleExtractor = require("html-article-extractor");

var dom = new JSDOM("...");
var body = dom.window.document.body
result = htmlArticleExtractor(body);
console.log(result)

Outputs:

{
    html: '<div>contents</div>',
    text: 'contents'
}

example

git clone https://github.com/jungyoun/html-article-extractor
cd html-article-extractor
npm install
node example/crawler.js

demo

https://online-article-extractor.herokuapp.com/

1.0.14

7 years ago

1.0.13

7 years ago

1.0.12

7 years ago

1.0.11

7 years ago

1.0.10

7 years ago

1.0.9

7 years ago

1.0.8

7 years ago

1.0.7

7 years ago

1.0.6

7 years ago

1.0.5

7 years ago

1.0.4

7 years ago

1.0.3

7 years ago

1.0.2

7 years ago

1.0.1

7 years ago

1.0.0

7 years ago