1.0.9 • Published 9 years ago

bbc-reader v1.0.9

Weekly downloads
3
License
MIT
Repository
github
Last release
9 years ago

bbc-reader

Scrape a BBC article from BBC.com

Install

npm install bbc-reader --save

Use

   var BBCReader = require('bbc-reader');
   var bbcreader = new BBCReader();

   // Promise
   bbcreader.read('http://www.bbc.com/news/world-europe-34602621').then(function(article) {
      // Do Something with Article
   });

   // Callback
   bbcreader.read('http://www.bbc.com/news/world-europe-34602621', function(article) {
      // Do Something with Article
   });

Article

var Article = {
   title: '',
   datetime: '',
   body: {
      clean: '',
      markdown: ''
   },
   images: [
      {
         full: ''
      }
   ],
   source: ''
};

title The title of the Article. What appears in the h1 on the page.

datetime The datetime with timezone of the last update of the article. Format: YY-mm-dd H:i:s GMT. The datetime will always be GMT+0000.

body The body of the article. Comes in two formats. clean and minimal. The clean format removes all html elements and separates paragraphs by two newlines. Markdown attempts to provide a markdown version of the article.

images An array of image urls found in the body. Comes in sizes full for each image.

source The url of the bbc article.

1.0.9

9 years ago

1.0.8

9 years ago

1.0.7

9 years ago

1.0.6

9 years ago

1.0.5

9 years ago

1.0.4

9 years ago

1.0.2

9 years ago

1.0.1

9 years ago

1.0.0

9 years ago