0.1.0 • Published 7 years ago

artixtractor v0.1.0

artixtractor

Extracts article and blog content from websites such as medium.com and inc42.com.

Installation

$ npm install artixtractor --save

Usage

const articleParser = require('artixtractor');

articleParser('https://medium.com/@Aegist/how-to-end-googles-monopoly-5c46ef7db20d')
.then((result) => {
    console.log(result);
    /* => {
        domain: 'medium.com',
        title: 'How to End Google’s Monopoly – Shane Greenup – Medium',
        articleContent: 'Over a year ago I said Google would never implement a fact based assessment in their algorithm because I thought they would understand that making such a change would be the first step to losing their search monopoly.I was wrong.
            ......
            ......
            Founder of rbutr and dedicated to solving the problem of misinformation.Entrepreneur, Philosopher, Scientist, Traveller, Extreme Sports enthusiast.',
        primaryImage: 'https://cdn-images-1.medium.com/max/1200/1*EDO7CRa7DA3HfkRcUU6Qtg.jpeg',
        date: '2017-08-03T10:42:48.314Z',
        author: '@Aegist',
        shortDescription: 'Over a year ago I said Google would never implement a fact based assessment in their algorithm because I thought they would understand that…'
    } */
}).catch((reason) => {
    console.log(reason);
});
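
Because the parser returns a promise, several URLs can be processed together. The following is a minimal sketch, not part of artixtractor: `extractAll` and `fakeParser` are hypothetical names introduced here, and the stand-in parser only exists so the example runs without network access. In real use, the function exported by `require('artixtractor')` would be passed in its place.

```javascript
// Hypothetical helper (not part of artixtractor): runs a parser over several
// URLs and separates successes from failures.
// `parser` is any function that maps a URL to a promise of a result.
async function extractAll(parser, urls) {
  const settled = await Promise.allSettled(urls.map((url) => parser(url)));
  const articles = [];
  const failures = [];
  settled.forEach((outcome, i) => {
    if (outcome.status === 'fulfilled') {
      articles.push(outcome.value);
    } else {
      failures.push({ url: urls[i], reason: outcome.reason });
    }
  });
  return { articles, failures };
}

// Stand-in parser (assumption) so the sketch runs offline.
// In real use, pass require('artixtractor') instead.
const fakeParser = (url) =>
  url.startsWith('https://')
    ? Promise.resolve({ domain: new URL(url).hostname, title: 'stub' })
    : Promise.reject(new Error('unsupported URL: ' + url));

extractAll(fakeParser, ['https://medium.com/some-post', 'not-a-url'])
  .then(({ articles, failures }) => {
    console.log('extracted:', articles.length, 'failed:', failures.length);
  });
```

`Promise.allSettled` is used so that one failing URL does not discard the results for the others; it requires Node.js 12.9 or later.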

Features

  • Text extraction
  • Primary image extraction
  • Description extraction
  • Keyword extraction
  • Author extraction
  • Article publication date extraction
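
The extracted fields arrive as properties on the result object. As a rough sketch of consuming them, the hypothetical helper below (`summarizeArticle` is not part of artixtractor) builds a compact summary from the field names shown in the usage output. The sample object is a shortened copy of that output. The usage output does not show a field for keywords, so the sketch leaves keywords out rather than guess at a name.

```javascript
// Hypothetical helper (not part of artixtractor): condenses a result object
// into a small summary, tolerating missing fields.
function summarizeArticle(result) {
  const published = result.date ? new Date(result.date) : null;
  const validDate = published !== null && !Number.isNaN(published.getTime());
  return {
    source: result.domain || 'unknown',
    headline: result.title || '(untitled)',
    byline: result.author || 'unknown author',
    // Keep only the calendar date (YYYY-MM-DD) of the ISO timestamp.
    publishedOn: validDate ? published.toISOString().slice(0, 10) : null,
    hasImage: Boolean(result.primaryImage),
    teaser: result.shortDescription || '',
  };
}

// Sample shaped like the usage output (article text shortened here).
const sample = {
  domain: 'medium.com',
  title: 'How to End Google’s Monopoly – Shane Greenup – Medium',
  primaryImage: 'https://cdn-images-1.medium.com/max/1200/1*EDO7CRa7DA3HfkRcUU6Qtg.jpeg',
  date: '2017-08-03T10:42:48.314Z',
  author: '@Aegist',
  shortDescription: 'Over a year ago I said Google would never implement a fact based assessment…',
};

console.log(summarizeArticle(sample));
```

Defaulting each field keeps the helper usable on pages where the extractor finds no author, date, or image.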

License

MIT © Bharathvaj Ganesan