1.0.34 • Published 7 years ago

content_scraper v1.0.34

Weekly downloads
2
License
MIT
Repository
github
Last release
7 years ago

Content-scrape-Javascript An npm Package for Node.js using a third party npm package to scrape content from the site.

This project is a Node.js command line application that goes to an Ecommerce site to gets the latest prices and saves them to a spreadsheet (CSV format). This spreadsheet will be used by another application to populate a database.

This creates a csv file with current date of when the scrape happened into the data folder e.g. 2016-01-29.csv .

The csv file column headers Title, Price, ImageURL, URL and Time .

Basic Usage

var Scraper = require("content_scraper");

var url = "http://www.shirts4mike.com/";

var shirts4mikeScraper = new Scraper.scrape(url);

var directory = 'outputDir';

var filename = 'results';

/**
* When the "end" event is triggered an array of shirt
* metadata is returned
**/
shirts4mikeScraper.on("end", function(result){
	  //To view result in console
	  //console.dir(result);
	  //Optional parameter to change the filename and the output directory
	  
	  Scraper.print(result, directory, filename);
	  
	  //Or simply to use the default directory 
	  // Scraper.print(result);
});

/**
* If a parsing, network or HTTP error occurs an
* error object is passed in to the handler or callback
**/
shirts4mikeScraper.on("error", console.error);

Example of Returned Data

[ 
  {    title: 'Mike the Frog Shirt, Orange',
       price: '$20',
       imageUrl: 'http://www.shirts4mike.com/img/shirts/shirt-108.jpg',
       href: 'http://www.shirts4mike.com/shirt.php?id=108',
       time: '0:37:58'
  },
  {    title: 'Mike the Frog Shirt, Orange',
       price: '$20',
       imageUrl: 'http://www.shirts4mike.com/img/shirts/shirt-108.jpg',
       href: 'http://www.shirts4mike.com/shirt.php?id=108',
       time: '0:37:58'
  },
]

title, price, url and imageURL are strings and time is a JavaScript Date object.

Sample CSV

./data/yyyy-mm-dd.csv

Title	                      Price ($)	ImageURL	                                         URL	                                    Time
 Mike the Frog Shirt, Orange	$25	    http://www.shirts4mike.com/img/shirts/shirt-108.jpg	http://www.shirts4mike.com/shirt.php?id=108	0:38:49
 Logo Shirt, Teal	            $20	    http://www.shirts4mike.com/img/shirts/shirt-107.jpg	http://www.shirts4mike.com/shirt.php?id=107	0:38:49
 Mike the Frog Shirt, Yellow	$25	    http://www.shirts4mike.com/img/shirts/shirt-105.jpg	http://www.shirts4mike.com/shirt.php?id=105	0:38:49
 Logo Shirt, Gray	            $20	    http://www.shirts4mike.com/img/shirts/shirt-106.jpg	http://www.shirts4mike.com/shirt.php?id=106	0:38:49
1.0.34

7 years ago

1.0.33

7 years ago

1.0.32

7 years ago

1.0.31

7 years ago

1.0.30

8 years ago

1.0.29

8 years ago

1.0.28

8 years ago

1.0.27

8 years ago

1.0.26

8 years ago

1.0.22

8 years ago

1.0.12

8 years ago

1.0.11

8 years ago

1.0.10

8 years ago