1.0.7 • Published 11 months ago

scrapifyjs v1.0.7

License
ISC

My Scraping Module

My Scraping Module, published on npm as scrapifyjs, is a JavaScript library for scraping HTML text. It provides functions for selecting elements by tag or by property and for extracting the content of the elements it finds.

Installation

To install the module, use npm (Node Package Manager):

npm install scrapifyjs

Documentation

htmlText(html)

The htmlText function takes an HTML-formatted text as a parameter and initializes the scraping process.

Parameters

html (string): The HTML text to be processed.

Returns

A Scraping object representing the scraped HTML.

Scraping.element(tag)

The element method searches for HTML elements that match the specified tag within the scraped HTML.

Parameters

tag (string): The HTML tag to search for.

Returns

A Scraping object representing the filtered HTML elements.

Scraping.content()

The content method extracts the content of the HTML elements found in the previous step.

Returns

The content of the HTML elements as an array of strings.

Usage

Here's an example of how to use the functions of the scraping module:

// Import the module
const scraping = require('scrapifyjs');

// Sample HTML to scrape
const htmlText = '<div><h1>Document Title</h1><p>Document content</p></div>';

// Example usage of the element method, followed by content
const elements = scraping(htmlText).element('h1').content();
console.log(elements); // Output: ['Document Title']

// Example usage of the content method
const content = scraping(htmlText).content();
console.log(content); // Output: ['Document content','Document content']

// Example usage of the props method
// (a separate variable name avoids redeclaring htmlText above)
const htmlWithClass = '<div><div class="my-class">hello world</div></div>';

const elementsWithProp = scraping(htmlWithClass).props('my-class');
console.log(elementsWithProp); // Output: ['<div class="my-class">, </div>']

// (a separate variable name avoids redeclaring elementsWithProp above)
const propContent = scraping(htmlWithClass).props('my-class').content();
console.log(propContent); // Output: ['hello world']
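
How it might work

To make the chaining in the usage example easier to picture, here is a minimal, self-contained sketch of a wrapper with the same call shape (element, props, content). It is not scrapifyjs's actual implementation: the names ScrapingSketch and scrapeSketch are hypothetical, the matching is regex-based and assumes non-nested elements and double-quoted attribute values, and its return values are not guaranteed to match what the real module prints.

```javascript
// Hypothetical sketch only: this is NOT the scrapifyjs source code.
// It mimics the chainable element/props/content call shape with regexes.

// Escape characters that have a special meaning inside a regular expression.
function escapeRegExp(text) {
  return text.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
}

class ScrapingSketch {
  constructor(fragments) {
    // Each fragment is a string of HTML that later calls will search.
    this.fragments = fragments;
  }

  // Keep only elements with the given tag name.
  // Assumption: elements of the same tag are not nested in each other.
  element(tag) {
    const pattern = new RegExp(
      `<${escapeRegExp(tag)}\\b[^>]*>[\\s\\S]*?</${escapeRegExp(tag)}>`,
      'gi'
    );
    const matches = this.fragments.flatMap((f) => f.match(pattern) || []);
    return new ScrapingSketch(matches);
  }

  // Keep only elements that have some attribute set to exactly `value`.
  // Assumption: attribute values are wrapped in double quotes.
  props(value) {
    const pattern = new RegExp(
      `<(\\w+)\\b[^>]*=\\s*"${escapeRegExp(value)}"[^>]*>[\\s\\S]*?</\\1>`,
      'gi'
    );
    const matches = this.fragments.flatMap((f) => f.match(pattern) || []);
    return new ScrapingSketch(matches);
  }

  // Return the text of each fragment with every tag stripped out.
  content() {
    return this.fragments.map((f) => f.replace(/<[^>]*>/g, '').trim());
  }
}

// Entry point, playing the role of scraping(htmlText) in the usage example.
function scrapeSketch(html) {
  return new ScrapingSketch([html]);
}

const sampleDoc = '<div><h1>Document Title</h1><p>Document content</p></div>';
console.log(scrapeSketch(sampleDoc).element('h1').content());
// ['Document Title']

const sampleWithClass = '<div><div class="my-class">hello word</div></div>';
console.log(scrapeSketch(sampleWithClass).props('my-class').content());
// ['hello word']
```

Each method returns a new wrapper instead of mutating the current one, so a filtered result can be reused or chained further. A regex approach like this breaks on nested or malformed markup; a real HTML parser is the more robust choice for anything beyond small, well-formed snippets.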