0.0.3 • Published 3 years ago
scraping2file v0.0.3
Getting Started
Install this package.
npm i -g scraping2file
Create the files needed for execution (only needed the first time).
scraping2file init
Update config.yml. This file controls which websites are scraped, how the extracted text is transformed, and so on. You can set the following keys and properties.
separator: "\t" # Separator between columns.
hostname:
  example.com: # Hostname to scrape.
  - selector: "p" # The selector is used as `document.querySelector(${selector})`.
    all: true # Optional. When true, `document.querySelectorAll(${selector})` is used instead of `document.querySelector(${selector})`.
    callback: !!js/function > # Optional. Set this callback to transform text. Its argument is the `Node.textContent` of each matched element.
      (str) => str.replace("...", "")
Add the URLs of the websites you want to scrape to urls.txt, one URL per line.
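As a rough illustration of how these settings could fit together, here is a minimal JavaScript sketch. It is not scraping2file's source code: the helper names `parseUrls`, `rulesFor`, and `buildRow` are hypothetical, the way matched texts are laid out as columns is an assumption, and plain arrays of strings stand in for the `Node.textContent` values that a real DOM lookup would return.

```javascript
// Hypothetical sketch; not scraping2file's actual implementation.
// This object mirrors the shape of config.yml after it has been parsed.
const config = {
  separator: "\t",
  hostname: {
    "example.com": [
      { selector: "p", all: true, callback: (str) => str.replace("...", "") },
    ],
  },
};

// urls.txt holds one URL per line; blank lines are skipped.
function parseUrls(text) {
  return text
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0);
}

// Find the rules configured for a URL's hostname (empty if none).
function rulesFor(url, cfg) {
  return cfg.hostname[new URL(url).hostname] ?? [];
}

// Build one output row. textsPerRule[i] stands in for the textContent
// values matched by rules[i].selector on the page.
function buildRow(rules, textsPerRule, separator) {
  const cells = [];
  rules.forEach((rule, i) => {
    // Without `all: true`, only the first match is kept (querySelector behavior).
    const texts = rule.all ? textsPerRule[i] : textsPerRule[i].slice(0, 1);
    for (const text of texts) {
      // The optional callback transforms each matched text.
      cells.push(rule.callback ? rule.callback(text) : text);
    }
  });
  return cells.join(separator);
}

const urls = parseUrls("https://example.com/a\n\nhttps://example.com/b\n");
const rules = rulesFor(urls[0], config);
const row = buildRow(rules, [["Hello...", "World"]], config.separator);
console.log(JSON.stringify(row));
```

Note that `str.replace("...", "")` with a string pattern removes only the first literal occurrence of `...` in each text.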
Finally, run the command below.
scraping2file
When execution finishes, scraping2file creates a file in the output directory. 🎉
License
MIT