0.0.4 • Published 5 months ago

cli-web-fetch v0.0.4

Weekly downloads
-
License
MIT
Repository
github
Last release
5 months ago

cli-web-fetch

A command-line tool for web scraping and image downloading with piping support.

Features

  • Extract content using CSS selectors
  • Download images from websites
  • JSON output format
  • Pipeline support for chaining commands
  • Concurrent image downloads

Installation

# Install globally
npm install -g cli-web-fetch

# Or use with npx
npx cli-web-fetch

Quick Start

# Extract all links
cli-web-fetch scrape https://example.com -s "a" -a "href"

# Download all images
cli-web-fetch scrape https://example.com -s "img" -a "src" | cli-web-fetch download -o ./images

# real example - you can do something like this
cli-web-fetch scrape https://www.shutterstock.com/explore/royalty-free-images -s img -a src | cli-web-fetch download

Command Reference

scrape

cli-web-fetch scrape <url> [options]

Options:
  -s, --selector <selector>     CSS selector for filtering
  -a, --attribute <attribute>   Attribute to extract (href/src/text)
  -f, --format <format>        Output format (plain/json)
  -H, --headers <headers>      Custom headers in JSON format

download

cli-web-fetch download [options]

Options:
  -o, --output <dir>           Output directory (default: ./downloads)
  [url]                        Direct URL to download (optional)

More examples

# Get text from paragraphs
cli-web-fetch scrape https://example.com -s "p" -a "text"

# Get JSON output
cli-web-fetch scrape https://example.com -s "h1" -f json

Browser emulation

The tool automatically uses common browser headers to avoid detection: User-Agent Accept headers Sec-Fetch headers Referrer

Note

Please respect websites' terms of service and robots.txt when scraping content.

0.0.4

5 months ago

0.0.3

5 months ago

0.0.2

5 months ago

0.0.1

5 months ago