1.1.2 • Published 6 months ago

@aashari/nodejs-confluence-export v1.1.2

Weekly downloads
-
License
ISC
Repository
github
Last release
6 months ago

Confluence Export CLI

A powerful Node.js/TypeScript command-line tool for exporting Confluence pages with advanced filtering capabilities, preserving page relationships and content formatting.


Overview

This CLI tool enables you to export entire Confluence spaces to your local filesystem while maintaining hierarchical structure and page relationships. It converts Confluence's complex storage format into clean Markdown or HTML files with proper formatting for macros, tables, code blocks, and other Confluence elements.

Key Features

  • Complete Space Export: Export an entire Confluence space with a single command
  • Hierarchical Organization: Preserves page ancestry and relationships in folder structure
  • Flexible Filtering: Exclude specific pages or entire branches with parent-based or title pattern filters
  • Content Preservation: Properly renders Confluence macros, tables, code blocks, info/note panels, and more
  • Metadata Retention: Includes page metadata (created date, updated date, path/breadcrumbs) in exports
  • Progress Indicators: Shows real-time progress during export with clear summaries

Getting Started

Prerequisites

  • Node.js (>=18.x): Download
  • Confluence Cloud Account with API access

Step 1: Get Your Atlassian API Token

  1. Go to your Atlassian API token management page: https://id.atlassian.com/manage-profile/security/api-tokens
  2. Click Create API token.
  3. Give it a descriptive Label (e.g., confluence-export-cli).
  4. Click Create.
  5. Copy the generated API token immediately. You won't be able to see it again.

Step 2: Configure Credentials

Method A: Environment Variables

Set these environment variables:

export ATLASSIAN_SITE_NAME="yourcompany"        # For yourcompany.atlassian.net
export ATLASSIAN_USER_EMAIL="your.email@example.com"
export ATLASSIAN_API_TOKEN="your-api-token"

Method B: .env File

Create a .env file in your project directory:

ATLASSIAN_SITE_NAME=yourcompany
ATLASSIAN_USER_EMAIL=your.email@example.com
ATLASSIAN_API_TOKEN=your-api-token

Usage Examples

Quick Use with npx

Export a space to the default output directory (./output):

npx -y @aashari/nodejs-confluence-export export --space SPACEKEY

Using inline environment variables:

ATLASSIAN_SITE_NAME=yourcompany ATLASSIAN_USER_EMAIL=your.email@example.com ATLASSIAN_API_TOKEN=your-api-token npx -y @aashari/nodejs-confluence-export export --space SPACEKEY

Install Globally

npm install -g @aashari/nodejs-confluence-export

Then run directly:

confluence-export export --space SPACEKEY

Custom Output Directory

npx -y @aashari/nodejs-confluence-export export --space SPACEKEY --output-dir ./my-exports

Choose Format (Markdown or HTML)

npx -y @aashari/nodejs-confluence-export export --space SPACEKEY --format html

Filter Pages to Exclude

Exclude a specific page and all its children:

npx -y @aashari/nodejs-confluence-export export --space SPACEKEY --ignore parent:12345678

Exclude pages matching a title pattern:

npx -y @aashari/nodejs-confluence-export export --space SPACEKEY --ignore title:DRAFT

Use multiple filters:

npx -y @aashari/nodejs-confluence-export export --space SPACEKEY --ignore parent:12345678 --ignore title:"Internal Only"

Command Reference

Main Commands

confluence-export export [options]  Export Confluence pages from a specified space

Export Options

  • -s, --space <spaceKey>: (Required) The key of the Confluence space to export
  • -o, --output-dir <path>: Directory to save exported files (default: ./output)
  • -f, --format <format>: Export format - markdown or html (default: markdown)
  • --ignore <filter...>: Pages to ignore. Format: "parent:ID" to ignore a page and all children, or "title:REGEX" to ignore pages matching regex pattern. Can be used multiple times.

Output Structure

The exported content will be organized as follows:

output/
├── space-name/
│   ├── page-title-1/
│   │   ├── child-page-1.md
│   │   └── child-page-2.md
│   ├── page-title-2/
│   │   └── ...
│   └── ...
└── ...

Each Markdown/HTML file includes:

  1. Title as heading
  2. Metadata section with:
    • Path/breadcrumbs
    • Created date
    • Updated date
  3. Page content with properly rendered Confluence elements
  4. Footer with export timestamp

Troubleshooting

If you encounter issues with the export:

  1. Verify your Atlassian credentials are correct
  2. Ensure you have appropriate permissions for the space
  3. Check for rate limiting if exporting large spaces
  4. Use the DEBUG=true environment variable for detailed logs:
DEBUG=true npx -y @aashari/nodejs-confluence-export export --space SPACEKEY

Development

  • Build: npm run build
  • Test: npm test
  • Format code: npm run format
  • Lint code: npm run lint

License

ISC