0.2.0 • Published 6 years ago

etl-toolkit v0.2.0

License
MIT

ETL toolkit

General components for managing the retrieval and processing of data


Components

Queues

populateIds may be used to add IDs to the etlStore from a paged source.

populateRecordsFromIds may be used to populate records from the IDs in the etlStore.
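The two queue stages can be pictured with a minimal sketch. It is not the etl-toolkit API: `createStore`, `collectIds`, `collectRecords`, and the `fetchPage` / `fetchRecord` callbacks are hypothetical names used only to illustrate the flow of collecting IDs from a paged source, then loading a record for each ID while noting failures.

```javascript
// Hypothetical in-memory stand-in for the etlStore (not the etl-toolkit API).
function createStore() {
  const ids = new Set();
  const records = new Map();
  const failedIds = new Set();
  return {
    addIds: (newIds) => newIds.forEach((id) => ids.add(id)),
    getIds: () => [...ids],
    addRecord: (id, record) => records.set(id, record),
    addFailedId: (id) => failedIds.add(id),
    getRecords: () => [...records.values()],
    getFailedIds: () => [...failedIds],
  };
}

// Stage 1: walk a paged source until a page comes back empty, storing its IDs.
// fetchPage(pageNumber) is an assumed callback that resolves to an array of IDs.
async function collectIds(store, fetchPage) {
  for (let page = 1; ; page += 1) {
    const ids = await fetchPage(page);
    if (ids.length === 0) break;
    store.addIds(ids);
  }
}

// Stage 2: load a record for every stored ID.
// A failed lookup is recorded as an errored ID instead of stopping the run.
async function collectRecords(store, fetchRecord) {
  for (const id of store.getIds()) {
    try {
      store.addRecord(id, await fetchRecord(id));
    } catch (err) {
      store.addFailedId(id);
    }
  }
}
```

Keeping the two stages separate means the list of IDs is complete before any record is requested, so a failure on one record does not affect discovery of the others.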

ETL Store

The etlStore manages the state of the ETL, including IDs, loaded records, and a list of errored IDs. The store can persist state to the local file system during queue processing, enabling an ETL to continue after an interruption.
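As a rough illustration of how persisted state allows a run to resume, the sketch below saves and reloads a JSON snapshot. The state shape (`ids`, `records`, `failedIds`) and the helper names are assumptions made for this example; the real etlStore may store and name things differently.

```javascript
const fs = require('node:fs');

// Hypothetical state shape; the real etlStore may differ.
function emptyState() {
  return { ids: [], records: {}, failedIds: [] };
}

// Write the snapshot to a temporary file, then rename it over the target,
// so an interruption mid-write does not leave a truncated state file.
function saveState(filePath, state) {
  const tmpPath = `${filePath}.tmp`;
  fs.writeFileSync(tmpPath, JSON.stringify(state));
  fs.renameSync(tmpPath, filePath);
}

// Load the last snapshot, or start fresh when none exists.
function loadState(filePath) {
  if (!fs.existsSync(filePath)) return emptyState();
  return JSON.parse(fs.readFileSync(filePath, 'utf8'));
}

// IDs that still need work: neither loaded as a record nor marked as errored.
function pendingIds(state) {
  const failed = new Set(state.failedIds);
  return state.ids.filter(
    (id) => !Object.prototype.hasOwnProperty.call(state.records, id) && !failed.has(id)
  );
}
```

After an interruption, a run under these assumptions would call `loadState` and continue with only the IDs returned by `pendingIds`.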

Environment variables

Environment variables are expected to be managed by the environment in which the application runs. This is a best practice described by the Twelve-Factor App methodology.

Environment variables are used to set application level settings for each environment.

| Variable | Description | Default | Required |
| --- | --- | --- | --- |
| LOG_LEVEL | Log level | Depends on NODE_ENV | |
| NODE_ENV | Node environment | development | |
| OUTPUT_FILE | Filename saved to Azure | etl-data | |
| HITS_PER_HOUR | Maximum number of times to call a queue operation per hour | 20000 | |
| ETL_NAME | Name used in Bunyan logger | etl-toolkit | |
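
A minimal sketch of reading these settings with their documented defaults follows. `readConfig` and `minDelayMs` are hypothetical helpers, not functions exported by the toolkit. The LOG_LEVEL fallback (`info` in production, `debug` otherwise) is an assumption, since the default is only described as depending on NODE_ENV, and spacing calls evenly across the hour is just one possible way to honor HITS_PER_HOUR.

```javascript
// Read settings from an environment object, falling back to the documented defaults.
function readConfig(env = process.env) {
  const nodeEnv = env.NODE_ENV || 'development';
  return {
    nodeEnv,
    // Assumed mapping: the documented default only "depends on NODE_ENV".
    logLevel: env.LOG_LEVEL || (nodeEnv === 'production' ? 'info' : 'debug'),
    outputFile: env.OUTPUT_FILE || 'etl-data',
    // Missing, non-numeric, or zero values fall back to the default.
    hitsPerHour: Number(env.HITS_PER_HOUR) || 20000,
    etlName: env.ETL_NAME || 'etl-toolkit',
  };
}

// Minimum gap between calls, in milliseconds, if calls are spread evenly over an hour.
function minDelayMs(hitsPerHour) {
  return Math.ceil(3600000 / hitsPerHour);
}
```

With the default of 20000 hits per hour, evenly spaced calls would be 3,600,000 / 20,000 = 180 ms apart.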