1.1.2 • Published 8 years ago
big-query-importer v1.1.2
Google BigQuery Importer
big-query-importer gets all published pages from Clay, maps their data to a schema that Google BigQuery accepts, and imports the data as a stream directly to a specified table within a specified dataset.
Any logic beyond mapping values from Clay to values in BigQuery should be avoided.
Setup
- git clone
- nvm install v6
- npm install
- create keyfile.jsonwith BigQuery account keys- ask another dev or download the big-query-importer service account keys from Google Cloud Platform.
 
Commands
- npm test- runs eslint and mocha tests
- ./bin/cli.js- imports Clay page data to BigQuery- For help run ./bin/cli.js --help
- Normal usage:- Run command for each site
- View imported data in BigQuery UI
 
 
- For help run 
Development
Directory Structure
    app.js              - entrypoint for yargs
    lib/                - main library called by app.js
    modules/            - each type of instance may need a different mapping to big query
        page/           - example of one module for page instances
            schema.json - the app assumes this file describes the Big Query table
            transform.js- the app assunes this file converts composed instance json to big query data objectCode Style
Matches other New York Media repos; linted by eslint.
We are using bluebird for promises and lodash for basic utilities; otherwise vanilla.
TODO
- Write tests for services
- Better documentation
- Tests for modules
- Memory limits
- Import any component into big query e.g. --url http://nymag.com/selectall/components/ads/instances