@siddjain/datastore-writer v0.1.3
README
A thin library to help with bulk loading data into Google Datastore (Firestore in Datastore Mode). When you include this library in your package.json,
you also need to include @google-cloud/datastore. The library does not include it for you and treats it as a peer dependency.
Usage
There are two classes provided by the library: DatastoreWriter and BulkLoader.
DatastoreWriter
DatastoreWriter functions as a buffered writer. You add Datastore entities to it and it automatically batches and writes them to Datastore.
The constructor allows you to set various parameters such as:
buffer_size: controls the buffer size. entities are stored in a buffer and flushed (written to Datastore) when the buffer gets full. Datastore does not allow writing more than 500 entities at a time. see this. defaults to500.max_concurrent_requests: controls the maximum number of concurrent requests made to Google Datastore. It is recommended to keep this well below 100. defaults to10.upsert: controls whether to upsert or insert the entities. if set to true, entities will be upserted else they will be inserted. defaults tofalse.
DatastoreWriter provides following public methods, properties and events:
Methods
add: add an entity toDatastoreWriter. Entities are batched and auto-flushed (written to Datastore) when the buffer gets full.flush: explicitly flush entities in the buffer.ready: returns a Promise that resolves when the number of concurrent requests in progress <max_concurrent_requests.
Properties
isBusy:trueif number of concurrent requests in progress =max_concurrent_requests.
Events
busy: raised when number of concurrent requests in progress becomes equal tomax_concurrent_requests.idle: raised when number of concurrent requests in progress drops belowmax_concurrent_requests.stats: raised after each request to Datastore. Contains following information:start: when the request started (nanoseconds)end: when the request ended (nanoseconds)elapsed:end - startsummed over all the requests ever made (nanoseconds)request_count: number of requests that have been made until now
error: exceptions and errors
BulkLoader
A class that can be used to read data from a text file and load into Datastore. It is a good starting point to learn how to use DatastoreWriter.
The constructor expects a file to read, a Datastore instance and an object that can be used to convert a line from the text file to a
Datastore entity. Some optional parameters are supported:
num_lines: number of lines to read from the given file. set to 0 or undefined to indicate no limit. defaults to0.num_concurrent: sets the maximum number of concurrent requests to Datastore. defaults to10.batch_size: sets the size ofDatastoreWriter's buffer. defaults to500.upsert: controls whether to upsert or insert the entities. if set to true, entities will be upserted else they will be inserted. defaults tofalse.
Following methods, properties and events are supported:
Methods
load: starts loading the data from given file and returns a Promise that resolves when the load is completed.
Example
import { BulkLoader } from '@siddjain/datastore-writer';
let loader = new BulkLoader(csv_file, datastore, converter, options);
console.time('load');
loader.load().then(() => {
console.log('Goodbye!');
console.timeEnd('load');
}).catch(err => {
console.error(err);
console.timeEnd('load');
});