file-format-parser v2.1.1
file-format-parser

File format parser which makes it easy to parse binary file formats through data schemas.
The primary goal of ffp is to make it easy to make file format parsers and writers. This is done through defining sequences of entities (data formats) and types (data readers/writers).
ffp comes with built-in support for several basic data types, and allows you to define other data types as needed.
installation
npm i file-format-parser --saveusage
let ffp = require('file-format-parser');
// ffp.addDataType - adds a data type to ffp
ffp.addDataType('string', {
read: (stream, entity, store) => {
let uint8 = ffp.getDataType('uint8'),
length = uint8.read(stream, entity, store),
buf = stream.read(length);
return buf.toString('ascii');
},
write: (stream, entity, data, context) => {
let uint8 = ffp.getDataType('uint8');
uint8.write(stream, entity, data.length, context);
stream.write(Buffer.alloc(data.length, data, 'ascii'));
}
});
// ffp.addDataFormat - adds a data format to ffp
ffp.addDataFormat('MyFileType', [{
type: 'array',
count: {type: 'uint8'},
entry: {type: 'uint8'},
storageKey: 'someBytes'
}, {
type: 'string',
storageKey: 'aString'
}]);
let data = ffp.parseFile('path/to/file', 'MyFileType');
console.log(data.someBytes); // array of uint8 numbers
console.log(data.aString); // a string parsed after the arraySee the tests for more thorough code examples. E.g. writeFile.test.js shows how to read and write ico files.
api
entities
At a fundamental level, ffp functions by parsing entity definitions. Entity definitions are JavaScript objects with properties which instruct ffp and data readers on how to read, write, and store data.
The following base properties are part of how ffp works with entities. Additional properties may be expected by data type readers/writers.
type- Required. The data type to use to read/write the entity.storageKey- Recommended. The key where the data will be stored when reading, and is expected to exist when writing. You should only omit thestorageKeyfor entities that use types which data storage included in their readers and writers.
schemas
In ffp, a schema is just a sequence of entities. Schemas can be nested inside of each other using the array, record, and union data types. Schemas are also referred to as "data formats" internally.
provided data types
ffp provides several data types out of the box to help you get started parsing files. The data types provided are for standard use cases and aren't intended to be complete.
uint8- 1 byte unsigned integer.uint16- 2 byte unsigned integer. (endian type)uint32- 4 byte unsigned integer. (endian type)int16- 2 byte signed integer. (endian type)int32- 4 byte signed integer. (endian type)float- 4 byte floating point number. (endian type)array- Array of any other type. The size of the array is expected before the contents of the array. The type used for the size can any integer type. Entity properties:count- entity for the count.countKey- key where the count can be found, if it was parsed as a separate entity. Takes precedence overcount(provide one or the other).entry- entity for array entries. Required.
record- A reference to a data format registered withffp.addDataFormat. Entity properties:format- The name of the data format to use when reading/writing the record.
union- Switches between parsing different data types based on a previously parsed value. Entity properties:switchKey- The key to the data to "switch" on. Resolved relative to the entities data storage context.cases- Object which is used to determine the entity to parse. Keys should correspond to possible data switch values. Values should be entity objects.
data reader functions
Data reader functions are passed to ffp.addDataType and ffp.addEndianType and are used to read a data type from a stream. They use the following arguments:
stream- TheSyncReadableStreamto read the data from. Usestream.read(n)to readnbytes from the stream (returns aBuffer).entity- The current entity you're reading data for.context- The current data storage context of the reader.
data writer functions
Data writer functions are passed to ffp.addDataType and ffp.addEndianType and are used to write a data type to a stream. They use the following arguments:
stream- TheSyncWriteableStreamto write the data to. Usestream.write(buf)to write the Bufferbufto the stream.entity- The current entity you're writing data for.data- The data you are writing to the stream.context- The current data storage context of the writer.
ffp.setEndianness(endianness)
Sets the endianness to be used by endian types.
Arguments:
endianness-'LE'for Little Endian or'BE'for Big Endian.
ffp.parseFile(filePath, schema)
Loads and parses the file at filePath according to schema. Returns the parsed data once parsing is completed.
Arguments:
filePath- Path to the file to load and parse.schema- Name of the schema to use when parsing the file or the schema itself.
ffp.writeFile(filePath, schema, data)
Writes a file to filePath according to schema and data.
Arguments:
filePath- Path to the file to write to disk.schema- Name of the schema to use when parsing the file or the schema itself.data- Data structure to write to disk. Should have every property expected by theschema.
ffp.addDataType(name, type)
Adds a data type to ffp.
Arguments:
name- Unique string identifier for the data type.type- Object which contains a data reader and a data writer for the type. Properties:read- Data reader function called to read the data type from a stream.write- Data writer function called to write the data type to the stream.
ffp.addEndianType(name, endianType)
Adds an endian data type to ffp.
Arguments:
name- Unique string identifier for the data type.endianType- Object which contains little endian and big endian data readers and a data writers for the type. Properties:read- Object containing data readers for the type. Properties:LE- Data reader function called to read data from the stream in little endian byte order.BE- Data reader function called to read data from the stream in big endian byte order.
write- Object containing data writers for the type. Properties:LE- Data writer function called to write data to the stream in little endian byte order.BE- Data writer function called to write data to the stream in big endian byte order.
ffp.getDataType(name)
Gets a data type from ffp matching name. Returns the data type if found.
Arguments:
name- Unique string identifier of the data type to get.
ffp.addDataFormat(name, format)
Adds a data format to ffp.
Arguments:
name- Unique string identifier for the data format.format- Schema array, as described in the schemas section.
ffp.getDataFormat(name)
Gets a data format from ffp matching name. Returns the data format if found.
Arguments:
name- Unique string identifier of the data format to get.