Char-encoding-detector NPM

travis static

=====

Port of node-chardet in pure JavaScript without NodeJS specific code. Module is based on ICU project http://site.icu-project.org/, which uses character occurrence analysis to determine the most probable encoding.

Installation

npm i char-encoding-detector

yarn add char-encoding-detector

Usage

To return the encoding with the highest confidence:

import { detectEncoding, detectFileEncoding } from 'char-encoding-detector';
const encoding = detectEncoding(uint8Array);
// or
detectFileEncoding(file).then((encoding) => {});

To return the full list of possible encodings:

import { detectEncoding, detectFileEncoding } from 'char-encoding-detector';

const matches = detectEncoding(uint8Array, { allMatches: true });
// or
detectFileEncoding(file, { allMatches: true }).then((matches) => {});

Working with large data sets

Sometimes, when data set is huge and you want to optimize performance (in tradeoff of less accuracy), you can sample only first N bytes of the buffer.