1.0.6 • Published 1 year ago

itemizer v1.0.6

Weekly downloads
2
License
MIT
Repository
github
Last release
1 year ago

itemizer

Dice up JS strings by script, emoji and bidi direction. Hand ported from Pango with the help of SheenBidi and emoji-segmenter, both of which are compiled and distributed in WebAssembly form.

A useful text-stack component if you want to do your own font selection and shaping in JS, or if you need more power to customize the display of different scripts or emojis in a browser.

API

The module itself is a promise since WASM loads asynchronously:

const {emoji, script, bidi} = await require('itemizer');

Each of the values on the resolved object have the same API but iterate different things. The emoji iterator stops at boundaries between text and emoji sequences:

const str = 'I 👏🏼 proper i18n support';
let last = 0;
for (const {i, isEmoji} of emoji(str)) {
  console.log(str.slice(last, i)); // logs 3 times
  last = i;
}

The bidi iterator has the same API:

const str = 'Latin is common ពួកគេទាំងអស់   ';

let last = 0;
for (const {i, dir} of bidi(str, 0 /* base level (ex. 1 for RTL) */)) {
  console.log(str.slice(last, i)); // logs 2 times
  last = i;
}

and so does the script iterator's API:

const str = 'Latin is common ពួកគេទាំងអស់   ';

for (const {i, script} of script(str)) {
  console.log(script); // logs 2 times
}

Bigger example

Check out the repo and run

$ node test.js

Building the WebAssembly locally

  1. Get ragel from your package manager
  2. Get WASI (sysroot version)
  3. Get LLVM Clang 9+ from your package manager
  4. Open the makefile and set RL, WASI_SYSROOT CXX according to steps 1, 2, 3
  5. make
1.0.6

1 year ago

1.0.2

2 years ago

1.0.5

1 year ago

1.0.4

1 year ago

1.0.3

1 year ago

1.0.1

2 years ago

1.0.0

2 years ago

0.2.0

2 years ago

0.1.2

3 years ago

0.1.0

3 years ago

0.1.1

3 years ago

0.0.1

4 years ago