1.0.0 โ€ข Published 4 years ago

string-breaker v1.0.0

Weekly downloads
4
License
MIT
Repository
github
Last release
4 years ago

string-breaker

Break string into array

  • string array with elements that are a specific width
  • string array with elements for each line
  • string array with element fo reach word

Documentation

For more information view the Documentation

Repository

View the source code on github

Install

npm install --save string-breaker

Usage

import { stringBreaker } from 'string-breaker';

let x = stringBreaker('The quick brown fox jumped over the lazy dog', 5);
// x => ['The q','uick ','brown',' fox ','jumpe','d ove','r the',' lazy',' dog']

x = stringBreaker('Hello World\nNice ๐Ÿ˜‡\nhmm... ', 5);
// x => ['Hello', ' Worl', 'dNice', ' ๐Ÿ˜‡hmm', '... ']

x = stringBreaker('\uD83D\uDE07Hello World\nNice ๐Ÿ˜‡\nhmm...', 6);
// x => ['๐Ÿ˜‡Hello', ' World', 'Nice ๐Ÿ˜‡', 'hmm...']

x = stringBreaker('\uD83D\uDE07Hello World\nNice ๐Ÿ˜‡\r\nhmm...', {
    width: 6,
    lnEnd: lnEndOpt.encode
    });
// x => ['๐Ÿ˜‡Hello', ' World', '\\nNice', ' ๐Ÿ˜‡\\nhm', 'm...']

console.log(x.join('\n'));
// ๐Ÿ˜‡Hello
// World
// \nNice
// ๐Ÿ˜‡\nhm
// m...

x = stringBreaker('\uD83D\uDE07Hello World\nNice ๐Ÿ˜‡\nhmm...', {
    width: 6,
    lnEnd: lnEndOpt.none
    });
// x => [ '๐Ÿ˜‡Hello', ' World', '\nNice ', '๐Ÿ˜‡\nhmm.', '..' ]
console.log(x.join('\n'));
// ๐Ÿ˜‡Hello
// World
//
// Nice 
// ๐Ÿ˜‡
// hmm.
// ..

// replace all extra space with a single space
x = stringBreaker('\uD83D\uDE07    Hello     World\nNice    ๐Ÿ˜‡ \r\nhmm...', {
    width: 5,
    noExSp: true
    });
// x => ['๐Ÿ˜‡ Hel', 'lo Wo', 'rldNi', 'ce ๐Ÿ˜‡h', 'mm...']

Split by End of Line
stringBreaker can split by eol by setting option splitOpt: splitByOpt.line

import { stringBreaker } from 'string-breaker';

// mixing \n and \r will result in the same output
let strSrc = 'Happy cat.';
strSrc += '\nThe quick brown fox jumped over the lazy dog.';
strSrc += '\r\nThe moon is full tonight.\rI like full moons!';

const x = stringBreaker(strSrc, { splitOpt: splitByOpt.line });
// x => [
//  'Happy cat.',
//  'The quick brown fox jumped over the lazy dog.',
//  'The moon is full tonight.',
//  'I like full moons!' ]

Split by Words stringBreaker can split into words by setting option splitOpt: splitByOpt.word

import { stringBreaker } from 'string-breaker';

// mixing \n and \r will result in the same output
let strSrc = 'Happy cat.';
strSrc += '\nThe quick   brown\t\t fox jumped over the lazy dog.';
strSrc += '\r\nThe moon is full tonight.\rI like full moons!';

const x = stringBreaker(strSrc, { splitOpt: splitByOpt.word });
// x => [ 'Happy','cat.','The','quick','brown','fox','jumped',
//        'over','the','lazy','dog.','The','moon','is','full',
//        'tonight.','I','like','full','moons!' ]

stringBreaker by default will use code points to build output; however, this can be switched to character length by setting lenOpt of stringBreaker options.

With options is set to fullwitdh the output is built based upon if the code point for each char that is fullwidth will take two positions.

import { stringBreaker } from 'string-breaker';

let str = '!ๅฏไปฅ็œ‹ๅˆฐไป–็š„ๆฃ•่‰ฒ่…น้ƒจ';
let x = stringBreaker(str, 6);
// x => [ '!ๅฏไปฅ็œ‹ๅˆฐไป–', '็š„ๆฃ•่‰ฒ่…น้ƒจ' ]

x = stringBreaker(str , {
  width: 6,
  lenOpt: widthFlags.fullwidth
});
// all but the first char is a fullwidth
// x => [
//      '!ๅฏไปฅ็œ‹', // \u21 \u53EF \u4EE5 \u770B
//      'ๅˆฐไป–็š„',  // \u5230 \u4ED6 \u7684
//      'ๆฃ•่‰ฒ่…น',  // \u68D5 \u8272 \u8179
//      '้ƒจ'      // \u90E8
//      ]

With options is set to surrogatePair the output is built based upon if the code point for each char that is surrogate pair will take two positions.

import { stringBreaker } from 'string-breaker';

let str = '๐Ÿง€๐Ÿ˜Š๐Ÿ˜€๐Ÿ˜ƒ๐Ÿ˜„๐Ÿ˜';
console.log(str.length); // 12 all surrogate pairs

let x = stringBreaker(str, 3);
// x => [ '๐Ÿง€๐Ÿ˜Š๐Ÿ˜€', '๐Ÿ˜ƒ๐Ÿ˜„๐Ÿ˜' ]

x = stringBreaker(str , {
  width: 5,
  lenOpt: widthFlags.surrogatePair
});
// all the characters are surrogate pair
// because widthFlags.surrogatePair is set the output
// string array returns 6 not 5 due to surrogate pairs not breakable.
// x => [
//               { surrogate } | { surrogate } | { surrogate }
//  '๐Ÿง€๐Ÿ˜Š๐Ÿ˜€', //  \uD83E \uDDC0 | \uD83D \uDE0A | \uD83D \uDE00
//  '๐Ÿ˜ƒ๐Ÿ˜„๐Ÿ˜'  //  \uD83D \uDE03 | \uD83D \uDE04 | \uD83D \uDE01
//  ]

widthFlags can be set to combine options

import { stringBreaker } from 'string-breaker';

let str = '๐Ÿง€๐Ÿ˜Š๐Ÿ˜€ๆฃ•่…น้ƒจ!!';
console.log(str.length); // 11 mix of surrogate pair, halfwidth and fullwidth

let x = stringBreaker(str, 4);
// x => [ '๐Ÿง€๐Ÿ˜Š๐Ÿ˜€ๆฃ•', '่…น้ƒจ!!' ]

x = stringBreaker(str , {
  width: 4,
  lenOpt: widthFlags.fullwidth | widthFlags.surrogatePair
});
// mix of surrogate pair, fullwidth and halfwidth
// x => [
// '๐Ÿง€๐Ÿ˜Š', // \uD83E \uDDC0 \uD83D \uDE0A two surrogate
// '๐Ÿ˜€ๆฃ•', // \uD83D \uDE00 \u68D5 one surrogate and one fullwidth
// '่…น้ƒจ', // \u8179 \u90E8 two fullwidth
// '!!'  // \u21 \u21 two halfwidth
// ]

Example split by width and preserve words

When split using width and flag widthFlags.nearestWord the elements in the array will split where there is a whitespace and not before.

import { stringBreaker } from 'string-breaker';

var str = 'On this day. For this morning, when Gregor\u3000Samsa woke from troubled dreams; he found himself transformed.';
const result: string[] = stringBreaker(str, { width: 10, lenOpt: widthFlags.nearestWord });
const strResult = result.join('\n');
console.log(strResult);
On thisแš€day.
For this morning,
when Gregor
Samsa woke
from troubled
dreams; he
found himself
transformed.