0.0.5 • Published 10 years ago

tkn v0.0.5

Weekly downloads
45
License
MIT
Repository
github
Last release
10 years ago

tkn

Simple word tokeniser that ignores punctuation. Returning an Array of words.

Usage

tokenise(text, noStopWords). Simply returns an array of terms, without punctuation.

  • text is the string (text document) in which the calculations are to be performed on.
  • noStopWords defaults to true. Set to false if you want to include stop words–e.g words such as "I" and "the".
var tkn = require('tkn');

var str = "you're simply a test, a mere test";
var tokenised = tkn.tokenise(str);
>> ['simply', 'test', 'mere', 'test']
0.0.5

10 years ago

0.0.4

10 years ago

0.0.3

10 years ago

0.0.2

10 years ago

0.0.1

10 years ago