0.0.6 • Published 12 years ago

hbloom v0.0.6

Weekly downloads
31
License
-
Repository
github
Last release
12 years ago

hBloom

Hierarchical bloom classifier for tagging text with a structured word list.

This has prpbably been solved before by smater people than myself. Given a structured tree of categories and corresponding words/strings, hBloom will create a bloom filter for each level of depth in the data, which returns true/false for all sub categories, in the case of 'true' it will return the matching categories.

##Install

npm install hBloom

##Usage

var hBloom = require('../hBloom');

var myBloom = hBloom( {STRUCTURED DATA} );
var txt = "This post is about celtic and rangers, but mentions villa.";

myBloom.classifyText(txt, function(result){
	console.log(result);
});

##Methods

classifyText( text, callback )

classify( word )

##Structured Data?

The data passed to hBloom({DATA}) should follow the example below:

{
	"racing": {
		"asscot": ["asscot", "ass", "the big race"]
	},
	"football": {
		"manchester united": ["manu", "man united", "mufc", "manchester united", "manufc"],
		"aston villa": ["aston villa", "villa" , "villafc"],
		"manchester city": ["mancity", "manchester city", "cityfc", "man city", "mancityfc"],
		"scottish league": {
			"dundee united": ["dundee", "dundee united", "dundeefc"],
			"rangers": ["rangers","rangersfc"],
			"celtic": ["celtic", "celticfc"]
		}
	}
}

##License

Not sure yet, busy deciding, most likely MIT.

Copyright 2012 Christopher de beer

0.0.6

12 years ago

0.0.5

12 years ago

0.0.4

12 years ago

0.0.3

12 years ago

0.0.2

12 years ago

0.0.1

12 years ago