@shumai/shumai_cpu NPM

Fast machine learning in JavaScript with bun + flashlight. ⚠️ This is experimental software! ⚠️


Usage
Install
Build from source
Contributing
Supported operations

Why build this?

Here are some of the areas that shumai hopes to tackle:

Creating datasets
- JavaScript, with native typed arrays and a JIT compiler, is perfect for twiddling with data before it can be made into a big flat GPU-compatible array
Training small models
- FFI bindings in Bun are crazy fast (~3ns), so JS gets out of the way when training small models
Advanced/fine-grained training/inference logic
- Bun uses the JSC JIT compiler, meaning you can confidently write complex training logic without needing a native C++ implementation
Building applications
- JavaScript has a ~~large~~ HUGE ecosystem, which makes application development a lot easier
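
As a rough illustration of the dataset point above, here is a minimal sketch of preparing records in plain JavaScript and packing them into one flat, row-major Float32Array. The `packRows` helper is hypothetical and not part of shumai; it only shows the typed-array side of the workflow.

```typescript
// Hypothetical helper (not part of shumai): pack equally sized rows into
// one flat, row-major Float32Array and report the resulting shape.
function packRows(rows: number[][]): { data: Float32Array; shape: [number, number] } {
  const nRows = rows.length
  const nCols = nRows > 0 ? rows[0].length : 0
  const data = new Float32Array(nRows * nCols)
  rows.forEach((row, r) => {
    if (row.length !== nCols) {
      throw new Error(`row ${r} has ${row.length} values, expected ${nCols}`)
    }
    data.set(row, r * nCols) // copy the row into its row-major slot
  })
  return { data, shape: [nRows, nCols] }
}

// Example: rescale raw records in plain JavaScript, then pack them.
const raw = [
  [0, 50, 100],
  [25, 75, 100],
]
const scaled = raw.map((row) => row.map((v) => v / 100))
const packed = packRows(scaled)
console.log(packed.shape, packed.data)
```

The flat array and its shape are the two pieces a tensor library typically needs; how they are handed to shumai is left out here.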

Usage

shumai will always attempt to use an attached GPU or accelerator, so it should be quite fast. If it seems slow, please file an issue!

Standard array utilities:

import * as sm from "@shumai/shumai"

// create a 1024 by 1024 tensor, randomly filled with normal distribution
let X = sm.randn([1024, 1024])
let W = sm.identity(1024)
let Y = X.matmul(W)
console.log(Y.shape)

Conversion to and from JavaScript native arrays:

const data : Float32Array = new Float32Array(128)
for (let i = 0; i < 128; ++i) {
  data[i] = Math.random()
}

const X : Tensor = sm.tensor(data)
const pi = sm.scalar(3.14)
const Y = X.mul(pi)

// tensors can be converted back to native JavaScript
const Y_data = Y.toFloat32Array()

// scalar tensors can be converted to JavaScript numbers
const total : number = X.sum().toFloat32()
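
One thing to keep in mind when converting: a Float32Array stores single-precision values, while JavaScript numbers are double precision. The sketch below uses only standard typed arrays (no shumai calls) to show the rounding. It assumes, based on their names, that toFloat32Array() and toFloat32() return single-precision values.

```typescript
// Storing a JavaScript number (a 64-bit float) in a Float32Array rounds it
// to the nearest 32-bit float.
const buf = new Float32Array(1)
buf[0] = 3.14
const roundTripped = buf[0]

console.log(roundTripped === 3.14)              // false
console.log(roundTripped === Math.fround(3.14)) // true

// Compare with a tolerance instead of strict equality.
const close = Math.abs(roundTripped - 3.14) < 1e-6
console.log(close)                              // true
```

When checking results that came back from a tensor, a tolerance-based comparison like the last line is usually the safer choice.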

Gradients:

const W = sm.randn([128, 128])
W.requires_grad = true

const X = sm.randn([128, 128])
const diff = X.sub(W)
const mse = diff.mul(diff).sum()
mse.backward()

W.grad // this gradient is now populated

// copy W without allowing gradient updates
const Y = W.detach()
Y.sum().backward() // nothing changes
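
For reference, the loss above is the sum of (X - W)^2 over all elements, so its derivative with respect to each element of W is -2 * (X - W). The sketch below checks that formula against central finite differences using plain arrays. It does not call shumai; it only illustrates what W.grad would be expected to hold under standard reverse-mode differentiation.

```typescript
// Plain-array stand-ins for the X and W tensors (4 elements instead of 128x128).
const x = [0.5, -1.0, 2.0, 0.25]
const w = [1.0, 0.0, -0.5, 0.25]

// loss(w) = sum_i (x_i - w_i)^2, mirroring diff.mul(diff).sum()
function loss(wv: number[]): number {
  return wv.reduce((acc, wi, i) => acc + (x[i] - wi) ** 2, 0)
}

// Analytic gradient: d loss / d w_i = -2 * (x_i - w_i)
const analytic = w.map((wi, i) => -2 * (x[i] - wi))

// Central finite differences as an independent check
const eps = 1e-6
const numeric = w.map((_, i) => {
  const plus = w.slice()
  const minus = w.slice()
  plus[i] += eps
  minus[i] -= eps
  return (loss(plus) - loss(minus)) / (2 * eps)
})

console.log(analytic, numeric)
```

The two arrays should agree up to floating-point error, which is a handy sanity check when writing custom training logic.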

Some more examples can be found here.

Supported operators can be found here.

Install

This is currently a work in progress! If you have any problems building or installing, please file an issue.

Ensure you have bun installed (https://bun.sh).

On macOS:

brew install arrayfire
bun install @shumai/shumai

On Linux (Ubuntu/Debian):

sudo apt install arrayfire-cuda3-cuda-11-6
bun install @shumai/shumai

Installing local build from source

Note: not required when developing the library locally

This process requires building the FFI for bun and then running npm pack to generate a @shumai/shumai_*.tgz package. You can then run npm install $PATH_TO_SOURCE/@shumai/shumai-*.tgz to install the package wherever you'd like.

macOS from source

Install flashlight

mkdir -p $HOME/usr/ # installing flashlight here
brew install arrayfire
git clone --recursive --depth 1 https://github.com/bwasti/flashlight.git
cd flashlight
mkdir -p build
cd build
cmake .. -DFL_ARRAYFIRE_USE_CPU=ON -DFL_BUILD_DISTRIBUTED=OFF -DFL_USE_ONEDNN=OFF -DFL_BUILD_TESTS=OFF -DFL_BUILD_EXAMPLES=OFF -DFL_BUILD_SCRIPTS=OFF -DCMAKE_INSTALL_PREFIX=$HOME/usr/
make -j$(sysctl -n hw.ncpu) # macOS does not ship nproc by default
make install

Build bindings for shumai

cd shumai
mkdir -p build
cd build
cmake .. -Dflashlight_DIR=$HOME/usr/share/flashlight/cmake/
make -j$(sysctl -n hw.ncpu) # macOS does not ship nproc by default

(You can record a performance profile with: xcrun xctrace record --template "Time Profiler" --launch $(which bun) train.js)

Linux from source

First, install flashlight.

Then, build bindings for shumai:

mkdir -p build && cd build
# CMAKE_BUILD_TYPE can be changed as needed
cmake .. \
    -DBUILD_SHARED_LIBS=ON \
    -DCMAKE_BUILD_TYPE=RelWithDebInfo \
    -Dflashlight_DIR=${FLASHLIGHT_INSTALL_PREFIX}/share/flashlight/cmake \
    -DArrayFire_DIR=${ARRAYFIRE_INSTALL_PREFIX}/share/ArrayFire/cmake
make -j$(nproc)

Contributing

If you'd like to make changes to the code, first build from source.

All files ending in *.inl or *_gen.ts are generated. These can be modified by editing scripts/gen_binding.py and running ./scripts/gen_all_binding.sh.

See the CONTRIBUTING file for style guidance and more info on how to help out. 😁

Supported operations

Some operations are supported as both static functions and methods on existing tensors.

Operation	Function	Tensor Method (`t : Tensor`)
rand	`rand(shape: number[]) : Tensor`
randn	`randn(shape: number[]) : Tensor`
full	`full(shape: number[], val: number) : Tensor`
identity	`identity(dim: number) : Tensor`
arange	`arange(start: number, end: number, step: number = 1) : Tensor`
iota	`iota(dims: number[], tileDims: number[] = [1]) : Tensor`
reshape	`reshape(tensor: Tensor, shape: number[]) : Tensor`	`t.reshape(shape: number[]) : Tensor`
transpose	`transpose(tensor: Tensor, axes: number[]) : Tensor`	`t.transpose(axes: number[]) : Tensor`
tile	`tile(tensor: Tensor, shape: number[]) : Tensor`	`t.tile(shape: number[]) : Tensor`
nonzero	`nonzero(tensor: Tensor) : Tensor`	`t.nonzero() : Tensor`
negative	`negative(tensor: Tensor) : Tensor`	`t.negative() : Tensor`
logicalNot	`logicalNot(tensor: Tensor) : Tensor`	`t.logicalNot() : Tensor`
exp	`exp(tensor: Tensor) : Tensor`	`t.exp() : Tensor`
log	`log(tensor: Tensor) : Tensor`	`t.log() : Tensor`
log1p	`log1p(tensor: Tensor) : Tensor`	`t.log1p() : Tensor`
sin	`sin(tensor: Tensor) : Tensor`	`t.sin() : Tensor`
cos	`cos(tensor: Tensor) : Tensor`	`t.cos() : Tensor`
sqrt	`sqrt(tensor: Tensor) : Tensor`	`t.sqrt() : Tensor`
tanh	`tanh(tensor: Tensor) : Tensor`	`t.tanh() : Tensor`
floor	`floor(tensor: Tensor) : Tensor`	`t.floor() : Tensor`
ceil	`ceil(tensor: Tensor) : Tensor`	`t.ceil() : Tensor`
rint	`rint(tensor: Tensor) : Tensor`	`t.rint() : Tensor`
absolute	`absolute(tensor: Tensor) : Tensor`	`t.absolute() : Tensor`
abs	`abs(tensor: Tensor) : Tensor`	`t.abs() : Tensor`
sigmoid	`sigmoid(tensor: Tensor) : Tensor`	`t.sigmoid() : Tensor`
erf	`erf(tensor: Tensor) : Tensor`	`t.erf() : Tensor`
flip	`flip(tensor: Tensor, dim: number) : Tensor`	`t.flip(dim: number) : Tensor`
clip	`clip(tensor: Tensor, low: Tensor, high: Tensor) : Tensor`	`t.clip(low: Tensor, high: Tensor) : Tensor`
roll	`roll(tensor: Tensor, shift: number, axis: number) : Tensor`	`t.roll(shift: number, axis: number) : Tensor`
isnan	`isnan(tensor: Tensor) : Tensor`	`t.isnan() : Tensor`
isinf	`isinf(tensor: Tensor) : Tensor`	`t.isinf() : Tensor`
sign	`sign(tensor: Tensor) : Tensor`	`t.sign() : Tensor`
tril	`tril(tensor: Tensor) : Tensor`	`t.tril() : Tensor`
triu	`triu(tensor: Tensor) : Tensor`	`t.triu() : Tensor`
where	`where(cond: Tensor, x: Tensor, y: Tensor) : Tensor`	`t.where(x: Tensor, y: Tensor) : Tensor`
sort	`sort(tensor: Tensor, dim: number) : Tensor`	`t.sort(dim: number) : Tensor`
add	`add(tensor: Tensor, other: Tensor) : Tensor`	`t.add(other: Tensor) : Tensor`
sub	`sub(tensor: Tensor, other: Tensor) : Tensor`	`t.sub(other: Tensor) : Tensor`
mul	`mul(tensor: Tensor, other: Tensor) : Tensor`	`t.mul(other: Tensor) : Tensor`
div	`div(tensor: Tensor, other: Tensor) : Tensor`	`t.div(other: Tensor) : Tensor`
eq	`eq(tensor: Tensor, other: Tensor) : Tensor`	`t.eq(other: Tensor) : Tensor`
neq	`neq(tensor: Tensor, other: Tensor) : Tensor`	`t.neq(other: Tensor) : Tensor`
lessThan	`lessThan(tensor: Tensor, other: Tensor) : Tensor`	`t.lessThan(other: Tensor) : Tensor`
lessThanEqual	`lessThanEqual(tensor: Tensor, other: Tensor) : Tensor`	`t.lessThanEqual(other: Tensor) : Tensor`
greaterThan	`greaterThan(tensor: Tensor, other: Tensor) : Tensor`	`t.greaterThan(other: Tensor) : Tensor`
greaterThanEqual	`greaterThanEqual(tensor: Tensor, other: Tensor) : Tensor`	`t.greaterThanEqual(other: Tensor) : Tensor`
logicalOr	`logicalOr(tensor: Tensor, other: Tensor) : Tensor`	`t.logicalOr(other: Tensor) : Tensor`
logicalAnd	`logicalAnd(tensor: Tensor, other: Tensor) : Tensor`	`t.logicalAnd(other: Tensor) : Tensor`
mod	`mod(tensor: Tensor, other: Tensor) : Tensor`	`t.mod(other: Tensor) : Tensor`
bitwiseAnd	`bitwiseAnd(tensor: Tensor, other: Tensor) : Tensor`	`t.bitwiseAnd(other: Tensor) : Tensor`
bitwiseOr	`bitwiseOr(tensor: Tensor, other: Tensor) : Tensor`	`t.bitwiseOr(other: Tensor) : Tensor`
bitwiseXor	`bitwiseXor(tensor: Tensor, other: Tensor) : Tensor`	`t.bitwiseXor(other: Tensor) : Tensor`
lShift	`lShift(tensor: Tensor, other: Tensor) : Tensor`	`t.lShift(other: Tensor) : Tensor`
rShift	`rShift(tensor: Tensor, other: Tensor) : Tensor`	`t.rShift(other: Tensor) : Tensor`
minimum	`minimum(tensor: Tensor, other: Tensor) : Tensor`	`t.minimum(other: Tensor) : Tensor`
maximum	`maximum(tensor: Tensor, other: Tensor) : Tensor`	`t.maximum(other: Tensor) : Tensor`
power	`power(tensor: Tensor, other: Tensor) : Tensor`	`t.power(other: Tensor) : Tensor`
matmul	`matmul(tensor: Tensor, other: Tensor) : Tensor`	`t.matmul(other: Tensor) : Tensor`
amin	`amin(tensor: Tensor, axes: number[] = [], keep_dims: boolean = false) : Tensor`	`t.amin(axes: number[] = [], keep_dims: boolean = false) : Tensor`
amax	`amax(tensor: Tensor, axes: number[] = [], keep_dims: boolean = false) : Tensor`	`t.amax(axes: number[] = [], keep_dims: boolean = false) : Tensor`
argmin	`argmin(tensor: Tensor, axis: number, keep_dims: boolean = false) : Tensor`	`t.argmin(axis: number, keep_dims: boolean = false) : Tensor`
argmax	`argmax(tensor: Tensor, axis: number, keep_dims: boolean = false) : Tensor`	`t.argmax(axis: number, keep_dims: boolean = false) : Tensor`
sum	`sum(tensor: Tensor, axes: number[] = [], keep_dims: boolean = false) : Tensor`	`t.sum(axes: number[] = [], keep_dims: boolean = false) : Tensor`
cumsum	`cumsum(tensor: Tensor, axis: number) : Tensor`	`t.cumsum(axis: number) : Tensor`
mean	`mean(tensor: Tensor, axes: number[] = [], keep_dims: boolean = false) : Tensor`	`t.mean(axes: number[] = [], keep_dims: boolean = false) : Tensor`
median	`median(tensor: Tensor, axes: number[] = [], keep_dims: boolean = false) : Tensor`	`t.median(axes: number[] = [], keep_dims: boolean = false) : Tensor`
var	`var(tensor: Tensor, axes: number[] = [], bias: boolean = false, keep_dims: boolean = false) : Tensor`	`t.var(axes: number[] = [], bias: boolean = false, keep_dims: boolean = false) : Tensor`
std	`std(tensor: Tensor, axes: number[] = [], keep_dims: boolean = false) : Tensor`	`t.std(axes: number[] = [], keep_dims: boolean = false) : Tensor`
norm	`norm(tensor: Tensor, axes: number[] = [], p: number = 2, keep_dims: boolean = false) : Tensor`	`t.norm(axes: number[] = [], p: number = 2, keep_dims: boolean = false) : Tensor`
countNonzero	`countNonzero(tensor: Tensor, axes: number[] = [], keep_dims: boolean = false) : Tensor`	`t.countNonzero(axes: number[] = [], keep_dims: boolean = false) : Tensor`
any	`any(tensor: Tensor, axes: number[] = [], keep_dims: boolean = false) : Tensor`	`t.any(axes: number[] = [], keep_dims: boolean = false) : Tensor`
all	`all(tensor: Tensor, axes: number[] = [], keep_dims: boolean = false) : Tensor`	`t.all(axes: number[] = [], keep_dims: boolean = false) : Tensor`
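
Several reductions in the table take `axes` and `keep_dims` arguments. The table does not spell out their semantics, so the sketch below assumes the common NumPy-style convention: the listed axes are reduced away, and `keep_dims` retains each reduced axis with length 1. The `sumAxis` function is a hypothetical reference written against plain typed arrays, not shumai code, and it handles a single axis of a 2-D array only.

```typescript
// Hypothetical reference implementation (not shumai code): sum a row-major
// 2-D array along one axis, following the NumPy-style convention.
function sumAxis(
  data: Float32Array,
  shape: [number, number],
  axis: 0 | 1,
  keepDims = false
): { data: Float32Array; shape: number[] } {
  const [rows, cols] = shape
  const out = new Float32Array(axis === 0 ? cols : rows)
  for (let r = 0; r < rows; ++r) {
    for (let c = 0; c < cols; ++c) {
      // axis 0 collapses rows (one sum per column); axis 1 collapses columns
      out[axis === 0 ? c : r] += data[r * cols + c]
    }
  }
  // keepDims retains the reduced axis with length 1; otherwise it is dropped
  const outShape = keepDims
    ? (axis === 0 ? [1, cols] : [rows, 1])
    : [out.length]
  return { data: out, shape: outShape }
}

const m = new Float32Array([1, 2, 3, 4, 5, 6]) // shape [2, 3]
const colSums = sumAxis(m, [2, 3], 0)          // data [5, 7, 9], shape [3]
const rowSums = sumAxis(m, [2, 3], 1, true)    // data [6, 15], shape [2, 1]
console.log(colSums, rowSums)
```

If shumai's behavior differs from this convention, the shapes printed by `t.sum(...).shape` are the authoritative answer.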

License

shumai is MIT licensed, as found in the LICENSE file.
