0.0.0 • Published 10 months ago

aws-cdk-neuronx-models v0.0.0

License
Apache-2.0
Repository
github
Last release
10 months ago

AWS CDK Neuronx Models

This construct library is optimized for hosting OSS models on Inferentia2 and Trainium1 instances with SageMaker. You can host a model cost-effectively simply by choosing an instance type strategy.

Install

npm i aws-cdk-neuronx-models

Example

The following code is an example of defining cyberagent/calm3-22b-chat. The construct names differ from model to model, but their usage is basically the same.

Instance type and workers for each strategy

| model | strategy | instance type | number of workers | number of positions | quantization |
| --- | --- | --- | --- | --- | --- |
| cyberagent/calm3-22b-chat | LOW_COST | ml.trn1.2xlarge | 1 | 1024 | s8 |
| cyberagent/calm3-22b-chat | HIGH_PERFORMANCE | ml.inf2.24xlarge | 3 | 4096 | |
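
To make the table above easier to read as configuration, here is a minimal standalone sketch in plain TypeScript of how a strategy could resolve to hosting settings. It is not the aws-cdk-neuronx-models API: the names `Strategy`, `HostingSettings`, `CALM3_22B_CHAT_SETTINGS`, and `resolveSettings` are hypothetical and exist only for illustration. The values are copied from the table.

```typescript
// Hypothetical names throughout: this is NOT the aws-cdk-neuronx-models API,
// only an illustration of the strategy table for cyberagent/calm3-22b-chat.
type Strategy = "LOW_COST" | "HIGH_PERFORMANCE";

interface HostingSettings {
  instanceType: string;
  numberOfWorkers: number;
  numberOfPositions: number;
  // Left undefined when the table lists no quantization for the strategy.
  quantization?: string;
}

// Values taken from the strategy table.
const CALM3_22B_CHAT_SETTINGS: Record<Strategy, HostingSettings> = {
  LOW_COST: {
    instanceType: "ml.trn1.2xlarge",
    numberOfWorkers: 1,
    numberOfPositions: 1024,
    quantization: "s8",
  },
  HIGH_PERFORMANCE: {
    instanceType: "ml.inf2.24xlarge",
    numberOfWorkers: 3,
    numberOfPositions: 4096,
  },
};

// Look up the hosting settings that a given strategy selects.
function resolveSettings(strategy: Strategy): HostingSettings {
  return CALM3_22B_CHAT_SETTINGS[strategy];
}

const lowCost = resolveSettings("LOW_COST");
console.log(`${lowCost.instanceType} with ${lowCost.numberOfWorkers} worker(s)`);
```

Keying the lookup on a string union lets the compiler reject unknown strategy names, so a typo such as `"LOWCOST"` fails at build time rather than at deployment.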