@abbyy-sdk/document-ai v0.1.4
@abbyy-sdk/document-ai
Developer-friendly & type-safe Typescript SDK specifically catered to leverage @abbyy-sdk/document-ai API.
Summary
Document AI API: A modern, simple, and easy-to-integrate OCR and document processing API service
Table of Contents
SDK Installation
The SDK can be installed with either npm, pnpm, bun or yarn package managers.
NPM
npm add @abbyy-sdk/document-aiPNPM
pnpm add @abbyy-sdk/document-aiBun
bun add @abbyy-sdk/document-aiYarn
yarn add @abbyy-sdk/document-ai zod
# Note that Yarn does not install peer dependencies automatically. You will need
# to install zod as shown above.!NOTE This package is published with CommonJS and ES Modules (ESM) support.
Requirements
For supported JavaScript runtimes, please consult RUNTIMES.md.
SDK Example Usage
Example
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();Authentication
Per-Client Security Schemes
This SDK supports the following security scheme globally:
| Name | Type | Scheme | Environment Variable |
|---|---|---|---|
apiKeyAuth | http | HTTP Bearer | DOCUMENTAI_API_KEY_AUTH |
To authenticate with the API the apiKeyAuth parameter must be set when initializing the SDK client instance. For example:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();Available Resources and Operations
documents
models
models.airWaybill
- beginFieldExtraction - Begin Air Waybill Field Extraction
- getExtractedFields - Get Air Waybill Fields
models.arrivalNotice
- beginFieldExtraction - Begin Arrival Notice Field Extraction
- getExtractedFields - Get Arrival Notice Fields
models.bankStatement
- beginFieldExtraction - Begin Bank Statement Field Extraction
- getExtractedFields - Get Bank Statement Fields
models.basicContract
- beginFieldExtraction - Begin Basic Contract Field Extraction
- getExtractedFields - Get Basic Contract Fields
models.billOfLading
- beginFieldExtraction - Begin Bill Of Lading Field Extraction
- getExtractedFields - Get Bill Of Lading Fields
models.brokerageStatement
- beginFieldExtraction - Begin Brokerage Statement Field Extraction
- getExtractedFields - Get Brokerage Statement Fields
models.certificateOfOrigin
- beginFieldExtraction - Begin Certificate Of Origin Field Extraction
- getExtractedFields - Get Certificate Of Origin Fields
models.commercialInvoice
- beginFieldExtraction - Begin Commercial Invoice Field Extraction
- getExtractedFields - Get Commercial Invoice Fields
models.customsDeclaration
- beginFieldExtraction - Begin Customs Declaration Field Extraction
- getExtractedFields - Get Customs Declaration Fields
models.dangerousGoodsDeclaration
- beginFieldExtraction - Begin Dangerous Goods Declaration Field Extraction
- getExtractedFields - Get Dangerous Goods Declaration Fields
models.deliveryNote
- beginFieldExtraction - Begin Delivery Note Field Extraction
- getExtractedFields - Get Delivery Note Fields
models.documentConversion
- beginConversion - Begin Document Conversion
- getConversion - Get Conversion
- downloadConvertedDocument - Download Converted Document
models.hotelInvoice
- beginFieldExtraction - Begin Hotel Invoice Field Extraction
- getExtractedFields - Get Hotel Invoice Fields
models.imageToText
- beginTextExtraction - Begin Image To Text Extraction
- getExtractedText - Get Extracted Text
models.internationalConsignmentNote
- beginFieldExtraction - Begin International Consignment Note Field Extraction
- getExtractedFields - Get International Consignment Note Fields
models.invoice
- beginFieldExtraction - Begin Invoice Field Extraction
- getExtractedFields - Get Invoice Fields
models.packingList
- beginFieldExtraction - Begin Packing List Field Extraction
- getExtractedFields - Get Packing List Fields
models.personalEarningsStatement
- beginFieldExtraction - Begin Personal Earnings Statement Field Extraction
- getExtractedFields - Get Personal Earnings Statement Fields
models.purchaseOrder
- beginFieldExtraction - Begin Purchase Order Field Extraction
- getExtractedFields - Get Purchase Order Fields
models.receipt
- beginFieldExtraction - Begin Receipt Field Extraction
- getExtractedFields - Get Receipt Fields
models.remittanceAdvice
- beginFieldExtraction - Begin Remittance Advice Field Extraction
- getExtractedFields - Get Remittance Advice Fields
models.seaWaybill
- beginFieldExtraction - Begin Sea Waybill Field Extraction
- getExtractedFields - Get Sea Waybill Fields
models.taxiReceipt
- beginFieldExtraction - Begin Taxi Receipt Field Extraction
- getExtractedFields - Get Taxi Receipt Fields
models.usForm1040
- beginFieldExtraction - Begin US Form 1040 Field Extraction
- getExtractedFields - Get US Form 1040 Fields
models.usFormW2
- beginFieldExtraction - Begin US Form W2 Field Extraction
- getExtractedFields - Get US Form W2 Fields
models.utilityBill
- beginFieldExtraction - Begin Utility Bill Field Extraction
- getExtractedFields - Get Utility Bill Fields
Standalone functions
All the methods listed above are available as standalone functions. These functions are ideal for use in applications running in the browser, serverless runtimes or other environments where application bundle size is a primary concern. When using a bundler to build your application, all unused functionality will be either excluded from the final bundle or tree-shaken away.
To read more about standalone functions, check FUNCTIONS.md.
documentsDelete- Delete DocumentdocumentsList- List DocumentsmodelsAirWaybillBeginFieldExtraction- Begin Air Waybill Field ExtractionmodelsAirWaybillGetExtractedFields- Get Air Waybill FieldsmodelsArrivalNoticeBeginFieldExtraction- Begin Arrival Notice Field ExtractionmodelsArrivalNoticeGetExtractedFields- Get Arrival Notice FieldsmodelsBankStatementBeginFieldExtraction- Begin Bank Statement Field ExtractionmodelsBankStatementGetExtractedFields- Get Bank Statement FieldsmodelsBasicContractBeginFieldExtraction- Begin Basic Contract Field ExtractionmodelsBasicContractGetExtractedFields- Get Basic Contract FieldsmodelsBillOfLadingBeginFieldExtraction- Begin Bill Of Lading Field ExtractionmodelsBillOfLadingGetExtractedFields- Get Bill Of Lading FieldsmodelsBrokerageStatementBeginFieldExtraction- Begin Brokerage Statement Field ExtractionmodelsBrokerageStatementGetExtractedFields- Get Brokerage Statement FieldsmodelsCertificateOfOriginBeginFieldExtraction- Begin Certificate Of Origin Field ExtractionmodelsCertificateOfOriginGetExtractedFields- Get Certificate Of Origin FieldsmodelsCommercialInvoiceBeginFieldExtraction- Begin Commercial Invoice Field ExtractionmodelsCommercialInvoiceGetExtractedFields- Get Commercial Invoice FieldsmodelsCustomsDeclarationBeginFieldExtraction- Begin Customs Declaration Field ExtractionmodelsCustomsDeclarationGetExtractedFields- Get Customs Declaration FieldsmodelsDangerousGoodsDeclarationBeginFieldExtraction- Begin Dangerous Goods Declaration Field ExtractionmodelsDangerousGoodsDeclarationGetExtractedFields- Get Dangerous Goods Declaration FieldsmodelsDeliveryNoteBeginFieldExtraction- Begin Delivery Note Field ExtractionmodelsDeliveryNoteGetExtractedFields- Get Delivery Note FieldsmodelsDocumentConversionBeginConversion- Begin Document ConversionmodelsDocumentConversionDownloadConvertedDocument- Download Converted DocumentmodelsDocumentConversionGetConversion- Get ConversionmodelsHotelInvoiceBeginFieldExtraction- Begin Hotel Invoice Field ExtractionmodelsHotelInvoiceGetExtractedFields- Get Hotel Invoice FieldsmodelsImageToTextBeginTextExtraction- Begin Image To Text ExtractionmodelsImageToTextGetExtractedText- Get Extracted TextmodelsInternationalConsignmentNoteBeginFieldExtraction- Begin International Consignment Note Field ExtractionmodelsInternationalConsignmentNoteGetExtractedFields- Get International Consignment Note FieldsmodelsInvoiceBeginFieldExtraction- Begin Invoice Field ExtractionmodelsInvoiceGetExtractedFields- Get Invoice FieldsmodelsPackingListBeginFieldExtraction- Begin Packing List Field ExtractionmodelsPackingListGetExtractedFields- Get Packing List FieldsmodelsPersonalEarningsStatementBeginFieldExtraction- Begin Personal Earnings Statement Field ExtractionmodelsPersonalEarningsStatementGetExtractedFields- Get Personal Earnings Statement FieldsmodelsPurchaseOrderBeginFieldExtraction- Begin Purchase Order Field ExtractionmodelsPurchaseOrderGetExtractedFields- Get Purchase Order FieldsmodelsReceiptBeginFieldExtraction- Begin Receipt Field ExtractionmodelsReceiptGetExtractedFields- Get Receipt FieldsmodelsRemittanceAdviceBeginFieldExtraction- Begin Remittance Advice Field ExtractionmodelsRemittanceAdviceGetExtractedFields- Get Remittance Advice FieldsmodelsSeaWaybillBeginFieldExtraction- Begin Sea Waybill Field ExtractionmodelsSeaWaybillGetExtractedFields- Get Sea Waybill FieldsmodelsTaxiReceiptBeginFieldExtraction- Begin Taxi Receipt Field ExtractionmodelsTaxiReceiptGetExtractedFields- Get Taxi Receipt FieldsmodelsUsForm1040BeginFieldExtraction- Begin US Form 1040 Field ExtractionmodelsUsForm1040GetExtractedFields- Get US Form 1040 FieldsmodelsUsFormW2BeginFieldExtraction- Begin US Form W2 Field ExtractionmodelsUsFormW2GetExtractedFields- Get US Form W2 FieldsmodelsUtilityBillBeginFieldExtraction- Begin Utility Bill Field ExtractionmodelsUtilityBillGetExtractedFields- Get Utility Bill Fields
Pagination
Some of the endpoints in this SDK support pagination. To use pagination, you
make your SDK calls as usual, but the returned response object will also be an
async iterable that can be consumed using the for await...of
syntax.
Here's an example of one such pagination call:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();Retries
Some of the endpoints in this SDK support retries. If you use the SDK without any configuration, it will fall back to the default retry strategy provided by the API. However, the default retry strategy can be overridden on a per-operation basis, or across the entire SDK.
To change the default retry strategy for a single API call, simply provide a retryConfig object to the call:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
}, {
retries: {
strategy: "backoff",
backoff: {
initialInterval: 1,
maxInterval: 50,
exponent: 1.1,
maxElapsedTime: 100,
},
retryConnectionErrors: false,
},
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();If you'd like to override the default retry strategy for all operations that support retries, you can provide a retryConfig at SDK initialization:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
retryConfig: {
strategy: "backoff",
backoff: {
initialInterval: 1,
maxInterval: 50,
exponent: 1.1,
maxElapsedTime: 100,
},
retryConnectionErrors: false,
},
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();Error Handling
Some methods specify known errors which can be thrown. All the known errors are enumerated in the models/errors/errors.ts module. The known errors for a method are documented under the Errors tables in SDK docs. For example, the list method may throw the following errors:
| Error Type | Status Code | Content Type |
|---|---|---|
| errors.BadRequestError | 400 | application/json |
| errors.UnauthorizedError | 401 | application/json |
| errors.TooManyRequestsError | 429 | application/json |
| errors.InternalServerError | 500 | application/json |
| errors.APIError | 4XX, 5XX | */* |
If the method throws an error and it is not captured by the known errors, it will default to throwing a APIError.
import { DocumentAi } from "@abbyy-sdk/document-ai";
import {
BadRequestError,
InternalServerError,
SDKValidationError,
TooManyRequestsError,
UnauthorizedError,
} from "@abbyy-sdk/document-ai/models/errors";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
let result;
try {
result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
} catch (err) {
switch (true) {
// The server response does not match the expected SDK schema
case (err instanceof SDKValidationError): {
// Pretty-print will provide a human-readable multi-line error message
console.error(err.pretty());
// Raw value may also be inspected
console.error(err.rawValue);
return;
}
case (err instanceof BadRequestError): {
// Handle err.data$: BadRequestErrorData
console.error(err);
return;
}
case (err instanceof UnauthorizedError): {
// Handle err.data$: UnauthorizedErrorData
console.error(err);
return;
}
case (err instanceof TooManyRequestsError): {
// Handle err.data$: TooManyRequestsErrorData
console.error(err);
return;
}
case (err instanceof InternalServerError): {
// Handle err.data$: InternalServerErrorData
console.error(err);
return;
}
default: {
// Other errors such as network errors, see HTTPClientErrors for more details
throw err;
}
}
}
}
run();Validation errors can also occur when either method arguments or data returned from the server do not match the expected format. The SDKValidationError that is thrown as a result will capture the raw value that failed validation in an attribute called rawValue. Additionally, a pretty() method is available on this error that can be used to log a nicely formatted multi-line string since validation errors can list many issues and the plain error string may be difficult read when debugging.
In some rare cases, the SDK can fail to get a response from the server or even make the request due to unexpected circumstances such as network conditions. These types of errors are captured in the models/errors/httpclienterrors.ts module:
| HTTP Client Error | Description |
|---|---|
| RequestAbortedError | HTTP request was aborted by the client |
| RequestTimeoutError | HTTP request timed out due to an AbortSignal signal |
| ConnectionError | HTTP client was unable to make a request to a server |
| InvalidRequestError | Any input used to create a request is invalid |
| UnexpectedClientError | Unrecognised or unexpected error |
Server Selection
Override Server URL Per-Client
The default server can be overridden globally by passing a URL to the serverURL: string optional parameter when initializing the SDK client instance. For example:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
serverURL: "https://api.abbyy.com/document-ai",
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();Custom HTTP Client
The TypeScript SDK makes API calls using an HTTPClient that wraps the native
Fetch API. This
client is a thin wrapper around fetch and provides the ability to attach hooks
around the request lifecycle that can be used to modify the request or handle
errors and response.
The HTTPClient constructor takes an optional fetcher argument that can be
used to integrate a third-party HTTP client or when writing tests to mock out
the HTTP client and feed in fixtures.
The following example shows how to use the "beforeRequest" hook to to add a
custom header and a timeout to requests and how to use the "requestError" hook
to log errors:
import { DocumentAi } from "@abbyy-sdk/document-ai";
import { HTTPClient } from "@abbyy-sdk/document-ai/lib/http";
const httpClient = new HTTPClient({
// fetcher takes a function that has the same signature as native `fetch`.
fetcher: (request) => {
return fetch(request);
}
});
httpClient.addHook("beforeRequest", (request) => {
const nextRequest = new Request(request, {
signal: request.signal || AbortSignal.timeout(5000)
});
nextRequest.headers.set("x-custom-header", "custom value");
return nextRequest;
});
httpClient.addHook("requestError", (error, request) => {
console.group("Request Error");
console.log("Reason:", `${error}`);
console.log("Endpoint:", `${request.method} ${request.url}`);
console.groupEnd();
});
const sdk = new DocumentAi({ httpClient });Debugging
You can setup your SDK to emit debug logs for SDK requests and responses.
You can pass a logger that matches console's interface as an SDK option.
!WARNING Beware that debug logging will reveal secrets, like API tokens in headers, in log messages printed to a console or files. It's recommended to use this feature only during local development and not in production.
import { DocumentAi } from "@abbyy-sdk/document-ai";
const sdk = new DocumentAi({ debugLogger: console });You can also enable a default debug logger by setting an environment variable DOCUMENTAI_DEBUG to true.