@pg-nano/pg-parser v16.1.5
@pg-nano/pg-parser
A fork of libpg-query with best-in-class type definitions and AST utilities.
import { parseQuery } from "@pg-nano/pg-parser"
const ast = await parseQuery("SELECT 1; SELECT 2")
// ^? ParseResult
ast.version // => 160001
ast.stmts // => [{ stmt: SelectStmt, stmt_len: 8 }, { stmt: SelectStmt, stmt_location: 9 }]
Install
pnpm add @pg-nano/pg-parser
The major and minor version of this package is meant to be aligned with the supported PostgreSQL major and minor version. Older and newer versions of PostgreSQL may be compatible, but this is not guaranteed.
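To check the alignment at runtime, the `version` field of a parse result (160001 in the example at the top) can be decoded. This is a minimal sketch, assuming the field follows PostgreSQL's numeric version format for PostgreSQL 10 and later (major × 10000 + minor). `decodePgVersion` is a hypothetical helper and is not exported by this package.

```typescript
// Hypothetical helper, not part of @pg-nano/pg-parser.
interface PgVersion {
  major: number
  minor: number
}

// Assumption: PostgreSQL 10+ encodes its version as major * 10000 + minor.
function decodePgVersion(versionNum: number): PgVersion {
  return {
    major: Math.floor(versionNum / 10000),
    minor: versionNum % 10000,
  }
}

// 160001 is the `ast.version` value shown in the example at the top.
const decoded = decodePgVersion(160001)
console.log(`${decoded.major}.${decoded.minor}`) // prints "16.1"
```

The result, 16.1, matches the major and minor components of the package version.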
Warning: This package may be improved in ways that break your type guards. You'll probably want to pin to a specific version to avoid unexpected type changes.
Upon install, the pre-compiled binary for your operating system and architecture will be pulled from GitHub Releases.
API
This package exports the following native functions:
- parseQuery (for async parsing a SQL string of one or more statements)
- parseQuerySync
- parsePlPgSQL (for async parsing a plpgsql string)
- parsePlPgSQLSync
- fingerprint (for generating a unique string for a SQL string)
- fingerprintSync
- scanSync (for scanning a SQL string and returning a list of tokens)
- splitWithScannerSync (for splitting a SQL string into one or more statements)
Note: There is no deparse function (for turning an AST back into a string) included, as this isn't needed for my use case.
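Without a deparse function, one way to get the text of a statement back is to slice the original SQL using the `stmt_location` and `stmt_len` fields shown in the parse result at the top. The sketch below works on a hand-written `stmts` array so that it runs without the native binding. `sliceStatements` is a hypothetical helper, and it assumes that an omitted `stmt_location` means 0 and an omitted `stmt_len` means "up to the end of the input".

```typescript
// One entry of `ast.stmts`, reduced to the two offset fields.
interface StmtRange {
  stmt_location?: number
  stmt_len?: number
}

// Hypothetical helper, not part of @pg-nano/pg-parser.
// Assumption: the offsets are byte offsets, which only coincide with
// JavaScript string indices when the input is ASCII-only.
function sliceStatements(sql: string, stmts: StmtRange[]): string[] {
  return stmts.map((s) => {
    const start = s.stmt_location ?? 0
    const end = s.stmt_len ? start + s.stmt_len : sql.length
    return sql.slice(start, end).trim()
  })
}

const sql = "SELECT 1; SELECT 2"
// Same offsets as in the parse result at the top of this README.
const statements = sliceStatements(sql, [{ stmt_len: 8 }, { stmt_location: 9 }])
console.log(statements) // => ["SELECT 1", "SELECT 2"]
```

For input containing non-ASCII characters, the slicing would need to happen on the UTF-8 encoded bytes instead of the string.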
AST utilities
I've implemented some TypeScript utilities for working with the AST:
- walk (for traversing the AST)
- select (for type-safe, deep field access through dot-notation)
- $ (for type-safe node proxy and type guards)
Walking the AST
Let's explore the walk function, ideal for AST traversal where you're only concerned with specific node types. You can pass a callback or a visitor object. You can return false to not walk into the children of the current node.
Each node passed to your visitor is wrapped in a NodePath instance, which tracks the parent node and provides type guards (e.g. isSelectStmt) for type narrowing. You can access the underlying node with path.node.
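As a mental model for the parent tracking and the `return false` behavior, here is a toy walker over plain objects. It is not this package's implementation: `toyWalk`, `ToyPath`, and the hand-written tree are hypothetical, and the sketch assumes each node is wrapped in a single capitalized key (e.g. `{ ColumnRef: { ... } }`).

```typescript
// Toy model only; the real walk and NodePath live in @pg-nano/pg-parser.
interface ToyPath {
  tag: string
  node: Record<string, unknown>
  parent: ToyPath | null
}

type ToyVisitor = (path: ToyPath) => boolean | void

function toyWalk(value: unknown, visit: ToyVisitor, parent: ToyPath | null = null): void {
  if (Array.isArray(value)) {
    for (const item of value) toyWalk(item, visit, parent)
    return
  }
  if (typeof value !== "object" || value === null) return

  const entries = Object.entries(value as Record<string, unknown>)
  const first = entries[0]
  // A wrapped node has exactly one capitalized key holding an object.
  if (
    entries.length === 1 &&
    first !== undefined &&
    /^[A-Z]/.test(first[0]) &&
    typeof first[1] === "object" &&
    first[1] !== null
  ) {
    const path: ToyPath = { tag: first[0], node: first[1] as Record<string, unknown>, parent }
    if (visit(path) === false) return // skip this node's children
    for (const child of Object.values(path.node)) toyWalk(child, visit, path)
    return
  }
  // Plain object (not a node wrapper): descend with the same parent.
  for (const child of Object.values(value as Record<string, unknown>)) {
    toyWalk(child, visit, parent)
  }
}

const tree = {
  SelectStmt: {
    targetList: [
      { ResTarget: { val: { ColumnRef: { fields: [{ String: { sval: "id" } }] } } } },
    ],
    whereClause: { A_Expr: { kind: "AEXPR_OP" } },
  },
}

const visited: string[] = []
toyWalk(tree, (path) => {
  visited.push(path.tag)
  // Returns false for ColumnRef, so its String child is never visited.
  return path.tag !== "ColumnRef"
})
console.log(visited) // => ["SelectStmt", "ResTarget", "ColumnRef", "A_Expr"]
```

The package's own `walk` is used as follows.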
import { parseQuerySync, walk, NodeTag } from "@pg-nano/pg-parser"
walk(parseQuerySync(sql), (path) => {
  path.tag // string
  path.node // the node object
  path.parent // the parent node
  if (path.isSelectStmt()) {
    // The visitor pattern is also supported.
    walk(path.node.targetList, {
      ColumnRef(path) {
        const id = path.node.fields
          .map((f) => (NodeTag.isString(f) ? f.String.sval : "*"))
          .join(".")
        console.log(id)
      },
    })
    // don't walk into the children
    return false
  }
})
Don't forget the select function, which excels at type-safe field access via dot-notation.
Note: You must not include the node types (i.e. the capitalized names) in the field path.
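To illustrate what "not including the node types" means, here is a toy, untyped model of dot-notation access that skips node-type wrappers at each step. It is a sketch under assumptions, not the package's `select` (which is also type-safe): `toySelect`, `unwrap`, and the hand-written `castExpr` object are hypothetical.

```typescript
// Strip a single capitalized wrapper key: { TypeCast: {...} } -> {...}
function unwrap(value: unknown): unknown {
  if (typeof value !== "object" || value === null || Array.isArray(value)) return value
  const entries = Object.entries(value as Record<string, unknown>)
  const first = entries[0]
  if (entries.length === 1 && first !== undefined && /^[A-Z]/.test(first[0])) {
    return first[1]
  }
  return value
}

// Hypothetical helper: follow a dot-separated field path, unwrapping
// node-type wrappers along the way so the path only names fields.
function toySelect(node: unknown, fieldPath: string): unknown {
  let current: unknown = node
  for (const key of fieldPath.split(".")) {
    current = unwrap(current)
    if (typeof current !== "object" || current === null) return undefined
    current = (current as Record<string, unknown>)[key]
  }
  return current
}

// Hand-written stand-in for an expression node; shapes are assumptions.
const castExpr = {
  TypeCast: {
    arg: { ColumnRef: { fields: [{ String: { sval: "id" } }] } },
    typeName: { names: [{ String: { sval: "text" } }] },
  },
}

// The paths say "arg.fields" and "typeName", never "TypeCast.arg.ColumnRef.fields".
const argFields = toySelect(castExpr, "arg.fields")
const castType = toySelect(castExpr, "typeName")
const missing = toySelect({ ColumnRef: { fields: [] } }, "typeName")
console.log(Array.isArray(argFields), castType !== undefined, missing) // => true true undefined
```

The package's typed `select` is used like this.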
import { select, Expr } from "@pg-nano/pg-parser"
/**
* Given an expression node of many possible types,
* check for a `typeName` field.
*/
const typeName = select(expr as Expr, 'typeName')
//    ^? TypeName | undefined
Similar to select, you may like the $ function for field access. It returns a proxy that makes field access less verbose. It also comes with type guards for all nodes.
import { $, walk } from "@pg-nano/pg-parser"
walk(ast, {
  SelectStmt(path) {
    for (const target of path.node.targetList) {
      const { name, val } = $(target)
      if ($.isColumnRef(val)) {
        console.log(
          name,
          $(val).fields.map(field => {
            return $.isA_Star(field) ? "*" : field.String.sval
          }).join("."),
        )
      }
    }
  }
})
Type definitions
Every possible type that could be returned from libpg_query is defined in ast.ts. If a type is missing, it's probably because libpg_query didn't tell us about it (otherwise, please file an issue).
The type definitions are generated from the srcdata of libpg_query (the C library this package binds to). If you're interested in how they're generated, see scripts/generateTypes.ts and scripts/inferFieldMetadata.ts. For some cases, type definitions are manually specified in scripts/typeMappings.ts.
Other improvements
- Uses prebuild-install to avoid bundling every platform's binaries into the package.
- Added splitWithScannerSync for SQL statement splitting.
- Generated unit tests (see snapshots of every SQL case supported by libpg_query).
Contributing
To generate the type definitions, you can use this command:
pnpm prepare:types
To compile the TypeScript bindings and the C++ addon (and recompile them on file changes), you can use this command:
pnpm dev
Otherwise, pnpm build will compile just once.
If you're editing C++ code, you'll want to have compiledb installed and the clangd extension in VSCode. This enables the clangd language server for features like autocomplete, static analysis, and code navigation.
brew install compiledb
⚠️ Windows support: The binding.gyp file is currently broken for Windows builds. Any help would be appreciated!
License
MIT