1.0.2 • Published 10 months ago

@kitschpatrol/dict-en-wiktionary


A highly permissive English dictionary sourced from Wiktionary for use with CSpell.

Overview

This is a pre-built CSpell dictionary with 800k+ English words sampled from the Wiktionary project in July 2024.

Getting started

Requirements

Tool                                       Version
cspell                                     >= 6
Code Spell Checker - Visual Studio Code    >= 2

Global Installation

To install globally:

npm install -g @kitschpatrol/dict-en-wiktionary
cspell link add @kitschpatrol/dict-en-wiktionary

To uninstall globally:

cspell link remove @kitschpatrol/dict-en-wiktionary

Manual Installation

To install in a specific project:

npm install -D @kitschpatrol/dict-en-wiktionary

Then add this package's cspell-ext.json file to the import section of your cspell.json configuration file:

{
    // …
    "import": ["@kitschpatrol/dict-en-wiktionary/cspell-ext.json"],
    // …
}
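For projects that build their CSpell configuration from a script rather than editing cspell.json by hand, the same import can be added programmatically. The following is only a sketch: the CSpellConfigSubset type and the withWiktionaryImport helper are hypothetical names introduced for illustration, not part of this package or of CSpell.

```typescript
// Hypothetical minimal view of a CSpell configuration object.
// Only the "import" field matters here; all other settings pass through.
interface CSpellConfigSubset {
  import?: string | string[];
  [key: string]: unknown;
}

// Path taken from the manual installation instructions above.
const WIKTIONARY_EXT = "@kitschpatrol/dict-en-wiktionary/cspell-ext.json";

// Return a copy of the config whose "import" list contains the dictionary
// extension exactly once, leaving every other setting untouched.
function withWiktionaryImport(config: CSpellConfigSubset): CSpellConfigSubset {
  const existing: string[] =
    config.import === undefined
      ? []
      : Array.isArray(config.import)
        ? config.import
        : [config.import];
  const imports =
    existing.indexOf(WIKTIONARY_EXT) !== -1
      ? existing
      : [...existing, WIKTIONARY_EXT];
  return { ...config, import: imports };
}
```

Calling the helper twice on the same configuration leaves a single entry in the import list, so it is safe to run in a repeated setup step.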

Implementation notes

This repository's structure is based on examples in the cspell-dicts repo.

No censorship and almost no curation is performed when generating the dictionary file, with some minor exceptions:

  • Certain Wiktionary categories, such as "Intentional Misspellings", are excluded.
  • Some parts of speech, such as "symbol", are excluded.
  • Certain Wiktionary tags, such as "archaic", are excluded.

See the generate-dictionary.ts script for additional details.
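The exclusion rules listed above could be sketched roughly as follows. This is an illustration under stated assumptions, not the actual generate-dictionary.ts logic: the Entry shape is a hypothetical simplification of the source records, the function names are invented here, and the exclusion sets contain only the single examples named in the list.

```typescript
// Hypothetical, simplified entry shape. The real source records and the
// actual generate-dictionary.ts script may be structured differently.
interface Entry {
  word: string;
  pos: string; // part of speech
  categories: string[];
  tags: string[];
}

// Illustrative exclusion sets holding only the examples named above.
const excludedCategories = new Set<string>(["Intentional Misspellings"]);
const excludedPartsOfSpeech = new Set<string>(["symbol"]);
const excludedTags = new Set<string>(["archaic"]);

// An entry is kept unless it matches any one of the three exclusion rules.
function isIncluded(entry: Entry): boolean {
  if (excludedPartsOfSpeech.has(entry.pos)) return false;
  if (entry.categories.some((c) => excludedCategories.has(c))) return false;
  if (entry.tags.some((t) => excludedTags.has(t))) return false;
  return true;
}

// Collect the unique surviving words in sorted order.
function buildWordList(entries: Entry[]): string[] {
  const words = new Set<string>();
  for (const entry of entries) {
    if (isIncluded(entry)) words.add(entry.word);
  }
  return Array.from(words).sort();
}
```

Everything that does not match an exclusion rule passes through unchanged, which mirrors the "almost no curation" approach described above.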

My intention is to update this dictionary with the latest words from Wiktionary every quarter or two.

Maintainers

@kitschpatrol

Acknowledgments

The CSpell project

Thanks to Jason Dent and Street Side Software for creating and maintaining the CSpell project.

Wiktionary

This CSpell dictionary uses data from Wiktionary, which is available under the Creative Commons Attribution-ShareAlike License. Data sourced from Wiktionary is unmodified.

Kaikki.org

Thanks to Tatu Ylonen at Kaikki.org for providing pre-processed machine-readable versions of the Wiktionary data dumps.

Contributing

Issues and pull requests are welcome.

License

This extension's (very small quantity of) code and (very large quantity of) dictionary data are shared under different licenses:

Extension source code

MIT

Wiktionary dictionary data

Dual-licensed under the Creative Commons Attribution-ShareAlike 4.0 International License (CC-BY-SA) and the GNU Free Documentation License (GFDL).

See the full Wiktionary license text.