Arthasgpt NPM | npm.io

ArthasGPT

311941658-6b93b041-f30f-4121-a951-a746a19c75fc

Benefits

Scoped Knowledge: Using a generic chatbot like ChatGPT for narrow use cases like customer support, a game NPC, or writing code can yield undesired responses, or provide information outside the intended scope of knowledge. You don't want your MMORPG shopkeeper talking about about Ford F-150s or Chick-Fil-A, do you? Arthas scrapes a URL you provide as a knowledge source (usually a Wiki style web page, but could be anything - it's very flexible), and uses llamaindex to store and index that knowledge. It handles questions that fall outside of the scope of knowledge gracefully, so it will still feel like the user is interacting with a person even when it doesn't know the answer.
Distinct Personalities: Answers to questions are always rephrased from the first-person perspective in the style of a persona that you define. Because you're asked to define things like prose, tone, and even art style, Arthas is able to generate the appropriate prompts for your persona, resulting in statements the target persona would perceivably say.
Extensible: Arthas can be ran as an API, in a React app, as a CLI, or as a a dependency in your application. It uses Ollama for text so you can choose from a wide range of models, and defaults to Stable Diffusion (txt2img) for images.

Web app

You can interact with ArthasGPT via this Node/React full stack application.

CLI examples

Image quality & GUI

Note that in a default Terminal you will not see text colors and the image quality will be diminished. Using a Terminal like iTerm2 or Kitty will allow you to view the full resolution (1024x1024 by default).

In native Terminal with no addons:

Question: "what town are you from"
Answer:

With high-res image support:

Question: "what happened between you and sylvanas?"
Answer:

In verbose mode with caching:

Question: "why are you so mean"
Answer:

In verbose mode when he doesn't know the answer based on the knowledge he has:

Question: what is your favorite memory

For this one, llamaindex could not find any relevant info, resulting in this prompt fragment:

"Arthas's favorite memory is not explicitly mentioned in the context information provided."

Yet the prompt is still robust enough to provide a meaningful response in the style of Arthas:

"In the realm of my existence, a cherished memory lies concealed, veiled by the shadows of time. Its essence, though unspoken, resonates within my being. A tale of valor and darkness, woven intricately in the tapestry of my soul."

And we still get a relevant image:

Usage

Set up the environment. No API keys needed!

.env scaffold

LLM_FRAMEWORK=llamaindex
TEXT_MODEL=mistral
STABLE_DIFFUSION_URI=http://127.0.0.1:7860
IMAGE_MODEL=txt2img
DELAY=200
RENDER=true
VERBOSE=true
GREETING=false
CACHE=true
MAX_STORAGE_KEY_LENGTH=32
LOG_PREFIX=<ArthasGPT>
STORAGE_URI=./.tmp

Install Ollama

Download Ollama
Linux: curl -fsSL https://ollama.com/install.sh | sh
Windows & Mac: ollama.com/download
Run the CLI
ollama start
Find a model you like here and run it in your Terminal:
ollama run mistral

The Ollama (Mistral) API is now listening on http://localhost:11434/

Install Stable Diffusion

Have Python 3 already installed
Navigate to the desired directory and
git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git
Run the web UI
Linux & Mac: Run ./webui.sh --api --lowvram.
Windows: Run ./webui-user.bat --api --lowvram from Windows Explorer as normal, non-administrator, user.
Note: --lowvram is an optional flag, if running on a great machine (16GB+ vram) you can omit this.

The Stable Diffusion API is now listening on http://localhost:7860/

Run Arthas

npm start

Persona configuration

Pass this config object to ArthasGPT when you instantiate a new persona.

const agent = await ArthasGPT({
  cache,
  greeting,
  knowledgeURI,
  name,
  artStyle,
  writingStyle,
  writingTone,
  query
});

Custom personas

Want to go beyond Arthas? You can create a custom persona for just about anyone as long as there's an online knowledgebase to point to.

See personas.md.

Environment config

TEXT_MODEL

Example: mistral.

STABLE_DIFFUSION_URI

Example: http://127.0.0.1:7860.

DELAY

Delay between requests (in ms), for rate limiting, artificial delays, etc.

VERBOSE

Set to true to show all logs. Enable VERBOSE to see the generated prompts in your console, for example, in this case the query was "how many blood elves have you killed?":

<ArthasGPT> Text (mistral) Prompt: Re-write the following message in the first-person, as if you are Arthas, in a style that is inspiring but grim, from the year 1200 A.D., using as few characters as possible (never exceed 500), in a tone that is slightly resentful, omitting any references to Earth or real-world society: Arthas killed Sylvanas Windrunner, King Anasterian Sunstrider, and Dar'Khan Drathir, who were blood elves. So, Arthas has killed three blood elves.
<ArthasGPT> Text (mistral) responded with "I, Arthas, vanquished Sylvanas Windrunner, King Anasterian Sunstrider, and Dar'Khan Drathir, noble blood elves. Three lives claimed by my hand.".
<ArthasGPT> Waiting 2 seconds...
<ArthasGPT> Image (txt2img) Prompt: Render the following in the style of Blizzard's World of Warcraft concept art in high resolution like a finely-tuned video game model including each detail and anatomically correct features (if any): I, Arthas, vanquished Sylvanas Windrunner, King Anasterian Sunstrider, and Dar'Khan Drathir, noble blood elves. Three lives claimed by my hand.

CACHE

Set to true to cache inputs, llamaindex queries, LLM prompts, responses, & images.

The transformed input/prompt is what's cached, not the literal user input. For example, the questions "who are you", "explain who you are", and "who is arthas?" all transform to the same query ("Who is Arthas?"). The LLM responses are cached too, so you'll get the same answer when asking similar questions (but without having to request the LLM again).

MAX_STORAGE_KEY_LENGTH

How long storage keys can be. The keys are derived from queries/prompts, but there are key/value limits in localStorage and some prompts can be very long. An alternative to this config would be to make the developer provide a key (similar to React) each time remember is called, but that isn't supported right now.

STORAGE_URI

Path to a temp folder used for cache (default is ./.tmp).

Middleware

To ensure integrity, optionally integrate lifecycle middleware at 2 stages: 1. LLM query: Run the formatted prompt through another transformer (like GPT 4) 2. Transformed response: Run the final image prompt through a different image model (like DALL-E 3)

Instructions coming soon.

dotenv llamaindex node-localstorage terminal-image textract

1 year ago

1 year ago

1 year ago

1 year ago