0.3.0 • Published 2 months ago
n8n-nodes-google-speech v0.3.0
n8n-nodes-google-speech
This is an n8n community node for Google Speech API integration. It provides functionality to convert audio to text using Google's Speech-to-Text API.
Installation
Follow these instructions to install this node:
In official n8n version
- Go to Settings > Community Nodes
- Select Install
- Enter
n8n-nodes-google-speech
in Enter npm package name - Click Install
In Docker or own setup
- Install npm package:
npm install n8n-nodes-google-speech
- Add node to N8N_CUSTOM_EXTENSIONS:
export N8N_CUSTOM_EXTENSIONS="/path/to/n8n-nodes-google-speech"
Prerequisites
To use this node, you need:
- A Google Cloud account with the Speech-to-Text API enabled
- A service account with access to the Speech API
- A JSON key file for the service account
Setting up Google Cloud credentials:
- Go to the Google Cloud Console
- Create a new project or select an existing one
- Enable the Speech-to-Text API in the API Library
- Go to "IAM & Admin" > "Service Accounts"
- Create a new service account or use an existing one
- Give it the "Cloud Speech Client" role
- Create a new JSON key (Actions > Manage keys > Add key > Create new key)
- Download the JSON key file
- Paste the entire contents of the JSON file into the n8n credentials field
Usage
- Add a "Google Speech" node to your workflow
- Select the "Speech to Text" resource and "Recognize" operation
- Configure the credentials using your Service Account Key JSON
- Provide audio as a base64-encoded string (without any prefix like "data:audio/...")
- Set the language code (e.g. 'en-US', 'it-IT', 'fr-FR', etc.)
- Run the workflow to convert speech to text
Important Notes About Audio Format:
- The audio must be properly encoded in base64 format
- The audio should be in a supported format (e.g., WAV, MP3, FLAC)
- Google Speech API will attempt to auto-detect the audio format
- For best results, use audio with clear speech and minimal background noise
- Ensure your service account has the proper permissions for the Speech API
Resources
License
0.3.0
2 months ago