puppeteer-recaptcha-solver v1.0.1
puppeteer-recaptcha-solver
Google Recapctha v2 solver with puppeteer. You can simply use it in your project by passing to the constructor your
Pageobject. The solver is using SpeechToText recognition, you can use one of our integrated solvers with your API key or to provide your own solving function. You can also integrate your own logger.
Disclaimer
This is an academic project, it is not intended to be used in production. It is not recommended to use this project for any other purpose than educational. The author is not responsible for any misuse of this project.
Demo
Table of contents
Prerequisites
This project requires NodeJS (version 8 or later) and NPM. Node and NPM are really easy to install.
Getting Started
These instructions will help to you install the package in your project, set it and use it. See Contributing for notes on how to help and contribute this project.
Installation
BEFORE YOU INSTALL: please read the prerequisites
To install and set up the library, run:
$ npm install puppeteer-recaptcha-solverUsage
To use, simply create the object and execute the solve command.
Example:
(async () => {
const browser = await puppeteer.launch({
headless: false,
});
const page = await browser.newPage();
const solver = new ReCaptchaSolver({
page,
maxRetries: 3,
transcriber: Transcribers.witAI,
apiKey: 'YOUR_API_KEY'
});
await page.goto(
'https://recaptcha-demo.appspot.com/recaptcha-v2-checkbox.php'
);
const solved = await solver.solve();
console.log('Captcha solved: ', solved);
await page.screenshot({ path: 'example/example.png' });
await browser.close();
})();API
Constructor
const solver = new ReCaptchaSolver({
page,
log,
maxRetries: 3,
transcriber: Transcribers.witAI,
apiKey: 'YOUR_API_KEY'
});A constructor to the object.
Fields
Supported options for the constructor field are listed below.
| Field | Type | Default value | Required | Description |
|---|---|---|---|---|
| page | Page | Yes | puppeteer page object | |
| log | Logger | console.log | No | A logger that the solver will use. You can also use the default logger or noopLogger to disable the logs |
| transcriber | Transcriber | witAI | No | A transcriber that the solver will use to transcriber the audio to text. You can can choose between witAI or googleSpeechToText by passing Transcribers.witAI or Transcribers.googeSpeechToText or passing you own Transcriber function. |
| maxRetries | number | 3 | No | Total number of retries until the captcha is solved |
| apiKey | string | No | API key to your transcribe service |
Solve
const solved: boolean = await solver.solve();A command that will start the solving process.
Returns a Promise<boolean> to indicate if the captcha successfully solved.
General Types
| Type | Signature | Description |
|---|---|---|
| Logger | interface Logger { log(message: string): void | Promise\<void>; error(message: string): void | Promise\<void>; warn(message: string): void | Promise\<void>; info(message: string): void | Promise\<void>; debug(message: string): void | Promise\<void>;} | A logger object that the solver will use. |
| Transcriber | type Transcriber = ( audioBuffer: ArrayBuffer, apiKey?: string) => Promise\<string | null>; | A transcribe function that gets an ArrayBuffer and should return the text |
Examples
default Logger
const defaultLogger: Logger = {
log: (message: string) => console.log('[LOG]', message),
error: (message: string) => console.error('[ERROR]', message),
warn: (message: string) => console.warn('[WARN]', message),
info: (message: string) => console.info('[INFO]', message),
debug: (message: string) => console.debug('[DEBUG]', message),
};witAI Transcriber
const witAI: Transcriber = async (
audioBuffer: ArrayBuffer,
apiKey?: string
) => {
if (!apiKey) {
throw new Error('witAI transcriber requires API key');
}
const { data } = await axios.post<string>(
'https://api.wit.ai/speech?v=20220622',
audioBuffer,
{
headers: {
Authorization: `Bearer ${apiKey}`,
'Content-Type': 'audio/mpeg3',
},
}
);
const parsed =
typeof data === 'string'
? JSON.parse(data.split('\r\n').slice(-1)[0] || '{}')
: data;
return parsed?.text;
}Google SpeechToText Transcriber
const googleSpeechToText: Transcriber = async (
audioBuffer: ArrayBuffer,
apiKey?: string
) => {
if (!apiKey) {
throw new Error('googleSpeechToText transcriber requires API key');
}
const { data } = await axios.post<string>(
`https://speech.googleapis.com/v1p1beta1/speech:recognize?key=${apiKey}`,
{
config: {
encoding: 'MP3',
sampleRateHertz: 16000,
languageCode: 'en-US',
},
audio: {
content: Buffer.from(audioBuffer).toString('base64'),
},
}
);
const parsed =
typeof data === 'string'
? JSON.parse(data.split('\r\n').slice(-1)[0] || '{}')
: data;
return parsed?.results?.[0]?.alternatives?.[0]?.transcript;
};Contributing
Start with cloning this repo on your local machine:
$ git clone https://github.com/dore51/puppeteer-captcha-solver.git
$ cd puppeteer-captcha-solverTo install and set up the library, run:
$ npm installTo check that everything works
$ npm run exampleRunning the tests
$ npm testBuilding a distribution version
$ npm run buildThis task will create a distribution version of the project
inside your local lib/ folder
publishing the distribution version
$ npm publishPlease read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.
- Fork it!
- Create your feature branch:
git checkout -b my-new-feature - Add your changes:
git add . - Commit your changes:
git commit -am 'Add some feature' - Push to the branch:
git push origin my-new-feature - Submit a pull request :sunglasses:
Built With
This package has the following dependencies:
- Axios: A promise-based HTTP client for the browser and Node.js. Axios is used to make HTTP requests in the package.
The following dependencies are only required for development and testing purposes:
- Node.js: A JavaScript runtime built on Chrome's V8 JavaScript engine. Node.js is required to run the package.
- Puppeteer: A Node library that provides a high-level API to control Chrome or Chromium over the DevTools Protocol. Puppeteer is used to automate and control the browser in order to solve the reCAPTCHA challenge.
- Prettier: A code formatter that enforces a consistent style across the codebase.
- Jest: A testing framework for JavaScript.
- puppeteer-screen-recorder: A utility for recording screencasts of a Puppeteer page.
- TSLint: A static analysis tool that checks TypeScript code for readability, maintainability, and functionality errors.
Authors
- Dor Eitan - Github
See also the list of contributors who participated in this project.
License
MIT License © Dor Eitan