1.0.0 • Published 2 years ago
amplify-portals-crawler v1.0.0
Amplify Portals' Crawler
Environment variables
Variable | Default value | Description |
---|---|---|
APS_CRAWLER_PORT | 3000 | The port that used to communicate with the Crawler API |
APS_CRAWLER_API_KEY | N/A | Admin token to start a crawling session |
APS_CRAWLER_CONCURRENCY_DOCS | 100 | (HTTP Requests) Number of pages that could crawled at give time |
APS_CRAWLER_CONCURRENCY_DEVBLOG | 2 | (Headless browser) Number of pages that could crawled at give time |
APS_CRAWLER_CONCURRENCY_BLOG | 2 | (Headless browser) Number of pages that could crawled at give time |
APS_CRAWLER_CONCURRENCY_COMMUNITY | 5 | (Headless browser) Number of pages that could crawled at give time |
APS_CRAWLER_CONCURRENCY_KBARTICLES | 5 | (HTTP Requests) Number of pages that could crawled at give time |
APS_CRAWLER_ES_HOST | N/A | Elasticsearch host URL |
APS_CRAWLER_ES_USERNAME | N/A | Elasticsearch username |
APS_CRAWLER_ES_PASSWORD | N/A | Elasticsearch password |
APS_CRAWLER_CHROME_BIN | N/A | - This is optional - Headless browser location. It's used by the DockerFile |
1.0.0
2 years ago