1.0.0 • Published 2 years ago

amplify-portals-crawler v1.0.0

Weekly downloads
-
License
ISC
Repository
-
Last release
2 years ago

Amplify Portals' Crawler

Environment variables

VariableDefault valueDescription
APS_CRAWLER_PORT3000The port that used to communicate with the Crawler API
APS_CRAWLER_API_KEYN/AAdmin token to start a crawling session
APS_CRAWLER_CONCURRENCY_DOCS100(HTTP Requests) Number of pages that could crawled at give time
APS_CRAWLER_CONCURRENCY_DEVBLOG2(Headless browser) Number of pages that could crawled at give time
APS_CRAWLER_CONCURRENCY_BLOG2(Headless browser) Number of pages that could crawled at give time
APS_CRAWLER_CONCURRENCY_COMMUNITY5(Headless browser) Number of pages that could crawled at give time
APS_CRAWLER_CONCURRENCY_KBARTICLES5(HTTP Requests) Number of pages that could crawled at give time
APS_CRAWLER_ES_HOSTN/AElasticsearch host URL
APS_CRAWLER_ES_USERNAMEN/AElasticsearch username
APS_CRAWLER_ES_PASSWORDN/AElasticsearch password
APS_CRAWLER_CHROME_BINN/A- This is optional - Headless browser location. It's used by the DockerFile