puppeteer-server-side-render v1.0.4
Puppeteer Server Side Render
A server side render service based on puppeteer
This is a puppeteer(chrome headless) server side render service.
Feature
- Can limit render origin
- Cache
Dependencies
How server side render work?
Before use this service, you must know how server side render work.
| Step | Role | File path | Do |
|---|---|---|---|
| 1 | Proxy(.htaccess) | dist/.htaccess | Detect origin is crawler or not by checking user agent. |
| 2 | Middleware(ssr.php) | dist/ssr.php | Send the request with page's url to this service's http server. |
| 3 | Puppeteer | :x: | If origin is valid, it will trigger server side render crawler(puppeteer) start. |
| 4 | Response | :x: | The http server of this service will return response with render result. |
| 5 | Middleware(ssr.php) | dist/ssr.php | Render the result to crawler. |
Install Google Chrome
Skip this step if you has install chrome browser
sudo apt-get install libxss1 libappindicator1 libindicator7 -y
wget https://dl.google.com/linux/direct/google-chrome-stable_current_amd64.deb
sudo dpkg -i google-chrome*.deb
sudo apt-get install -f
rm google-chrome-stable_current_amd64.debStart service
Clone repository
git clone git@github.com:ciao-chung/puppeteer-server-side-render.gitGo into Server folder and install node modules
cd Prod/Server
yarnSetup config by coping the example config
In Prod/Server/static
cp config.example.json config.jsonStart server
In Prod/Server
npm run startConfiguration
Prod/Server/static/config.json
Example
{
"allowOrigin": [
"http://localhost:8081", "https://foo.bar"
],
"cache": {
"ttl": 60,
"maxsize": 1000
},
"debug": true
}- port(optional): Number, port of Node.js express app, default is 3000.
- host(optional): String, host of Node.js express app, default is 'localhost'.
- allowOrigin(required): String/Array, allow origin, you can set it as * if you don't want to limit any origin.
- timeout(optional): Number, if client don't trigger server side render service in this timeout, crawler will auto get page result and response, default is 5000ms, at most 15000ms.
- cache(optional): Object, configure cache feature.
- ttl(optional): Number, time to life of cache(minutes), default is 1 minute.
- maxsize(optional): Number, maxsize of cache file on disk(Kilobyte), default is 1MB.
- path(optional): String, cache file store path, default is 'cache'.
- debug(optional): Boolean, debug mode, it will open chrome without headless mode.
Client side(web)
Installation
npm
npm install puppeteer-server-side-render --saveyarn
yarn add puppeteer-server-side-renderCopy proxy(.htaccess) and middleware(ssr.php) to web root
You can find them in puppeteer-server-side-render in node_modules
cd node_modules/puppeteer-server-side-render/dist/Use client library in web
We provide a client side library to trigger server side render service
import ServerSideRenderClient from 'puppeteer-server-side-render'
ServerSideRenderClient()
// when your all async data are ready and render
SSR.done()
// when your page is in error type
SSR.error()
// when you want to custom error status code in error page
SSR.error(403)Apache configuration
Enable apache proxy/proxy_http modules
sudo a2enmod proxy
sudo a2enmod proxy_http
sudo service apache2 restartSetup domain
<VirtualHost *:80>
ServerName example.com
ServerAlias www.example.com
ProxyRequests Off
ProxyPreserveHost On
ProxyVia Full
<Proxy *>
Require all granted
</Proxy>
<Location />
ProxyPass http://localhost:3000/
ProxyPassReverse http://127.0.0.1:3000
</Location>
</VirtualHost>Enable domain and restart apache
sudo a2ensite example.com.conf
sudo service apache2 restartManage service with PM2
PM2 is an advanced Node.js process manager.
You can manage server side render service easily by using PM2.
Installation
sudo yarn global add pm2Management
# start service
pm2 start app.js --name "ssr" --cwd==/path-to-ssr
# stop service
pm2 stop ssr
# delete service
pm2 delete ssr
# show status
pm2 status ssr
# show log
pm2 log ssr