@ex7r3me/loopback-connector-es v1.4.3
loopback-connector-elastic-search
Basic Elasticsearch datasource connector for Loopback.
Table of Contents generated with DocToc
- Overview
- Install this connector in your loopback app
- Configuring connector
- About the example app
- How to achieve Instant search
- Troubleshooting
- Testing
- Contributing
- Frequently Asked Questions
- Release notes
Overview
libdirectory has the entire source code for this connector- this is what gets downloaded to your
node_modulesfolder when you runnpm install loopback-connector-es --save --save-exact
- this is what gets downloaded to your
examplesdirectory has a loopback app which uses this connector- this is not published to NPM, it is only here for demo purposes
- it will not be downloaded to your
node_modulesfolder! - similarly the
examples/server/datasources.jsonandexamples/server/datasources.<env>.jsfiles are there for this demo app to use - you can copy their content over to
<yourApp>/server/datasources.jsonor<yourApp>/server/datasources.<env>.jsif you want and edit it there but don't start editing the files insideexamples/serveritself and expect changes to take place in your app!
- it will not be downloaded to your
- this is not published to NPM, it is only here for demo purposes
testdirectory has unit tests- it does not reuse the loopback app from the
examplesfolder - instead, loopback and ES/datasource are built and injected programatically
- this directory is not published to NPM.
- Refer to
.npmignoreif you're still confused about what's part of the published connector and what's not.
- Refer to
- it does not reuse the loopback app from the
- You will find the
datasources.jsonfiles in this repo mention various configurations:elasticsearch-sslelasticsearch-plaindb- You don't need them all! They are just examples to help you see the various ways in which you can configure a datasource. Delete the ones you don't need and keep the one you want. For example, most people will start off with
elasticsearch-plainand then move on to configuring the additional properties that are exemplified inelasticsearch-ssl. You can mix & match if you'd like to have mongo and es and memory, all three! These are basics of the "connector" framework in loooback and not something we added.
- Don't forget to edit your
model-config.jsonfile and point the models at thedataSourceyou want to use.
Install this connector in your loopback app
cd <yourApp>
npm install loopback-connector-es --save --save-exactConfiguring connector
Required:
- host: Elasticsearch engine host address.
- port: Elasticsearch engine port.
- name: Connector name.
- connector: Elasticsearch driver.
- index: Search engine specific index.
- apiVersion: specify the major version of the Elasticsearch nodes you will be connecting to.
Recommended:
- mappings: an array of elasticsearch mappings for your various loopback models.
- if your models are spread out across different indexes then you can provide an additional
indexfield as an override for your model - if you don't want to use
type:ModelNameby default, then you can provide an additionaltypefield as an override for your model
- if your models are spread out across different indexes then you can provide an additional
Optional:
- log: sets elasticsearch client's logging, you can refer to the docs here
- defaultSize: total number of results to return per page.
- refreshOn optional array with method names you want to set refresh option as true
- requestTimeout: this value is in milliseconds
- ssl: useful for setting up a secure channel
- protocol: can be
httporhttps(httpis the default if none specified) ... must behttpsif you're usingssl - auth: useful if you have access control setup via services like
es-jettyorfoundorshield - amazonES: configuration for
http-aws-esNOTE: The package needs to be installed in your project. Its not part of this Connector.
Sample:
Edit datasources.json and set:
"db": { "connector": "es", "name": "<name>", "index": "<index>", "hosts": [ { "protocol": "http", "host": "127.0.0.1", "port": 9200, "auth": "username:password" } ], "apiVersion": "<apiVersion>", "refreshOn": ["save","create", "updateOrCreate"], "log": "trace", "defaultSize": <defaultSize>, "requestTimeout": 30000, "ssl": { "ca": "./../cacert.pem", "rejectUnauthorized": true }, "amazonES": { "region": "us-east-1", "accessKey": "AKID", "secretKey": "secret" }, "mappings": [ { "name": "UserModel", "properties": { "realm": {"type": "string", "index" : "not_analyzed" }, "username": {"type": "string", "index" : "not_analyzed" }, "password": {"type": "string", "index" : "not_analyzed" }, "email": {"type": "string", "analyzer" : "email" } } }, { "name": "CoolModel", "index": <useSomeOtherIndex>, "type": <overrideTypeName>, "properties": { "realm": {"type": "string", "index" : "not_analyzed" }, "username": {"type": "string", "index" : "not_analyzed" }, "password": {"type": "string", "index" : "not_analyzed" }, "email": {"type": "string", "analyzer" : "email" } } } ], "settings": { "analysis": { "filter": { "email": { "type": "pattern_capture", "preserve_original": 1, "patterns": [ "([^@]+)", "(\\p{L}+)", "(\\d+)", "@(.+)" ] } }, "analyzer": { "email": { "tokenizer": "uax_url_email", "filter": ["email", "lowercase", "unique"] } } } } }You can peek at
/examples/server/datasources.jsonfor more hints.
About the example app
- The
examplesdirectory contains a loopback app which uses this connector. - You can point this example at your own elasticsearch instance or use the quick instances provided via docker.
Run both example and ES in docker
As a developer, you may want a short lived ES instance that is easy to tear down when you're finished dev testing. We recommend docker to facilitate this.
Pre-requisites You will need docker-engine and docker-compose installed on your system.
Step-1
- Set desired versions for node and Elasticsearch
- here are the valid values to use for Node
- here are the valid values to use for Elasticsearch
# combination of node v0.10.46 with elasticsearch v1
export NODE_VERSION=0.10.46
export ES_VERSION=1
echo 'NODE_VERSION' $NODE_VERSION && echo 'ES_VERSION' $ES_VERSION
# similarly feel free to try relevant combinations:
## of node v0.10.46 with elasticsearch v2
## of node v0.12 with elasticsearch v2
## of node v0.4 with elasticsearch v2
## of node v5 with elasticsearch v2
## elasticsearch v5 will probably not work as there isn't an `elasticsearch` client for it, as of this writing
## etc.Step-2
- Run the setup with
docker-composecommands.
git clone https://github.com/strongloop-community/loopback-connector-elastic-search.git myEsConnector
cd myEsConnector/examples
npm install
docker-compose upStep-3
- Visit
localhost:3000/explorerand you will find our example loopback app running there.
Run example locally and ES in docker
- Empty out
examples/server/datasources.jsonso that it only has the following content remaining:{} - Set the
NODE_ENVenvironment variable on your local/host machine- Set the environment variable
NODE_ENV=sample-es-plain-1if you want to useexamples/server/datasources.sample-es-plain-1.js - Set the environment variable
NODE_ENV=sample-es-plain-2if you want to useexamples/server/datasources.sample-es-plain-2.js - Set the environment variable
NODE_ENV=sample-es-ssl-1if you want to useexamples/server/datasources.sample-es-ssl-1.js- a sample docker instance for this hasn't been configured yet, so it doesn't work out-of-the-box, use it only as readable (not runnable) reference material for now
- You can configure your own
datasources.jsonordatasources.<env>.jsbased on what you learn from these sample files.- Technically, to run the example, you don't need to set
NODE_ENVif you won't be configuring via the.<env>.jsfiles ... configuring everything withindatasources.jsonis perfectly fine too. Just remember that you will lose the ability to have inline comments and will have to use double-quotes if you stick with.json
- Technically, to run the example, you don't need to set
- Set the environment variable
Start elasticsearch version 1.x and 2.x using:
git clone https://github.com/strongloop-community/loopback-connector-elastic-search.git myEsConnector cd myEsConnector docker-compose -f docker-compose-for-tests.yml up # in another terminal window or tab cd myEsConnector/examples npm install DEBUG=boot:test:* node server/server.jsVisit
localhost:3000/explorerand you will find our example loopback app running there.
Run example locally
Install dependencies and start the example server
git clone https://github.com/strongloop-community/loopback-connector-elastic-search.git myEsConnector cd myEsConnector/examples npm install- Don't forget to create an index in your ES instance:
curl -X POST https://username:password@my.es.cluster.com/shakespeare - If you mess up and want to delete, you can use:
curl -X DELETE https://username:password@my.es.cluster.com/shakespeare - Don't forget to set a valid value for
apiVersionfield inexamples/server/datasources.jsonthat matches the version of ES you are running.
- Don't forget to create an index in your ES instance:
Set up a
cacert.pemfile for communicating securely (https) with your ES instance. Download the certificate chain for your ES server using this sample (will need to be edited to use your provider) command:cd myEsConnector openssl s_client -connect my.es.cluster.com:9243 -showcerts | tee cacert.pem- The command may not self terminate so you may need to use
ctrl+c - It will be saved at the base of your cloned project
- Sometimes extra data is added to the file, you should delete everything after the following lines:
--- No client certificate CA names sent ---- The command may not self terminate so you may need to use
Run:
cd myEsConnector/examples DEBUG=boot:test:* node server/server.js- The
examples/server/boot/boot.jsfile will automatically populate data for UserModels on your behalf when the server starts.
- The
Open this URL in your browser: http://localhost:3000/explorer
- Try fetching all the users via the rest api console
- You can dump all the data from your ES index, via cmd-line too:
curl -X POST username:password@my.es.cluster.com/shakespeare/_search -d '{"query": {"match_all": {}}}'
- To test a specific filter via GET method, use for example:
{"q" : "friends, romans, countrymen"}
How to achieve Instant search
From version 1.3.4, refresh option is added which support's instant search after create and update. This option is configurable and one can activate or deactivate it according to their need. By default refresh is true which makes response to come only after documents are indexed(searchable).
To know more about refresh go through this article
Ways to configure refresh
Datasource File: Pass refreshOn array from datasource file including methods name in which you want this to be true
"es": {
"name": "es",
"refreshOn": ["save","create", "updateOrCreate"],
.....Model.json file: Configurable on per model and operation level (true, false, wait_for)
"elasticsearch": {
"create": {
"refresh": false
},
"destroy": {
"refresh": false
},
"destroyAll": {
"refresh": "wait_for"
}
}NOTE:- While a refresh is useful, it still has a performance cost. A manual refresh can be useful, but avoid manual refresh every time you index a document in production; it will hurt your performance. Instead, your application needs to be aware of the near real-time nature of Elasticsearch and make allowances for it.
Troubleshooting
- Do you have both
elasticsearch-sslandelasticsearch-plainin yourdatasources.jsonfile? You just need one of them (not both), based on how you've setup your ES instance. - Did you forget to set
model-config.jsonto point at the datasource you configured? Maybe you are using a different or misspelled name than what you thought you had! - Did you forget to set a valid value for
apiVersionfield indatasources.jsonthat matches the version of ES you are running? - Maybe the version of ES you are using isn't supported by the client that this project uses. Try removing the
elasticsearchsub-dependency from<yourApp>/node_modules/loopback-connector-es/node_modulesfolder and then install the latest client:cd <yourApp>/node_modules/loopback-connector-es/node_modules- then remove the
elasticsearchfolder- unix/mac quickie:
rm -rf elasticsearch
- unix/mac quickie:
npm install --save --save-exact https://github.com/elastic/elasticsearch-js.git- to "academically" prove to yourself that this will work with the new install:
- on unix/mac you can quickly dump the supported versions to your terminal with:
cat elasticsearch/package.json | grep -A 5 supported_es_branches - on other platforms, look into the
elasticsearch/package.jsonand search for thesupported_es_branchesjson block.
- on unix/mac you can quickly dump the supported versions to your terminal with:
- go back to yourApp's root directory
- unix/mac quickie:
cd <yourApp>
- unix/mac quickie:
- And test that you can now use the connector without any issues!
- These changes can easily get washed away for several reasons. So for a more permanent fix that adds the version you want to work on into a release of this connector, please look into Contributing.
Testing
- You can edit
test/resource/datasource-test.jsonto point at your ES instance and then runnpm test - If you don't have an ES instance and want to leverage docker based ES instances then:
- If you want to run all tests across all versions in one go, then:
- Run
docker-compose -f docker-compose-for-testing-all.yml up - Then run
npm test - When you're finished and want to tear down the docker instances, run:
docker-compose -f docker-compose-for-testing-all.yml down
- Run
- You can test a specific version of elasticsearch if you want
- elasticsearch version 1.x
- Run
docker-compose -f docker-compose-for-testing-v1.yml up - Then run
npm run testv1- To run tests with additional logging, use:
DEBUG=test:es-v1:* npm run testv1DEBUG=test:es-v1:*,loopback:connector:elasticsearch npm run testv1
- Troubleshoot test with node-inspector if the level of details is still not enough:
npm run testv1 -- --debug-brkDEBUG=test:es-v1:* npm run testv1 -- --debug-brkDEBUG=test:es-v1:*,loopback:connector:elasticsearch npm run testv1 -- --debug-brk
- To run tests with additional logging, use:
- When you're finished and want to tear down the docker instances, run:
docker-compose -f docker-compose-for-testing-v1.yml down
- Run
- elasticsearch version 2.x
- Run
docker-compose -f docker-compose-for-testing-v2.yml up - Then run
npm run testv2- To run tests with additional logging, use:
DEBUG=test:es-v2:* npm run testv2DEBUG=test:es-v2:*,loopback:connector:elasticsearch npm run testv2
- Troubleshoot test with node-inspector if the level of details is still not enough:
npm run testv2 -- --debug-brkDEBUG=test:es-v2:* npm run testv2 -- --debug-brkDEBUG=test:es-v2:*,loopback:connector:elasticsearch npm run testv2 -- --debug-brk
- To run tests with additional logging, use:
- When you're finished and want to tear down the docker instances, run:
docker-compose -f docker-compose-for-testing-v2.yml down
- Run
- elasticsearch version 5.x
- Run
docker-compose -f docker-compose-for-testing-v5.yml up - Then run
npm run testv5- To run tests with additional logging, use:
DEBUG=test:es-v5:* npm run testv5DEBUG=test:es-v5:*,loopback:connector:elasticsearch npm run testv5
- Troubleshoot test with node-inspector if the level of details is still not enough:
npm run testv5 -- --debug-brkDEBUG=test:es-v5:* npm run testv5 -- --debug-brkDEBUG=test:es-v5:*,loopback:connector:elasticsearch npm run testv5 -- --debug-brk
- To run tests with additional logging, use:
- When you're finished and want to tear down the docker instances, run:
docker-compose -f docker-compose-for-testing-v5.yml down
- Run
- elasticsearch version 1.x
- If you want to run all tests across all versions in one go, then:
Contributing
- Feel free to contribute via PR or open an issue for discussion or jump into the gitter chat room if you have ideas.
- I recommend that project contributors who are part of the team:
- should merge
masterintodevelop... if they are behind, before starting thefeaturebranch - should create
featurebranches from thedevelopbranch - should merge
featureintodevelopthen create areleasebranch to:- update the changelog
- close related issues and mention release version
- update the readme
- fix any bugs from final testing
- commit locally and run
npm-release x.x.x -m "<some comment>" - merge
releaseinto bothmasteranddevelop - push
masteranddevelopto GitHub
- should merge
- For those who use forks:
- please submit your PR against the
developbranch, if possible - if you must submit your PR against the
masterbranch ... I understand and I can't stop you. I only hope that there is a good reason likedevelopnot being up-to-date withmasterfor the work you want to build upon.
- please submit your PR against the
npm-release <versionNumber> -m <commit message>may be used to publish. Pubilshing to NPM should happen from themasterbranch. It should ideally only happen when there is something release worthy. There's no point in publishing just because of changes totestorexamplesfolder or any other such entities that aren't part of the "published module" (refer to.npmignore) to begin with.
FAQs
- How do we enable or disable the logs coming from the underlying elasticsearch client? There may be a need to debug/troubleshoot at times.
- How do we enable or disable the logs coming from this connector?
- By default if you do not set the following env variable, they are disabled:
DEBUG=loopback:connector:elasticsearch- For example, try running tests with and without it, to see the difference:
- with:
DEBUG=loopback:connector:elasticsearch npm test - without:
npm test
- with:
- For example, try running tests with and without it, to see the difference:
- By default if you do not set the following env variable, they are disabled:
- What are the tests about? Can you provide a brief overview?
- Tests are prefixed with
01or02etc. in order to run them in that order by leveraging default alphabetical sorting. - The
02.basic-querying.test.jsfile uses two models to test various CRUD operations that any connector must provide, likefind(), findById(), findByIds(), updateAttributes()etc.- the two models are
UserandCustomer - their ES mappings are laid out in
test/resource/datasource-test.json - their loopback definitions can be found in the first
beforeblock that performs setup in02.basic-querying.test.jsfile ... these are the equivalent of aMyModel.jsonin your real loopback app.- naturally, this is also where we define which property serves as the
idfor the model and if its generated or not
- naturally, this is also where we define which property serves as the
- the two models are
- Tests are prefixed with
- How do we get elasticserch to take over ID generation?
- An automatically generated id-like field that is maintained by ES is
_uid. Without some sort of es-field-level-scripting-on-index (if that is possible at all) ... I am not sure how we could ask elasticsearch to take over auto-generating an id-like value for any arbitrary field! So the connector is setup such that addingid: {type: String, generated: true, id: true}will tell it to use_uidas the actual field backing theid... you can keep using the doingmodel.idabstraction and in the background_uidvalues are mapped to it. - Will this work for any field marked as with
generated: trueandid: true?- No! The connector isn't coded that way right now ... while it is an interesting idea to couple any such field with ES's
_uidfield inside this connector ... I am not sure if this is the right thing to do. If you hadobjectId: {type: String, generated: true, id: true}then you won't find a realobjectIdfield in your ES documents. Would that be ok? Wouldn't that confuse developers who want to write custom queries and run 3rd party app against their ES instance? Don't useobejctId, use_uidwould have to be common knowledge. Is that ok?
- No! The connector isn't coded that way right now ... while it is an interesting idea to couple any such field with ES's
- An automatically generated id-like field that is maintained by ES is
Release notes
- Release
1.0.6of this connector updates the underlying elasticsearch client version to11.0.1 - For this connector, you can configure an
indexname for your ES instance and the loopback model's name is conveniently/automatically mapped as the EStype. Users must setup
stringfields asnot_analyzedby default for predictable matches just like other loopback backends. And if more flexibility is required, multi-field mappings can be used too."name" : { "type" : "multi_field", "fields" : { "name" : {"type" : "string", "index" : "not_analyzed"}, "native" : {"type" : "string", "index" : "analyzed"} } } ... // this will treat 'George Harrison' as 'George Harrison' in a search User.find({order: 'name'}, function (err, users) {..} // this will treat 'George Harrison' as two tokens: 'george' and 'harrison' in a search User.find({order: 'name', where: {'name.native': 'Harrison'}}, function (err, users) {..}Release
1.3.4add's support for updateAll for elasticsearchv-2.3and above. To make updateAll work you will have to add below options in yourelasticsearch.ymlconfig filescript.inline: true script.indexed: true script.engine.groovy.inline.search: on script.engine.groovy.inline.update: onTBD