0.0.11 • Published 10 years ago

mongo-fast-join v0.0.11

Weekly downloads
45
License
-
Repository
github
Last release
10 years ago

##mongo-fast-join

Join sub documents from other collections into the original query documents and do it as fast as mongo can.

###Intro

This is our stab at the mongo join problem. As you know, joins are not supported by MongoDB natively, and this can be a pain. We sought to create a project which performs a join query on mongodb and does it quickly. After a few attempts with less than stellar results, we arrived at the current implementation.

####How did we do it?

mongo-fast-join is fast because we paginate our join queries. Each document that is going to be joined to another document represents a unique query. This is accomplished with an $or clause. When dealing with only 10,000 original documents, this was miserably slow, taking up to a minute to return results on the local network. What a useless tool this would be if it took that long to join a measly 10,000 records. Turns out that splitting query into small queries with only 5 conditions in each $or clause sped the performance up by many orders of magnitude, joining 10,000 documents in less than 200ms.

We think that the reason the single query performed so poorly is because the $or clause was not intended to handle 10,000 conditions. It also seems that the query is executed in a single thread (just a guess).

####Shut up and take my query!

This is the syntax we arrived at:

var MJ = require("mongo-fast-join"),
    mongoJoin = new MJ();
        
mongoJoin
    .query(
      //say we have blog posts and we store comments in a separate collection
      db.collection("blog_posts"),
        {}, //query statement
        {}, //fields
        {
            limit: 10000//options
        }
    )
    .join({
        joinCollection: db.collection("comments"),
        //respect the dot notation, multiple keys can be specified in this array
        leftKeys: ["comment_ids"],
        //This is the key of the document in the right hand document
        rightKeyPropertyPaths: ["_id"],
        //This is the new subdocument that will be added to the result document
        newKey: "comments",
        pageSize: 25,//would recommend experimenting, to find the best size
    })
    .join({
        //say that we want to get the users that commented too
        joinCollection: db.collection("users"),
        //This is cool, you can join on the new documents you add to the source document
        leftKeys: ["comments.userId", "comments.userName"],
        rightKeyPropertyPaths: ["_id", "name"],
        //unfortunately, as of now, you can only add subdocuments at the root level, not to arrays of subdocuments
        newKey: "users_participating",
        pageSize: 25
    })
    //Call exec to run the compiled query and catch any errors and results, in the callback
    .exec(function (err, items) {
        
        console.log(items);
    });

The resulting document should have 10000 blog posts with the comments and participating users attached to each relevant post in a structure like this:

{
    title: "The title of the hypothetical blog post",
    body: "The body of the hypothetical blog post"
    date: "tomorrow",//isoDate format!
    comment_ids: [1],
    
    comments: {//LOOK: This subdocument will be an object if there is only one match and an array if there are many matches
        _id: 1,
        userId: "someId",
        userName: "Grizzly Adams"
    },
    users_participating: {
        _id: "someId",
        name: "Grizzly Adams"
    }

}
0.0.11

10 years ago

0.0.10

10 years ago

0.0.9

10 years ago

0.0.8

11 years ago

0.0.7

11 years ago

0.0.6

11 years ago

0.0.5

11 years ago

0.0.4

11 years ago

0.0.3

11 years ago