10 December 2024: If you are using angular-expressions to parse expressions such as {user.name} or {#users.length > 10}, please upgrade to angular-expressions@1.4.3 as soon as possible for security reasons (see the GitHub issue).

Deep dive into how to write a parser yourself

This article is for very advanced users. Most users will find that the angular-parser option already covers their needs.

Here's an example of how you could write a parser yourself :

Simple Parser example for [lower] and [upper]

Here's an example parser that allows you to lowercase or uppercase the data if writing your tag as: {user[lower]} or {user[upper]}:

const options = {
    parser(tag) {
        // tag can be "user[lower]", "user", or "user[upper]"
        const lowerRegex = /\[lower\]$/;
        const upperRegex = /\[upper\]$/;
        let changeCase = "";
        if (lowerRegex.test(tag)) {
            changeCase = "lower";
            // transform tag from "user[lower]" to "user"
            tag = tag.replace(lowerRegex, "");
        }
        if (upperRegex.test(tag)) {
            changeCase = "upper";
            // transform tag from "user[upper]" to "user"
            tag = tag.replace(upperRegex, "");
        }
        return {
            get(scope) {
                let result = null;
                // scope will be {user: "John"}
                if (tag === ".") {
                    result = scope;
                } else {
                    // Here we use the property "user" of the object {user: "John"}
                    result = scope[tag];
                }

                if (typeof result === "string") {
                    if (changeCase === "upper") {
                        return result.toUpperCase();
                    } else if (changeCase === "lower") {
                        return result.toLowerCase();
                    }
                }
                return result;
            },
        };
    },
    paragraphLoop: true,
    linebreaks: true,
};
new Docxtemplater(zip, options);

Explanation of the internals of the parser

To understand this option better, it is good to first know how docxtemplater manages the scope.

Whenever docxtemplater needs to render any tag, for example {name}, docxtemplater will use a scopemanager to retrieve the value for a given tag.

The scopemanager internally does the following:

  • it compiles the tag, by calling parser('name') where 'name' is the string representing what is inside the docxtemplater tag. For loop tags, if the tag is {#condition}, the passed string is only condition (it does not contain the #).

    The compilation of that tag should return an object containing a function at the get property.

  • whenever the tag needs to be rendered, docxtemplater calls parser('name').get({name: 'John'}), if {name: 'John'} is the current scope.
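The two phases above (compile the tag once, then call get with the current scope) can be tried without any document. The following is a minimal sketch that calls a default-style parser by hand; it only mimics the steps described here and is not docxtemplater's internal code.

```javascript
// A default-style parser: compile the tag once, then evaluate it
// against whichever scope is current.
function parser(tag) {
    return {
        get(scope) {
            return tag === "." ? scope : scope[tag];
        },
    };
}

// Phase 1: compilation. For {#condition}, only "condition" would be passed.
const compiledName = parser("name");

// Phase 2: evaluation against the current scope.
const fromJohn = compiledName.get({ name: "John" });
const fromEmpty = compiledName.get({ hobby: "Football" });

console.log(fromJohn); // "John"
console.log(fromEmpty); // undefined
```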

When inside a loop, for example:

{#users}
{name}
{/users}

There are several "scopes" in which it is possible to evaluate the {name} property.

The "deepest" scope is always evaluated first, so if the data is:

doc.render({
    users: [
        { name: "John", hobby: "Basketball" },
        { hobby: "Football" },
    ],
    name: "Mary",
});

docxtemplater first calls parser('name').get({ name: "John", hobby: "Basketball" }). If the value returned by the .get method is null or undefined, docxtemplater calls the same parser one level up, and repeats this until it reaches the root scope.

If the root scope also returns null or undefined for the .get call, then the value from the nullGetter is used.
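The traversal described above can be sketched as a small loop. Here resolve is a hypothetical helper written for this illustration (it is not part of docxtemplater's API), and the nullGetter is a stand-in that returns a fixed string.

```javascript
// Hypothetical helper: walks the scope list from the deepest scope up to
// the root and returns the first value that is neither null nor undefined.
function resolve(tag, scopeList, parser, nullGetter) {
    const compiled = parser(tag);
    for (let i = scopeList.length - 1; i >= 0; i--) {
        const value = compiled.get(scopeList[i]);
        if (value !== null && value !== undefined) {
            return value;
        }
    }
    // Every scope, including the root, returned null or undefined
    return nullGetter();
}

const parser = (tag) => ({ get: (scope) => scope[tag] });
const nullGetter = () => "(no value)"; // stand-in for the real nullGetter

const data = {
    users: [
        { name: "John", hobby: "Basketball" },
        { hobby: "Football" },
    ],
    name: "Mary",
};

// {name} inside {#users}: one scope list per iteration
const first = resolve("name", [data, data.users[0]], parser, nullGetter);
const second = resolve("name", [data, data.users[1]], parser, nullGetter);
const missing = resolve("age", [data, data.users[1]], parser, nullGetter);

console.log(first); // "John"
console.log(second); // "Mary" (found in the root scope)
console.log(missing); // "(no value)" (supplied by the nullGetter stand-in)
```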

As a second argument to the parser() call, you receive additional metadata about the tag in the document (you can, for example, test whether it is a loop tag).

As a second argument to the get() call, you receive more meta data about the scope, including the full scopeList.

Let's take an example. If your template is:

{#users}
{name}
{/users}

And you call

doc.render({
    users: [
        { name: "John", hobby: "Basketball" },
        { name: "Mary", hobby: "Football" },
    ],
});

Then you will have the following:

const options = {
    // This is how docxtemplater is configured by default
    parser(tag, meta) {
        /*
         * tag is the string "users", or whatever you have put inside the
         * brackets, eg if your tag was {a==b}, then the value of tag would be
         * "a==b"
         * meta will have the following shape :
         * {
         *    tag: {
         *      type: 'placeholder',
         *      module: 'loop',
         *      inverted: false,
         *      value: 'users',
         *      offset: 6,
         *      endLindex: 29,
         *      lIndex: 4,
         *      raw: '#users',
         *      sectPrCount: 0,
         *      lastParagrapSectPr: '',
         *      subparsed: [
         *        [Object],
         *        [Object],
         *        [Object],
         *        [Object],
         *        [Object],
         *        [Object],
         *        [length]: 6
         *      ]
         *    }
         * }
         */

        return {
            get(scope, context) {
                /*
                 * When evaluating the nested tag "name" in the first
                 * iteration, the scope will be :
                 *
                 *   { name: "John", hobby: "Basketball" },
                 *
                 * The context variable has following properties :
                 *
                 *   {
                 *      scopeList: [ { users: [Array] }, { name: "John", hobby: "Basketball" }],
                 *      resolved: undefined,
                 *      scopePath: [ 'users'],
                 *      scopeTypes: [ 'array'],
                 *      scopePathItem: [ 0],
                 *      scopePathLength: [ 2]
                 *   }
                 */

                if (tag === ".") {
                    return scope;
                }
                // Here we return the property "users" of the object {users: [....]}
                return scope[tag];
            },
        };
    },
};
const doc = new Docxtemplater(zip, options);
doc.render(/* data */);
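As a small illustration of the meta argument, a parser could note which tags are loop tags by looking at meta.tag.module, the field shown in the comment above. This is only a sketch: the meta objects below are written by hand and reduced to a few fields, and the loopTags array is a hypothetical name used for the example.

```javascript
// Collects the tags that were compiled as loop tags.
const loopTags = [];

function parser(tag, meta) {
    // meta may be absent when the parser is called by hand
    const isLoop = Boolean(meta && meta.tag && meta.tag.module === "loop");
    if (isLoop) {
        loopTags.push(tag);
    }
    return {
        get(scope) {
            return tag === "." ? scope : scope[tag];
        },
    };
}

// Hand-written meta objects (illustrative, not produced by docxtemplater)
parser("users", {
    tag: { type: "placeholder", module: "loop", value: "users" },
});
parser("name", { tag: { type: "placeholder", value: "name" } });

console.log(loopTags); // [ 'users' ]
```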

Angular Parser

A very useful parser is the angular-expressions parser, which can evaluate expressions such as {user.name} or {#users.length > 10}.

See angular parser for comprehensive documentation.

Deep Dive on the parser

The parser's get function is given two arguments: the current scope and a context object.

For the template

Hello {#users}{.}{/}

Using the following data:

doc.render({
    users: ["Mary", "John"],
});

And with this parser

const options = {
    // Same lookup as the default parser, with logging added
    parser(tag) {
        return {
            get(scope, context) {
                console.log(scope);
                console.log(context);
                return tag === "." ? scope : scope[tag];
            },
        };
    },
};
const doc = new Docxtemplater(zip, options);
doc.render(/* data */);

For the tag . in the first iteration, the arguments will be:

const scope = "Mary";
const context = {
    num: 1, // This corresponds to the level of the nesting,
    // the {#users} tag is level 0, the {.} is level 1
    scopeList: [
        {
            users: ["Mary", "John"],
        },
        "Mary",
    ],
    scopePath: ["users"],
    scopePathItem: [0],
    /*
     * Together, scopePath and scopePathItem describe where we
     * are in the data, in this case, we are in the tag users[0]
     * (the first user)
     */
};
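To make the relation between scopePath and scopePathItem concrete, here is a sketch of a hypothetical helper, describePosition, that combines both arrays into a readable path. It assumes that every nesting level is a loop over an array; the nested context is an illustrative value, not output captured from docxtemplater.

```javascript
// Hypothetical helper: turns scopePath and scopePathItem into a readable
// path such as "users[0]".
function describePosition(context) {
    return context.scopePath
        .map((key, i) => `${key}[${context.scopePathItem[i]}]`)
        .join(".");
}

const context = { num: 1, scopePath: ["users"], scopePathItem: [0] };
console.log(describePosition(context)); // "users[0]"

// Nested loops add one entry per level (illustrative values)
const nested = {
    num: 2,
    scopePath: ["users", "pets"],
    scopePathItem: [1, 0],
};
console.log(describePosition(nested)); // "users[1].pets[0]"
```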

Simple Parser example for {$index} and {$isLast} inside loops

As another example, it is possible to use the {$index}, {$isFirst} and {$isLast} tags inside a loop by using the following parser:

function parser(tag) {
    return {
        get(scope, context) {
            if (tag === "$index") {
                const indexes = context.scopePathItem;
                return indexes[indexes.length - 1];
            }
            if (tag === "$isLast") {
                const totalLength =
                    context.scopePathLength[
                        context.scopePathLength.length - 1
                    ];
                const index =
                    context.scopePathItem[
                        context.scopePathItem.length - 1
                    ];
                return index === totalLength - 1;
            }
            if (tag === "$isFirst") {
                const index =
                    context.scopePathItem[
                        context.scopePathItem.length - 1
                    ];
                return index === 0;
            }
            return scope[tag];
        },
    };
}
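The same logic can be exercised outside docxtemplater by passing a hand-written context. The sketch below is a condensed variant of the parser above; the last helper and the context values are assumptions made for the illustration.

```javascript
// Small helper (hypothetical name): last element of an array
const last = (list) => list[list.length - 1];

function parser(tag) {
    return {
        get(scope, context) {
            if (tag === "$index") {
                return last(context.scopePathItem);
            }
            if (tag === "$isFirst") {
                return last(context.scopePathItem) === 0;
            }
            if (tag === "$isLast") {
                return (
                    last(context.scopePathItem) ===
                    last(context.scopePathLength) - 1
                );
            }
            return scope[tag];
        },
    };
}

// Hand-written context for the second of two users (illustrative values)
const context = {
    scopePath: ["users"],
    scopePathItem: [1],
    scopePathLength: [2],
};
const scope = { name: "Mary" };

const index = parser("$index").get(scope, context);
const isFirst = parser("$isFirst").get(scope, context);
const isLast = parser("$isLast").get(scope, context);

console.log(index, isFirst, isLast); // 1 false true
```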

Those magic values can also be created using the angular parser.

Parser example to avoid using the parent scope if a value is null on the main scope

When using the following template:

{#products}
{name}, {price} €
{/products}

With the following data:

doc.render({
    name: "Santa Katerina",
    products: [
        {
            price: "$3.99",
        },
    ],
});

The {name} tag will use the "root scope", since it is not present in the products array.

If you explicitly don't want this behavior because you want the nullGetter to handle the tag in this case, you could use the following parser:

function parser(tag) {
    return {
        get(scope, context) {
            if (context.num < context.scopePath.length) {
                return null;
            }
            // You can customize your parser here instead of scope[tag] of course
            return scope[tag];
        },
    };
}

The context.num value contains the scope level for this particular evaluation.

When evaluating the {name} tag in the example above, there are two evaluations:

// For the first evaluation, when evaluating in the {#products} scope
context.num = 1;
context.scopePath = ["products"];
/*
 * This evaluation returns undefined because the
 * first product doesn't have a name property
 */

// For the second evaluation, when evaluating in the root scope
context.num = 0;
context.scopePath = ["products"];
// This evaluation returns null because of the extra added condition

Note that you could even make this behavior depend on a given prefix. For example, if you want to keep scope traversal by default but restrict some tags to the deepest scope only, you could add the following condition:

function parser(tag) {
    const onlyDeepestScope = tag[0] === "!";
    if (onlyDeepestScope) {
        // Remove the leading "!", ie: "!name" => "name"
        tag = tag.slice(1);
    }
    return {
        get(scope, context) {
            if (
                onlyDeepestScope &&
                context.num < context.scopePath.length
            ) {
                return null;
            }
            return scope[tag];
        },
    };
}
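To see how the "!" prefix interacts with scope traversal, the traversal can be simulated by hand. In this sketch, resolve is a hypothetical stand-in for docxtemplater's scope manager, and the returned string "(nullGetter value)" stands for whatever the configured nullGetter would produce.

```javascript
function parser(tag) {
    const onlyDeepestScope = tag[0] === "!";
    if (onlyDeepestScope) {
        tag = tag.slice(1);
    }
    return {
        get(scope, context) {
            if (onlyDeepestScope && context.num < context.scopePath.length) {
                return null;
            }
            return scope[tag];
        },
    };
}

// Hypothetical stand-in for the traversal: deepest scope first, then up
function resolve(tag, scopeList, scopePath) {
    const compiled = parser(tag);
    for (let num = scopeList.length - 1; num >= 0; num--) {
        const value = compiled.get(scopeList[num], { num, scopePath });
        if (value !== null && value !== undefined) {
            return value;
        }
    }
    return "(nullGetter value)";
}

const data = { name: "Santa Katerina", products: [{ price: "$3.99" }] };
const scopeList = [data, data.products[0]];

const traversed = resolve("name", scopeList, ["products"]);
const deepestOnly = resolve("!name", scopeList, ["products"]);
const deepestPrice = resolve("!price", scopeList, ["products"]);

console.log(traversed); // "Santa Katerina"
console.log(deepestOnly); // "(nullGetter value)"
console.log(deepestPrice); // "$3.99"
```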

Parser example to always use the root scope

Let's say that at the root of your data, you have some property called "company".

You need to access it within a loop, but the company is also part of the element that is looped upon.

With the following data:

doc.render({
    company: "ACME Company",
    contractors: [
        { company: "The other Company" },
        { company: "Foobar Company" },
    ],
});

If you want to access the company at the root level, it is not possible with the default parser.

You could implement it this way, when writing {$company}:

const options = {
    parser(tag) {
        /*
         * Detect the prefix once, at compile time, so that the
         * compiled parser can safely be evaluated several times
         */
        const onlyRootScope = tag[0] === "$";
        if (onlyRootScope) {
            // Remove the leading "$", ie: "$company" => "company"
            tag = tag.slice(1);
        }
        return {
            get(scope, context) {
                if (onlyRootScope && context.num !== 0) {
                    return null;
                }
                /*
                 * You can customize the rest of your parser here instead of
                 * scope[tag], by using the angular-parser for example.
                 */
                return scope[tag];
            },
        };
    },
};
const doc = new Docxtemplater(zip, options);
doc.render(/* data */);
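The effect of the "$" prefix can be checked without a document. The sketch below reuses one compiled parser across both loop iterations, which is why it strips the prefix once in parser() and leaves the tag untouched inside get(). The resolve function is a hypothetical stand-in for the scope traversal, not docxtemplater's own code.

```javascript
function parser(tag) {
    const onlyRootScope = tag[0] === "$";
    const key = onlyRootScope ? tag.slice(1) : tag;
    return {
        get(scope, context) {
            if (onlyRootScope && context.num !== 0) {
                return null;
            }
            return scope[key];
        },
    };
}

// Hypothetical stand-in for the traversal: deepest scope first, then up
function resolve(compiled, scopeList, scopePath) {
    for (let num = scopeList.length - 1; num >= 0; num--) {
        const value = compiled.get(scopeList[num], { num, scopePath });
        if (value !== null && value !== undefined) {
            return value;
        }
    }
    return "(nullGetter value)";
}

const data = {
    company: "ACME Company",
    contractors: [
        { company: "The other Company" },
        { company: "Foobar Company" },
    ],
};

// Compile each tag once, then evaluate it for every contractor
const localCompany = parser("company");
const rootCompany = parser("$company");
const rows = data.contractors.map((contractor) => {
    const scopeList = [data, contractor];
    return [
        resolve(localCompany, scopeList, ["contractors"]),
        resolve(rootCompany, scopeList, ["contractors"]),
    ];
});

console.log(rows[0]); // [ 'The other Company', 'ACME Company' ]
console.log(rows[1]); // [ 'Foobar Company', 'ACME Company' ]
```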

Parser example to use the parent scope (or nth parent)

Let's say that you would like to access the "company" property, but not the one in the current scope, the one in the parent scope.

You need to access it within a loop, but the "company" property is also part of the element that is looped upon.

With the following data:

doc.render({
    company: "ACME Company",
    contractors: [
        { company: "The other Company" },
        { company: "Foobar Company" },
    ],
});

You could implement it this way, when writing {..company}. With this syntax:

  • {company} will retrieve company in the local scope
  • {..company} will retrieve company in the parent scope
  • {...company} will retrieve company in the grandparent scope (level 2), and so on

const options = {
    parser(tag) {
        // Two or more leading dots select a parent scope
        const match = /^(\.{2,})(.*)/.exec(tag);
        let parentCount = 0;
        if (match) {
            // ".." => 1 level up, "..." => 2 levels up
            parentCount = match[1].length - 1;
            tag = match[2];
        }
        return {
            get(scope, context) {
                if (
                    context.scopePath.length - context.num <
                    parentCount
                ) {
                    return null;
                }
                return scope[tag];
            },
        };
    },
};
const doc = new Docxtemplater(zip, options);
doc.render(/* data */);
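As a quick check of the dot-prefix syntax, the traversal can again be simulated by hand. This is a sketch under the same assumptions as the text above: resolve is a hypothetical stand-in for the scope manager, and "(nullGetter value)" stands for the nullGetter result.

```javascript
function parser(tag) {
    const match = /^(\.{2,})(.*)/.exec(tag);
    let parentCount = 0;
    if (match) {
        parentCount = match[1].length - 1;
        tag = match[2];
    }
    return {
        get(scope, context) {
            // Skip scopes that are not at least parentCount levels up
            if (context.scopePath.length - context.num < parentCount) {
                return null;
            }
            return scope[tag];
        },
    };
}

// Hypothetical stand-in for the traversal: deepest scope first, then up
function resolve(tag, scopeList, scopePath) {
    const compiled = parser(tag);
    for (let num = scopeList.length - 1; num >= 0; num--) {
        const value = compiled.get(scopeList[num], { num, scopePath });
        if (value !== null && value !== undefined) {
            return value;
        }
    }
    return "(nullGetter value)";
}

const data = {
    company: "ACME Company",
    contractors: [
        { company: "The other Company" },
        { company: "Foobar Company" },
    ],
};
const scopeList = [data, data.contractors[0]];

const local = resolve("company", scopeList, ["contractors"]);
const parent = resolve("..company", scopeList, ["contractors"]);
const grandparent = resolve("...company", scopeList, ["contractors"]);

console.log(local); // "The other Company"
console.log(parent); // "ACME Company"
console.log(grandparent); // "(nullGetter value)", there is no grandparent here
```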