1.1.1 • Published 2 years ago

minhtml v1.1.1

Weekly downloads
-
License
MIT
Repository
github
Last release
2 years ago

minhtml - a Node.JS package for easily minifying HTML

minhtml, short for "minify html" is a highly configurable, well-tested, JavaScript-based HTML minifier. It also supports Mustache/Handlebars templates and should also reasonably support Angular, React and Vue components.

See corresponding blog post by the original author for all the gory details of how it works, description of each option, testing results and conclusions.

Minification comparison

How does minhtml compare to other solutions — HTML Minifier from Will Peavy (1st result in Google search for "html minifier") as well as minimize?

SiteOriginal size (KB)minhtmlminimizeWill Peavy
Google49444951
Twitter69616868
Stack Overflow202164178178
minhtml222136200214
Bootstrap CSS271260269229
BBC333307331332
Amazon459410449461
Wikipedia692558671697
New York Times770688766755
Eloquent Javascript870815840864
NBC1857171218441864
ES draft591251945384n/a

Options Quick Reference

Most of the options are disabled by default.

OptionDescriptionDefault
caseSensitiveTreat attributes in case sensitive manner (useful for custom HTML tags)false
collapseBooleanAttributesOmit attribute values from boolean attributesfalse
collapseInlineTagWhitespaceDon't leave any spaces between display:inline; elements when collapsing. Must be used in conjunction with collapseWhitespace=truefalse
collapseWhitespaceCollapse white space that contributes to text nodes in a document treefalse
conservativeCollapseAlways collapse to 1 space (never remove it entirely). Must be used in conjunction with collapseWhitespace=truefalse
continueOnParseErrorHandle parse errors instead of aborting.false
customAttrAssignArrays of regex'es that allow to support custom attribute assign expressions (e.g. '<div flex?="{{mode != cover}}"></div>')[ ]
customAttrCollapseRegex that specifies custom attribute to strip newlines from (e.g. /ng-class/)
customAttrSurroundArrays of regex'es that allow to support custom attribute surround expressions (e.g. <input {{#if value}}checked="checked"{{/if}}>)[ ]
customEventAttributesArrays of regex'es that allow to support custom event attributes for minifyJS (e.g. ng-click)[ /^on[a-z]{3,}$/ ]
decodeEntitiesUse direct Unicode characters whenever possiblefalse
html5Parse input according to HTML5 specificationstrue
ignoreCustomCommentsArray of regex'es that allow to ignore certain comments, when matched[ /^!/, /^\s*#/ ]
ignoreCustomFragmentsArray of regex'es that allow to ignore certain fragments, when matched (e.g. <?php ... ?>, {{ ... }}, etc.)[ /<%[\s\S]*?%>/, /<\?[\s\S]*?\?>/ ]
includeAutoGeneratedTagsInsert tags generated by HTML parsertrue
keepClosingSlashKeep the trailing slash on singleton elementsfalse
maxLineLengthSpecify a maximum line length. Compressed output will be split by newlines at valid HTML split-points
minifyCSSMinify CSS in style elements and style attributes (uses clean-css)false (could be true, Object, Function(text, type))
minifyJSMinify JavaScript in script elements and event attributes (uses Terser)false (could be true, Object, Function(text, inline))
minifyURLsMinify URLs in various attributes (uses relateurl)false (could be String, Object, Function(text))
preserveLineBreaksAlways collapse to 1 line break (never remove it entirely) when whitespace between tags include a line break. Must be used in conjunction with collapseWhitespace=truefalse
preventAttributesEscapingPrevents the escaping of the values of attributesfalse
processConditionalCommentsProcess contents of conditional comments through minifierfalse
processScriptsArray of strings corresponding to types of script elements to process through minifier (e.g. text/ng-template, text/x-handlebars-template, etc.)[ ]
quoteCharacterType of quote to use for attribute values (' or ")
removeAttributeQuotesRemove quotes around attributes when possiblefalse
removeCommentsStrip HTML commentsfalse
removeEmptyAttributesRemove all attributes with whitespace-only valuesfalse (could be true, Function(attrName, tag))
removeEmptyElementsRemove all elements with empty contentsfalse
removeOptionalTagsRemove optional tagsfalse
removeRedundantAttributesRemove attributes when value matches default.false
removeScriptTypeAttributesRemove type="text/javascript" from script tags. Other type attribute values are left intactfalse
removeStyleLinkTypeAttributesRemove type="text/css" from style and link tags. Other type attribute values are left intactfalse
removeTagWhitespaceRemove space between attributes whenever possible. Note that this will result in invalid HTML!false
sortAttributesSort attributes by frequencyfalse
sortClassNameSort style classes by frequencyfalse
trimCustomFragmentsTrim white space around ignoreCustomFragments.false
useShortDoctypeReplaces the doctype with the short (HTML5) doctypefalse

Sorting attributes / style classes

Minifier options like sortAttributes and sortClassName won't impact the plain-text size of the output. However, they form long repetitive chains of characters that should improve compression ratio of gzip used in HTTP compression.

Special cases

Ignoring chunks of markup

If you have chunks of markup you would like preserved, you can wrap them <!-- htmlmin:ignore -->.

Preserving SVG tags

SVG tags are automatically recognized, and when they are minified, both case-sensitivity and closing-slashes are preserved, regardless of the minification settings used for the rest of the file.

Working with invalid markup

minhtml can't work with invalid or partial chunks of markup. This is because it parses markup into a tree structure, then modifies it (removing anything that was specified for removal, ignoring anything that was specified to be ignored, etc.), then it creates a markup out of that tree and returns it.

Input markup (e.g. <p id="">foo)

Internal representation of markup in a form of tree (e.g. { tag: "p", attr: "id", children: ["foo"] })

Transformation of internal representation (e.g. removal of id attribute)

Output of resulting markup (e.g. <p>foo</p>)

minhtml can't know that original markup was only half of the tree; it does its best to try to parse it as a full tree and it loses information about tree being malformed or partial in the beginning. As a result, it can't create a partial/malformed tree at the time of the output.

Installation Instructions

From NPM for use as a (global) command line app:

npm install minhtml -g

From NPM for programmatic use:

npm install minhtml

From Git:

git clone https://github.com/R4356th/minhtml.git
cd minhtml
npm link .

Usage

Note that almost all options are disabled by default. For command line usage, please see minhtml --help for a list of available options. Experiment and find what works best for you and your project.

  • Sample command: minhtml --collapse-whitespace --remove-comments --remove-optional-tags --remove-redundant-attributes --remove-script-type-attributes --use-short-doctype --minify-css true --minify-js true

Node.js

var minify = require('minhtml').minify;
var result = await minify('<p title="blah" id="moo">foo</p>', {
  removeAttributeQuotes: true
});
result; // '<p title=blah id=moo>foo</p>'
1.1.1

2 years ago

1.1.0

2 years ago

1.0.1

2 years ago

0.7.1

2 years ago

0.7.0

2 years ago

0.6.0

3 years ago

0.5.0

3 years ago

0.5.2

3 years ago

0.5.1

3 years ago

0.4.0

3 years ago

0.3.0

3 years ago

0.2.0

3 years ago

0.1.0

3 years ago

1.0.0

4 years ago