Skip to content

anonyco/FastestSmallestTextEncoderDecoder

Repository files navigation

npm version GitHub stars GitHub file size in bytes GitHub file size in bytes npm bundle size (version) CC0 license npm downloads

This Javascript library provides the most performant tiny polyfill for window.TextEncoder, TextEncoder.prototype.encodeInto, and window.TextDecoder for use in the browser, in NodeJS, in RequireJS, in web Workers, in SharedWorkers, and in ServiceWorkers.

Quick Start

Add the following HTML Code inside the <head>:

<script src="https://dl.dropboxusercontent.com/s/r55397ld512etib/EncoderDecoderTogether.min.js?dl=0" nomodule="" type="text/javascript"></script>

If no script on the page requires this library until the DOMContentLoaded event, then use the the much less blocking version below:

<script defer="" src="https://dl.dropboxusercontent.com/s/r55397ld512etib/EncoderDecoderTogether.min.js?dl=0" nomodule="" type="text/javascript"></script>

Alternatively, either use https://dl.dropboxusercontent.com/s/47481btie8pb95h/FastestTextEncoderPolyfill.min.js?dl=0 to polyfill window.TextEncoder for converting a String into a Uint8Array or use https://dl.dropboxusercontent.com/s/qmoknmp86sytc74/FastestTextDecoderPolyfill.min.js?dl=0 to only polyfill window.TextDecoder for converting a Uint8Array/ArrayBuffer/[typedarray]/global.Buffer into a String.

The nomodule attribute prevents the script from being needlessly downloaded and executed on browsers which already support TextEncoder and TextDecoder. nomodule does not test for the presence of TextEncoder or TextDecoder, but it is very safe to assume that browsers advanced enough to support modules also support TextEncoder and TextDecoder.

EncodeInto

See the MDN here for documentation. For the TextEncoder.prototype.encodeInto polyfill, please use https://dl.dropboxusercontent.com/s/i2e2rho1ohtbhfg/EncoderDecoderTogether.min.js?dl=0 for the full package, https://dl.dropboxusercontent.com/s/nlcgzbr0ayd5pjs/FastestTextEncoderPolyfill.min.js?dl=0 for only TextEncoder and TextEncoder.prototype.encodeInto, and npm i fastestsmallesttextencoderdecoder-encodeinto for NodeJS, es6 modules, RequireJS, AngularJS, or whatever it is that floats your boat. The encodeInto folder of this repository contains the auto-generated encodeInto build of the main project. The npm project is fastestsmallesttextencoderdecoder-encodeinto:

npm install fastestsmallesttextencoderdecoder-encodeinto

RequireJS and NodeJS

For dropping into either RequireJS or NodeJS, please use the fastestsmallesttextencoderdecoder npm repository, this minified file, or the corresponding source code file. To install via npm, use the following code.

npm install fastestsmallesttextencoderdecoder

Alternatively, if one do not know how to use the command line, save the script corresponding to one's operating system to the directory where the nodejs script will run and use the file manager to run the script (on Windows, it's a double-click).

After installing via npm, one can use require("fastestsmallesttextencoderdecoder"). Alternatively, one can drop the EncoderAndDecoderNodeJS.min.js file into the same directory as their NodeJS script and do require("./EncoderAndDecoderNodeJS.min.js"). Both methods are functionally equivalent.

AngularJS

Open a terminal in the project's directory, and install fastestsmallesttextencoderdecoder via npm.

npm install fastestsmallesttextencoderdecoder

Then, add import 'fastestsmallesttextencoderdecoder'; to the polyfills.ts file.

Benchmarks

Don't take my word that FastestSmallestTextEncoderDecoder is the fastest. Instead, check out the benchmarks below. You can run your own benchmarks by cloning this repo and running npm run benchmark, but beware that you need a beefy computer with plenty of free RAM, as the NodeJS garbage collector is disabled via --noconcurrent_sweeping --nouse-idle-notification so that it does not interfer with the timing of the tests (the GC is runned manually via global.gc(true) at the conclusion of the tests).

The tests below were performed on an ascii file. To ensure consistancy, all test results are the mean of the IQR of many many trials.

Library Decode 32 bytes Decode 32768 Decode 16777216 Encode 32 bytes Encode 32768 Encode 16777216
fast-text-encoding 44247 Kb/sec 73765 Kb/sec 99360 Kb/sec 11135 Kb/sec 20393 Kb/sec 27898 Kb/sec
text-encoding 5483 Kb/sec 6718 Kb/sec 7982 Kb/sec 6001 Kb/sec 5198 Kb/sec 7950 Kb/sec
TextEncoderTextDecoder.js 5989 Kb/sec 5240 Kb/sec 6376 Kb/sec 8608 Kb/sec 7815 Kb/sec 12733 Kb/sec
TextEncoderLite 19731 Kb/sec 24746 Kb/sec 13843 Kb/sec 10357 Kb/sec 8690 Kb/sec 13093 Kb/sec
text-encoding-shim 15415 Kb/sec 17477 Kb/sec 26779 Kb/sec 6692 Kb/sec 9431 Kb/sec 15477 Kb/sec
FastestSmallestTextEncoderDecoder 48268 Kb/sec 106573 Kb/sec 193986 Kb/sec 14203 Kb/sec 21734 Kb/sec 31994 Kb/sec
Native 20248 Kb/sec 895592 Kb/sec 886559 Kb/sec 17925 Kb/sec 26714 Kb/sec 36302 Kb/sec

Needless to say, FastestSmallestTextEncoderDecoder outperformed every other polyfill out there. Infact, it is so fast that it outperformed the native implementation on a set of 32 ascii bytes. The tests below were performed on a mixed ascii-utf8 file.

Library Decode 32 bytes Decode 32768 Decode 16777216 Encode 32 bytes Encode 32768 Encode 16777216
fast-text-encoding 39218 Kb/sec 65368 Kb/sec 93653 Kb/sec 12144 Kb/sec 20309 Kb/sec 29522 Kb/sec
text-encoding 4931 Kb/sec 6405 Kb/sec 8323 Kb/sec 6906 Kb/sec 4352 Kb/sec 8077 Kb/sec
TextEncoderTextDecoder.js 7675 Kb/sec 5423 Kb/sec 7264 Kb/sec 8946 Kb/sec 6816 Kb/sec 12716 Kb/sec
TextEncoderLite 9034 Kb/sec 10380 Kb/sec 10372 Kb/sec 10771 Kb/sec 7452 Kb/sec 12727 Kb/sec
text-encoding-shim 13735 Kb/sec 14886 Kb/sec 24207 Kb/sec 7634 Kb/sec 9097 Kb/sec 15349 Kb/sec
FastestSmallestTextEncoderDecoder 42538 Kb/sec 65701 Kb/sec 118350 Kb/sec 14834 Kb/sec 21717 Kb/sec 31811 Kb/sec
Native 23929 Kb/sec 364062 Kb/sec 512951 Kb/sec 19296 Kb/sec 26902 Kb/sec 37260 Kb/sec

Suprisingly, FastestSmallestTextEncoderDecoded once again beats the native implementation by a wide margin on a set of 32 characters. In the next test, let's examine a more real world example—the 1876 The Russian Synodal Bible.txt. It's a whoping 4.4MB rat's-nest of complex Russian UTF-8, sure to give any encoder/decoder a bad day. Let's see how they perform at their worst.

Library Decode Russian Bible Encode Russian Bible
fast-text-encoding 93174 Kb/sec 34207 Kb/sec
text-encoding 8120 Kb/sec 8026 Kb/sec
TextEncoderTextDecoder.js 6102 Kb/sec 12512 Kb/sec
TextEncoderLite 15564 Kb/sec 12650 Kb/sec
text-encoding-shim 28146 Kb/sec 17437 Kb/sec
FastestSmallestTextEncoderDecoder 106154 Kb/sec 35684 Kb/sec
Native 870569 Kb/sec 43547 Kb/sec

Browser Support

This polyfill will bring support for TextEncoder/TextDecoder to the following browsers.

Feature Chrome Firefox Opera Edge Internet Explorer Safari Android Samsung Internet Node.js
Full Polyfill 7.0 4.0 11.6 12.0** 10 5.1 (Desktop) / 4.2 (iOS) 4.0 1.0 3.0
Partial Polyfill* 1.0** 0.6 7.0 (Desktop) / 9.5** (Mobile) 12.0** 4.0 2.0 1.0** 1.0** 0.10

Also note that while this polyfill may work in these old browsers, it is very likely that the rest of one's website will not unless if one make a concious effort to have their code work in these old browsers.

* Partial polyfill means that Array (or Buffer in NodeJS) will be used instead of Uint8Array/[typedarray].

** This is the first public release of the browser

API Documentation

Please review the MDN at window.TextEncoder and window.TextDecoder for information on how to use TextEncoder and TextDecoder.

As for NodeJS, calling require("EncoderAndDecoderNodeJS.min.js") yields the following object. Note that this polyfill checks for global.TextEncoder and global.TextDecoder and returns the native implementation if available.

module.exports = {
	TextEncoder: function TextEncoder(){/*...*/},
	TextDecoder: function TextDecoder(){/*...*/},
	encode: TextEncoder.prototype.encode,
	decode: TextDecoder.prototype.decode
}

In NodeJS, one does not ever have to use new just to get the encoder/decoder (although one still can do so if they want to). All of the code snippets below function identically (aside from unused local variables introduced into the scope).

    // Variation 1
    const {TextEncoder, TextDecoder} = require("fastestsmallesttextencoderdecoder");
    const encode = (new TextEncoder).encode;
    const decode = (new TextDecoder).decode;
    // Variation 2
    const {encode, decode} = require("fastestsmallesttextencoderdecoder");
    // Variation 3 (a rewording of Variation 2)
    const encodeAndDecodeModule = require("fastestsmallesttextencoderdecoder");
    const encode = encodeAndDecodeModule.encode;
    const decode = encodeAndDecodeModule.decode;

Or, one can use the new and shiny ES6 module importation statements.

    // Variation 1
    import {TextEncoder, TextDecoder} from "fastestsmallesttextencoderdecoder";
    const encode = (new TextEncoder).encode;
    const decode = (new TextDecoder).decode;
    // Variation 2
    import {encode, decode} from "fastestsmallesttextencoderdecoder";
    // Variation 3 (a rewording of Variation 2)
    import * as encodeAndDecodeModule from "fastestsmallesttextencoderdecoder";
    const encode = encodeAndDecodeModule.encode;
    const decode = encodeAndDecodeModule.decode;

Demonstration

Visit the GithubPage to see a demonstation. As seen in the Web Worker hexWorker.js, the Github Pages demonstration uses a special encoderAndDecoderForced.src.js version of this library to forcefully install the TextEncoder and TextDecoder even when there is native support. That way, this demonstraton should serve to truthfully demonstrate this polyfill.

npm Project

This project can be found on npm here at this link.

Development

On Linux, the project can be developed by cloning it with the following command line. The development scripts are designed to be interpeted by Dash, and whether they work on Mac OS is unknown, but they certainly won't work on Windows.

git clone https://github.com/anonyco/FastestSmallestTextEncoderDecoder.git; cd FastestSmallestTextEncoderDecoder; npm run install-dev

Emphasize the npm run install-dev, which downloads closure-compiler.jar into the repository for minifying the files.

Now that the repository is cloned, edit the files as one see fit. Do not edit the files in the encodeInto folder. Those are all auto-generated. Also, do not run npm run build in the encodeInto. That's done automatically when npm run build is runned in the topmost folder. Now that the files have been edited, run the following in the terminal in the root folder of the repository in order to minify the NodeJS JavaScript files.

npm run build

To edit tests, edit test/node.js. These tests are compared against the native implementation to ensure validity. To run tests, do the following.

npm run test

Continuity

I try my best to be a realist, and what's more realistic than death? I am going to die someday and it may be tomorrow in a car crash. You never know. As I have no coder freinds to look out for my projects, I'm looking for anyone who wants to be a collaborator on this project in the event of the unforseen. Reach out to me at wowzeryest@gmail.com. If issues/pulls start piling up over the course of months, assume the worst. As I am trying my best to do my part to help the community, I encourage every developer to share their projects with other people to ensure continuity.

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages