Marpa

Update of "handling unicode"



Overview

Artifact ID: 20f5c8ef2d2ffab257f37a1f7d7dbd86db47fbda
Page Name: handling unicode
Date: 2017-06-27 18:18:44
Original User: aku
Mimetype: text/x-markdown
Parent: 2d88c6bb26e54dab973c551df9863baba966a324
Next: ddbe4b5d463909c0c250c3b2325fbb2f9716f1e4
Content


Unicode

Engine

Grammars

Relevant references

Regardless, at the lowest level the engine has to support only bytes and byte ranges, or even only bytes, with each range rewritten into an alternation of single bytes. Such an alternation is always finite: at most 256 alternatives, for the full range [00-ff].
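The rewrite of a byte range into an alternation can be sketched as follows. This is a minimal illustration in Python, not the engine's actual code; the function name `expand_byte_range` is hypothetical.

```python
def expand_byte_range(lo, hi):
    """Rewrite the inclusive byte range [lo-hi] as an explicit
    alternation, returned as a list of single-byte alternatives.

    Hypothetical helper for illustration only.
    """
    if not (0 <= lo <= hi <= 0xFF):
        raise ValueError("byte range must satisfy 0 <= lo <= hi <= 255")
    return [bytes([b]) for b in range(lo, hi + 1)]


# The ASCII digits [30-39] become ten alternatives.
digits = expand_byte_range(0x30, 0x39)

# The full range [00-ff] is the worst case, with 256 alternatives.
full = expand_byte_range(0x00, 0xFF)
```

An engine that only knows single bytes would then treat `digits` as the right-hand sides of ten alternative rules for the same symbol.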

As a side effect, we can support the full range of Unicode character classes, even though Tcl itself does not support them.
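To illustrate why a byte-level engine is enough for arbitrary character classes, the sketch below maps a class, given as inclusive code point ranges, to the byte sequences of its members. It assumes the input is UTF-8 encoded, which this note does not state explicitly. The function name `class_to_byte_sequences` is hypothetical, and enumerating every member is only practical for small classes. A real implementation would more likely merge the sequences into byte ranges.

```python
def class_to_byte_sequences(ranges):
    """Map a character class to the sorted list of UTF-8 byte
    sequences of its members.

    `ranges` is a list of inclusive (lo, hi) code point pairs.
    Hypothetical helper; UTF-8 is an assumption of this sketch.
    """
    seqs = set()
    for lo, hi in ranges:
        for cp in range(lo, hi + 1):
            if 0xD800 <= cp <= 0xDFFF:
                continue  # surrogates have no UTF-8 encoding
            seqs.add(chr(cp).encode("utf-8"))
    return sorted(seqs)


# Greek lowercase letters U+03B1..U+03C9, all two-byte sequences.
greek_lower = class_to_byte_sequences([(0x03B1, 0x03C9)])
```

Each resulting sequence is a plain concatenation of bytes, so the class as a whole becomes an alternation of byte sequences that the engine can match without knowing anything about Unicode.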