Unicode

Unicode 是一种字符集，它为世界上所有字符提供唯一的编码。例如 \u6211，表示在 Unicode 字符集中，6211 表示的就是中文“我”。Unicode 只是规定了字符集，但是并未规定如何存储，这些表示如何存储就称为编码，编码分为 3 种：UTF-8、UTF-16、UTF-32。

UTF-8、UTF-16、UTF-32

因为 UTF-8 是定了顺序的。举个例子，110xxxxx 10xxxxxx 和 10xxxxxx 110xxxxx 是没有区别的，按照 UTF-8 的规则，110xxxxx 一定是头，10xxxxxx 一定是尾，所以字节序无关。

1字节：0xxxxxxx
2字节：110xxxxx 10xxxxxx
3字节：1110xxxx 10xxxxxx 10xxxxxx
4字节：11110xxx 10xxxxxx 10xxxxxx 10xxxxxx

"𠮷".charCodeAt(0).toString(16); // d842，这个是错误的值
"𠮷".codePointAt(0).toString(16); // 20bb7