caml-list - the Caml user's mailing list
* [Caml-list] Unicode 13.0.0 update for Uucd, Uucp, Uunf and Uuseg
@ 2020-03-11  9:54 Daniel Bünzli
  2020-03-13 14:30 ` orbifx
  0 siblings, 1 reply; 2+ messages in thread
From: Daniel Bünzli @ 2020-03-11  9:54 UTC
  To: caml-list; +Cc: David Kaloper Meršinjak

Hello, 

Unicode 13.0.0 was released on the 10th of March.

It adds 5930 characters to the standard, including graphic symbols for legacy computing. If you were looking for characters representing seven-segment decimal digits, you now [have them][0]. For the curious, the [encoding proposal][1] gives the motivation and sources for these new symbols. For more information about the other additions, see [this page][2].
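
As a rough illustration (not part of the libraries announced here), the sketch below maps ASCII digits to the segmented digits at U+1FBF0..U+1FBF9 from the chart linked above and encodes them as UTF-8. It uses only the OCaml standard library and assumes OCaml 4.06 or later for `Buffer.add_utf_8_uchar`; the names `segmented_digit` and `to_segmented` are hypothetical helpers made up for this example.

```ocaml
(* Sketch only: helper names are hypothetical, not from Uucp or Uuseg. *)

(* U+1FBF0 is SEGMENTED DIGIT ZERO; the ten digits are contiguous. *)
let segmented_zero = 0x1FBF0

(* Segmented digit scalar value for an ASCII digit, if [c] is one. *)
let segmented_digit (c : char) : Uchar.t option =
  match c with
  | '0' .. '9' ->
      Some (Uchar.of_int (segmented_zero + Char.code c - Char.code '0'))
  | _ -> None

(* Replace each ASCII digit of [s] by its segmented counterpart.
   All other bytes are copied unchanged, so valid UTF-8 input
   yields valid UTF-8 output. *)
let to_segmented (s : string) : string =
  let b = Buffer.create (String.length s * 4) in
  String.iter
    (fun c ->
      match segmented_digit c with
      | Some u -> Buffer.add_utf_8_uchar b u
      | None -> Buffer.add_char b c)
    s;
  Buffer.contents b

let () = print_endline (to_segmented "13.0.0")
```

Whether the output renders as seven-segment digits depends on the terminal font having glyphs for this block, which many fonts may not yet have.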

Accordingly, the libraries mentioned at the end of this message had to be updated; consult the individual release notes for details. Both Uucd and Uucp are incompatible releases since new script and block enumerants had to be added.

Uucp has a new Emoji module with the new emoji properties introduced in 13.0.0, which are now used by Uuseg to improve emoji segmentation. The overall compiled size of Uucp shrank a bit; here uucp.cmxs went from 7.8 MB to 4.6 MB. Further reduction can likely be achieved with more work. Thanks to David Kaloper Meršinjak for helping with this.

A periodic reminder: if Unicode still puzzles you, read the absolute minimal Unicode introduction and OCaml Unicode tips on [this page][3] (also available via `odig doc uucp`).

Happy retro computing,

Daniel

P.S. The OCaml compiler [detected][4] an obsolete rule in the 13.0.0 update of the Unicode line breaking algorithm.

[0]: https://www.unicode.org/charts/PDF/U1FB00.pdf
[1]: https://www.unicode.org/L2/L2019/19025-terminals-prop.pdf
[2]: http://blog.unicode.org/2020/03/announcing-unicode-standard-version-130.html
[3]: https://erratique.ch/software/uucp/doc/unicode.html
[4]: https://www.unicode.org/mail-arch/unicode-ml/y2020-m03/0000.html

---

Uucd 13.0.0 Unicode character database decoder for OCaml.
http://erratique.ch/software/uucd

Uucp 13.0.0 Unicode character properties for OCaml.
http://erratique.ch/software/uucp

Uunf 13.0.0 Unicode text normalization for OCaml.
http://erratique.ch/software/uunf

Uuseg 13.0.0 Unicode text segmentation for OCaml.
http://erratique.ch/software/uuseg


* Re: [Caml-list] Unicode 13.0.0 update for Uucd, Uucp, Uunf and Uuseg
  2020-03-11  9:54 [Caml-list] Unicode 13.0.0 update for Uucd, Uucp, Uunf and Uuseg Daniel Bünzli
@ 2020-03-13 14:30 ` orbifx
  0 siblings, 0 replies; 2+ messages in thread
From: orbifx @ 2020-03-13 14:30 UTC
  To: caml-list

On 11/03/2020 09:54, Daniel Bünzli wrote:
> Unicode 13.0.0 was released on the 10th of March.
> A periodic reminder, if Unicode still puzzles you, read an absolute minimal Unicode introduction and OCaml Unicode tips on [this page][3]

Thanks for sharing!

