From: Eli Zaretskii <eliz@gnu.org>
Date: Tue, 24 Jun 2014 16:58:06 +0000 (+0300)
Subject: admin/notes/unicode: Some notes about what to do when a new Unicode version is imported.
X-Git-Tag: emacs-24.3.92~27
X-Git-Url: http://git.eshelyaron.com/gitweb/?a=commitdiff_plain;h=2394ef283c781297f3a1da79c8c8c92ca9d1b5e5;p=emacs.git

admin/notes/unicode: Some notes about what to do when a new Unicode version is imported.
---

diff --git a/admin/ChangeLog b/admin/ChangeLog
index 1fb3a6efd1a..1abbf36e9dc 100644
--- a/admin/ChangeLog
+++ b/admin/ChangeLog
@@ -1,3 +1,8 @@
+2014-06-24 Eli Zaretskii <eliz@gnu.org>
+
+	* notes/unicode: Some notes about what to do when a new Unicode
+	version is imported.
+
 2014-06-24 Glenn Morris <rgm@gnu.org>
 
 	* unidata/BidiMirroring.txt: Update to 7.0.0 (only comment changes).
diff --git a/admin/notes/unicode b/admin/notes/unicode
index 654580639f7..cff619fd8b8 100644
--- a/admin/notes/unicode
+++ b/admin/notes/unicode
@@ -3,6 +3,39 @@
 Copyright (C) 2002-2014 Free Software Foundation, Inc.
 See the end of the file for license conditions.
 
+Importing a new Unicode Standard version into Emacs
+-------------------------------------------------------------
+
+Emacs uses the following files from the Unicode Character Database
+(a.k.a. "UCD"):
+
+  . UnicodeData.txt
+  . BidiMirroring.txt
+  . IVD_Sequences.txt
+
+First, these files need to be copied into admin/unidata/, and then
+Emacs should be rebuilt for them to take effect. Rebuilding Emacs
+updates several derived files elsewhere in the Emacs source tree,
+mainly in lisp/international/.
+
+When Emacs is rebuilt for the first time after importing the new
+files, pay attention to any warning or error messages. In particular,
+admin/unidata/unidata-gen.el will complain if UnicodeData.txt defines
+new bidirectional attributes of characters, because unidata-gen.el,
+bidi.c and dispextern.h need to be updated in that case; failure to do
+so will cause aborts in redisplay.
+
+Next, review the changes in UnicodeData.txt vs the previous version
+used by Emacs. Any changes, be it introduction of new scripts or
+addition of codepoints to existing scripts, need corresponding changes
+in the data used for filling char-script-table, see characters.el
+around line 1300. Other databases and settings in characters.el, such
+as the data for char-width-table, might also need changes.
+
+Any new scripts added by UnicodeData.txt will also need updates to
+script-representative-chars defined in fontset.el. Other databases in
+fontset.el might also need to be updated as needed.
+
 Problems, fixmes and other unicode-related issues
 -------------------------------------------------------------
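
The review step in the notes above (comparing the new UnicodeData.txt with the version Emacs previously used) could be partly automated. The following is only a minimal sketch, not part of Emacs or of this commit: the function names `parse_unicode_data` and `summarize_changes` are hypothetical, and the code assumes nothing beyond the standard semicolon-separated UnicodeData.txt layout (field 0 is the code point in hex, field 1 the name, field 2 the general category, field 4 the bidi class). It reports newly listed code points and any bidi class values that did not occur in the old file, which is the situation the notes say requires changes to unidata-gen.el, bidi.c and dispextern.h. Ranges given as `<..., First>` / `<..., Last>` pairs are not expanded.

```python
def parse_unicode_data(text):
    """Map code point -> (name, general category, bidi class).

    Only the listed entries are recorded; ranges marked with
    '<..., First>' and '<..., Last>' are NOT expanded in this sketch.
    """
    entries = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # tolerate blank lines
        fields = line.split(";")
        entries[int(fields[0], 16)] = (fields[1], fields[2], fields[4])
    return entries


def summarize_changes(old_text, new_text):
    """Return (added code points, bidi classes absent from the old file)."""
    old = parse_unicode_data(old_text)
    new = parse_unicode_data(new_text)
    # Code points listed in the new file but not in the old one.
    added = sorted(cp for cp in new if cp not in old)
    # Bidi class values that appear only in the new file.
    old_bidi = {entry[2] for entry in old.values()}
    new_bidi = sorted({entry[2] for entry in new.values()} - old_bidi)
    return added, new_bidi
```

To use it, read the contents of the old and new UnicodeData.txt files into strings and pass them to `summarize_changes`. The list of added code points is a starting point for checking the char-script-table and char-width-table data by hand; it does not replace the warnings printed by unidata-gen.el during the rebuild.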