UTF8 locale MFC for DragonflyBSD
    Joerg Sonnenberger 
    joerg at britannica.bec.de
       
    Mon Mar 29 08:48:30 PST 2004
    
    
  
On Mon, Mar 29, 2004 at 11:09:08AM -0500, Dave Cuthbert wrote:
> My personal opinion: UCS-4 wastes a lot of space given that Unicode 3.1 
> is a ~21-bit set and nobody is really using the >=U+10000 space in a 
> practical manner (yet?).  But if you need to have a one-to-one mapping, 
> you don't have much choice.
IIRC there are already some scripts outside the base plane. Anyway, nobody
forces you to encode anything in UCS-4, use UTF-8 or UTF-16 for that. But
if you need to have a hard-wired assumption about the size of a "character",
4 bytes is much more reasonable then 2 bytes in the mid to long term.
Joerg
> 
> Unless you have a machine which uses 21-bit bytes, of course. ;-)
    
    
More information about the Submit
mailing list