[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[pd@kubism.ku.dk: 8-bit fonts for TeX ?]

You may be interested in a scheme I am almost ready to release,
which allows pretty free mapping of METAFONTS in the 128-255
area.  (down below too, but that can get you into difficulties
in the related TeX coding).  Why not have a hard and fast scheme?
Karl givess one reason, and the enclosed copy of a letter that
I never got around to mailing indicates another.

  \opening{Dear Mr. Freytag,} 

  When I received the Unicode Draft over a year ago, I was working on
  other alphabets, and I am afraid I did not notice till today a curious
  omission from all the coding tables.  In a sense, it is
  understandable, since the missing letters are not part of the {\it
  official} alphabet of any country, but then neither are IPA
  characters, and they have been allowed spaces.

  I enclose proof sheets of the 11 characters I have recently been
  working on (lower case only, but they all have upper case siblings).
  These are used in the Roman-letter transcription of the principal
  Islamic (Arabic Script) languages, and probably have a wider
  international currency than many of the obscurer characters in the
  Latin 1 and Extended Latin set.  There are a couple of duplications in
  function, but no ambiguities unless ``s with bar under'' gets
  misused as a substitute for ``s with cedilla''.  Transcriptions concerned
  primarily with the rendering of Arabic Script in an Arabic context
  will use ``t with bar under'' and ``d with bar under'' where in a
  Persian or Turkish context, you are more likely to find ``s with bar
  under'' and ``z with bar under'' as representing {\bf 062B} Arabic
  Letter THEH and {\bf 0630} Arabic Letter THAL respectively.  There are
  literally millions of pages a year published making extensive use of
  the characters I enclose.  I am not sure when the equivalences were
  first agreed on, but it is well back in the 19th century.

  I became aware of the absence of these characters from Unicode when I
  was planning to remap a thoroughly idiosyncratic font table to some
  more general standard.  My first choice was Unicode, but I found to my
  surprise that half the characters I had developed have no identity
  whatsoever in any of the tables.  But every Latin letter periodical
  with a primary interest in the Near Eastern and Islamic studies uses
  these characters intensively, as do most of the monographs and books
  published in the field, and the Central Asiatic Republics in the
  Turkish language regions of the former Soviet Union, who speak with a
  somewhat different phonetic repertory than the Turks of the
  Ankara-based Republic are very likely to adopt some of this set.  I am
  a bit surprised that no one from the Research Libraries Group noted
  the extensive use of this character set in bibliographic references.

  I fear it is probably too late to do much about this now, which is
  rather a pity, since the particular design effort I have been working
  on is part of an attempt to develop a standardized text database using
  SGML.  But I felt it would do no harm to communicate with you about
  the matter, since I really do not know what stage the negotiations
  about Unicode have reached.  


The general idea is to have a distinct mapping file in which
accented characters are given names with purely descriptive
content, and to make that mapping file as easy as possible
to re-edit into an included TeX macro file.  It works
rather nicely, and by using an intermediate stage of
ligatures with a post-positive accent convention, I can
even feed several different ugly word-processor codings
through the same tex input file with only a change in the
small mapping file.  

If you're interested, I'll send a typical mapping file, and
in a short while I'll send the full schema.

Incidentally, it treats all of Don Knuth's computer modern
character descriptions (or any similar ones for that matter)
as read-only sources, to avoid the creation of dialect versions
of well-known characters.

Email concerned with UnixTeX distribution software should be sent primarily
to:	elisabet@max.u.washington.edu           Elizabeth Tachikawa
otherwise to:  mackay@cs.washington.edu		Pierre A. MacKay
Smail:  Northwest Computing Support Center	TUG Site Coordinator for
	Thomson Hall, Mail Stop DR-10		Unix-flavored TeX
	University of Washington
	Seattle, WA 98195
	(206) 543-6259