|
|
Articles |
|
There is no clearly suitable Unicode character to satisfy IUPAC
recommendations to use the dashed vertical bar (
) and double dashed vertical bar (
) glyphs for drawing line representations of
electrochemical cells. New Unicode characters are recommended.
While continuing the project to transcribe Professor
|
The unusual characters are “dashed vertical bar” ( ) and “double dashed vertical bars” (
)
specified in the current IUPAC Compendium on Analytical Nomenclature
(a.k.a. “Orange Book”) (see Ref. 5.1). The
characters are specified in Section 1.3.10, Conventions converning
the signs of electric potential differences, electromotive forces, and
electrode potential. A relevant excerpt from the online version of
the Orange Book is shown in Figure 2.
|
From this excerpt it is obvious that the glyph Devoe uses differs from
that shown in the online version of the Orange Book. This is because
IUPAC's PDF file utilizes a glyph that appears to be the one used for
the Unicode U+00A6 BROKEN BAR character (¦) to
represent the dashed vertical bar. However, a glyph more closely
representing DeVoe's glyph appears in IUPAC's 1975 Manual of
Symbols and Terminology for Physicochemical Quantities and Units,
Appendix III (see Figure 3, Ref. 5.2).
Unfortunately, the scan quality of the 1975 document available on the
IUPAC website is not very high. Despite that difficulty the dashed
vertical bar glyph is shown to consist of five vertical line segments,
in contrast to the U+00A6 BROKEN BAR's two (¦).
In Figure 4, I drew my interpretation of the “dashed
vertical bar” ( ), “double dashed
vertical bars” (
) glyphs alongside the typical ASCII vertical line
glyph; all three characters should have the same glyph height.
Note: For clarity, this document uses my vector
drawings when representing the missing glyphs in parentheses (e.g.
and
). I had to go out of my way to create custom
macros in TeXmacs to make them visible; they are rendered as PNG
images when this document is exported to HTML format.
|
Additionally, while searching IUPAC literature mentioning electrochemistry notation I found that drafts of some chapters of the new edition of the Orange Book are available. A draft of the chapter covering galvanic cell diagrams was published in Pure and Applied Chemistry (see reference 5.3). This draft continues the current Orange Book's use of the typical BROKEN BAR glyph (¦) to represent the missing dashed vertical bar character (see Figure 5).
|
Lately as I've been transcribing DeVoe's Thermodynamics and Chemistry, my modus operandi after encountering an unusual glyph is to first check TeXmacs's extensive coverage of math symbols. Failing that, I search websites such as unicode-table.com for similar Unicode glyphs. I'll now summarize my hunt to search for appropriate characters to represent “dashed vertical bar” and “double dashed vertical bars”.
The closest symbol I could find in TeXmacs that approximates a “dashed vertical bar” is the vertical elipsis . Using this symbol as a substitute has the advantage of being available for quick entry via a keyboard shortcut (. . Tab Tab Tab) instead of inserting directly via Unicode point (control+q # 2 2 e e). However, typical glyphs used to represent the vertical elipsis symbol usually consist of three dots instead of line segments. I cannot find a symbol in TeXmacs consisting of vertical line segments. I could write a custom macro that constructs the symbol from other symbols or create a small drawing (which is what DeVoe did in the LaTeX source code for Thermodynamics and Chemistry). However, custom macros reduce compatibility when a document must be exported to other formats; for example, I am editing this article in TeXmacs for possible export to PDF but the reader is likely reading this article in HTML format. Ideally, the “dashed vertical bar” glyph would be associated with its own Unicode character with a code point that both TeXmacs and web browsers parsing this article's HTML version could understand.
Each Unicode character is visually represented by a glyph. To quote a reference page on glyphsapp.com :
Characters are what you type, glyphs are what you see.
To be more verbose, each Unicode character has a unique code point (e.g. U+007C) which is associated with a character name (e.g. VERTICAL LINE) and a glyph (e.g. |). A glyph can be shared by several Unicode characters but the typesetting rules a program may apply to each character may differ.
I scanned unicode-table.com for characters with glyphs that match both DeVoe's custom glyph and anything that might match the description “dashed vertical bar”.
I also performed a search of unicode-search.net for glyphs containing the string “VERTICAL” in their descriptions. This yielded more results.
I also checked the Unicode “Mathematical Symbols” code charts for possibly useful glyphs. Note, there are no code sets dedicated to modern chemistry although there is a set for alchemical symbols.
Table 1 lists various Unicode characters relevant to my search that I found.
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Table 1. Unicode characters with possible uses in electrochemistry diagrams. The “Potential use in electrochemistry notation” column definitions are taken from the current draft of the 4th edition of the IUPAC Orange Book (see Figure 5). These glyph definitions are:
|
Some characters and glyphs from the “C0 Controls and Basic
Latin”, “C1 Controls and Latin-1 Supplement”, and
“General Punctuation” categories may be useful as-is. For
example, as mentioned earlier, U+00A6 BROKEN BAR
(¦) and U+007C VERTICAL LINE (|) have glyphs
which are already used by IUPAC (see Figure 2); VERTICAL
LINE (|) represents a phase boundary and BROKEN BAR
(¦) represents a miscible liquid boundary. However, the BROKEN BAR glyph does not closely match the “dashed
vertical bar” glyph ( ) recommended by
IUPAC in the 1975 document predating the Unicode standard (see Figure 3).
The “Box Drawing” category covers characters used in command-line interfaces that use glyphs to draw lines for window-like graphical environments. Several of the glyphs, such as U+250A, BOX DRAWINGS LIGHT QUADRUPLE DASH VERTICAL (┊), are visually similar to the 1975 IUPAC recommended glyphs (see Figure 3). However, the characters themselves (the code point and idea the glyph represents) are meant to be used in a monospace environment (see Figure 6) with no kerning. Kerning is how glyphs are spaced between one another and is important for readability of math equations. For example, using several U+2502 BOX DRAWINGS LIGHT VERTICAL (│) characters in a row in this paragraph without space characters in-between results in: ││││││││. In contrast, using several U+007C VERTICAL LINE (|) characters in a row results in: ||||||||. The two characters may use visually similar glyphs but their kerning rules may differ.
|
Music characters in Table 1 have glyphs that may be useful. In TeXmacs, the vertical bars render with almost no horizontal spacing. However, like the box drawing characters, the character code points themselves should not be used in math or chemistry equations.
IUPAC's recommendation to use “dashed vertical bar” ( ) and “double dashed vertical bars” (
)
glyphs (as early as 1975, see Ref. 5.2) predates the
Unicode standard (first published in October 1991, see Ref. 5.4).
So the Unicode Consortium could have added characters with such glyphs
had IUPAC requested it. I can find little correspondance on the unicode.org website mentioning IUPAC beyond
clarification about how to spell sulfur/sulphur, a superscript
comma issue that could be solved with MathML, and how to name some elements in chinese. The absence of an
appropriate character and glyph in a mathematics-related code set may be
the result of inaction on the part of IUPAC members. This is not
surprising since most characters used in chemistry publications are
present in Unicode. For example, the unusual glyph
is regularly used in chemistry textbooks to indicate a reversible
reaction like so:
The glyph is used by the Unicode character U+21CC RIGHTWARDS HARPOON OVER LEFTWARDS HARPOON.
Other non-ASCII glyphs in this example chemical equation may include:
U+0394 GREEK CAPITAL LETTER DELTA (Δ): Indicates a change in variable .
U+2218 RING OPERATOR (∘): Indicates that Gibbs free energy, , is measured at some standard condition.
Since the upcoming version of IUPAC's Orange Book continues to reference the missing glyphs (see Ref. 5.3), I believe Unicode should add the characters in Table 2.
|
|||||||||||||||
Table 2. Characters to be added to Unicode to satisfy IUPAC Orange Book recommendations for drawing galvanic cell diagrams. For lack of appropriate glyphs, the musical glyph U+1D104 MUSICAL SYMBOL DASHED BARLINE (𝄄) was modified (see EPS file) to construct both. When created, the new glyphs should match the height of U+007C VERTICAL LINE glyph (|); see Fig 4. |
and
are two glyphs recommended by IUPAC for line
representations of electrochemical cells since at least 1975 yet have no
corresponding Unicode character. The missing glyphs are depicted in
Table 2. I recommend two new Unicode characters be added
incorporating these missing glyphs.
If you, the reader, are aware of some upcoming change to Unicode or some solution that already exists that supplies the missing glyphs I would ask you to notify me (Twitter, Email, etc.).
IUPAC Compendium on Analytical Nomenclature, Definitive Rules 1997, 3rd Edition, IUPAC Orange Book, prepared for publication by J. Inczedy, T. Lengyel, and A.M. Ure, Blackwell Science, 1998 [ISBN 0-632-05127-2]
Paul, Martin A. Manual of Symbols and Terminology for Physicochemical Quantities and Units. London: Butterworths, 1975. Appendix III. Print. OCLC: 2299040. Archive link.
Pingarrón, José M., Labuda, Ján, Barek, Jiří, Brett, Christopher M. A., Camões, Maria Filomena, Fojta, Miroslav and Hibbert, D. Brynn. “Terminology of electrochemical methods of analysis (IUPAC Recommendations 2019)” Pure and Applied Chemistry, vol. 92, no. 4, 2020, pp. 641-694. https://doi.org/10.1515/pac-2018-0109
History of Unicode Release and Publication Dates. Accessed 2020-06-17. unicode.org.