[Unicode] The Unicode Standard Tech Site | Site Map | Search
 

Web Bookmarks for the UnicodeĀ® Standard

Current Practice for Web Bookmarks

Starting with Unicode 16.0, the definitive edition of the core specification is the HTML version. For any given Unicode version, the core specification is accessible via a versioned URL of the form:

https://www.unicode.org/versions/Unicode16.0.0/core-spec/

Version-specific Bookmarks

Stable bookmarks (permalinks) for any chapter, section, table, or figure of the core specification can be extracted directly from the online text. Elements with stable anchors for bookmarks show a "#" symbol in the left margin when they are hovered over. That symbol is the permalink pointing to the element itself, and it can be clicked on or copied from. So, for example, the following bookmark points directly to Section 12.1.3, Rendering Devanagari in Unicode 16.0:

https://www.unicode.org/versions/Unicode16.0.0/core-spec/chapter-12/#G78484

Version-independent Bookmarks

Anchors for bookmarks are maintained across versions of the core specification, so it is also possible to refer to the latest version of the core specification:

https://www.unicode.org/versions/latest/core-spec/

Because the anchors for bookmarks are global IDs in the text, a bookmark to specific text in the latest version omits the chapter directory, just using the anchor at the top level. For example:

https://www.unicode.org/versions/latest/core-spec/#G78484

Using this convention future proofs your bookmark against possible future restructuring of parts of the core specification, which conceivably could split chapters or move some portion of the text into another chapter.

Best Practice for Labeling Links

When labeling links into the latest core specification, it is best practice not to cite specific section, figure, or table numbers, as the exact numbers may change between versions. Thus, for example, the figure illustrating Tamil Two-Part Vowels happens to be Figure 12-17 in Unicode 16.0, so a version-specific link could be labeled as follows:

Figure 12-17, Tamil Two-Part Vowels in Unicode 16.0.

But a version-independent link to that table in the latest core specification would better omit the specific figure number:

The figure Tamil Two-Part Vowels in the Unicode Standard.

Bookmarks for the Archival PDF for the Core Specification

The anchors for chapters, sections, figures, and tables in the core specification also work for the single, archival PDF of the core specification. Because the archival PDF is a single file, and is not broken up into chapter-specific files, the anchors should be suffixed directly to the archival PDF file name. Thus, for example, the Rendering Devanagari section can be bookmarked in the 16.0 archival PDF as follows:

https://www.unicode.org/versions/Unicode16.0.0/UnicodeStandard-16.0.pdf#G78484

Legacy Practice for Web Bookmarks

For Unicode versions prior to Unicode 16.0, and extending back to Unicode 6.0, the PDF versions of the core specification are definitive, and there was no HTML edition available online. Because bookmarks were more difficult to locate and extract from those PDF files, the Unicode Consortium published version-specific bookmarks pages for each version of the core specification. Each bookmarks page lists the contents of that version and has links to the chapters, sections, subsections, figures, and tables of the core specification.

For Unicode 4.0 and 5.0, the published book versions of the standard were definitive, but per chapter PDF files were also published online for those versions. Accordingly, there are also web bookmarks pages for those versions, as well.

Legacy Web Bookmarks Pages

Versions of the Unicode Standard for which the core specification was not published as separate online PDFs, such as Version 5.1, do not have web bookmarks and are therefore not included in this list. Bookmarks for versions prior to Version 6.0 do not include bookmarks for figures or tables.

Bookmarks to sections of the text for specific versions of the Unicode core specification needed to include the specific chapter pdf. Thus, for example, the link for the Rendering Devanagari subsection in Version 11.0 of the core specification is:

https://www.unicode.org/versions/Unicode11.0.0/ch12.pdf#G78484

And the link for the corresponding subsection in Version 6.0 of the core specification is:

https://www.unicode.org/versions/Unicode6.0.0/ch09.pdf#G78484

Note that every attempt is made to keep bookmarks stable between versions — the structure of the core specification is only modified when necessary, and section numbering, figures, and tables are kept as stable as possible. However, section, figure, or table numbering occasionally changes between versions. Even in such cases, the bookmarks still work correctly, but may be pointing to a section with a different number. For example, Dogra in Version 12.0 is Section 15.16, but Dogra in Version 13.0 is Section 15.17. However, in both cases, the actual pdf file for the chapter and the anchor for the bookmark are identical, so the bookmarks will continue to point to the expected material across versions, even though the section numbering has changed.

Redirection of Legacy Latest Bookmarks

In the legacy scheme for core specification bookmarks, it was possible to use the mechanism of replacing the specific version with the "latest" version in the URL, to obtain links that would continue to work across versions. However, those latest links would only work as long as the content they pointed to stayed in the same chapter. Thus in the example above for the Rendering Devanagari subsection, a latest link created in the 6.0 version time frame would start to fail in the 7.0 version time frame (and later), because the Devanagari section moved from Chapter 9 to Chapter 12, as various chapters were split.

Starting with Unicode 16.0, any such legacy latest URLs are automatically redirected to the corresponding content in the current (HTML) version of the core specification. So the following legacy latest URLs are both correctly redirected:

https://www.unicode.org/versions/latest/ch12.pdf#G78484 [created in Unicode 11.0 time frame]

https://www.unicode.org/versions/latest/ch09.pdf#G78484 [created in Unicode 6.0 time frame]


Access to Copyright and terms of use