<?xml version='1.0'?>
<!DOCTYPE slideshow SYSTEM "slide.dtd" >
<slideshow>
  
  <showhead>XML and Unicode: Implications for Global Publishing</showhead>
  
  <showtitleslide>
    <title><line>XML and Unicode: Implications for Global
	Publishing</line></title>
    <subtitle>
      <line>Thirteenth International Unicode Conference</line>
    </subtitle>
    <subtitle>
      <line>San Jose, September 1998</line>
    </subtitle>
    <subtitle>
      <line>Jon Bosak</line>
      <line>Sun Microsystems</line>
    </subtitle>
  </showtitleslide>
  
  <slidemodule>
    <modhead>XML Basics</modhead>

    <moduletitleslide>
      <title><line>XML Basics</line></title>
    </moduletitleslide>
    
    <slide>
      <slidehead>What is XML?</slidehead>
      <bulletlist>
        <item><p>Extensible Markup Language</p></item>
        <item><p>An activity of the World Wide Web Consortium
	    (W3C) organized by Sun to put SGML on the World Wide Web</p></item>
        <item><p>Will create new data-centric Web applications</p>
	  <bulletsublist>
	    <item><p>Database exchange</p></item>
	    <item><p>Distribution of processing to clients</p></item>
	    <item><p>Client-side manipulation of views into the
	    data</p></item>
	    <item><p>Customization of information by intelligent
		agents</p></item>
	    <item><p>Management of document collections</p></item>
	  </bulletsublist>
	</item>
        <item><p>Will fundamentally change publishing on the web and
	    then publishing in general</p></item>
      </bulletlist>
    </slide>
    
    <slide>
      <slidehead>Who developed XML?</slidehead>
      <p>The XML specification was developed from 1996 through 1998 by
	a wide-ranging group of markup language experts from industry
	and academia.</p>
      <bulletlist>
        <item><p><emph>Computer industry:</emph> Sun Microsystems,
	    Hewlett-Packard, Microsoft, Netscape, Adobe, Fuji
	    Xerox</p></item>
        <item><p><emph>SGML vendors and system integrators:</emph>
	    ArborText, Inso, SoftQuad, Grif, Isogen, Texcel</p></item>
        <item><p><emph>Academic and research community:</emph> Text Encoding
	    Initiative (TEI), NCSA, James Clark</p></item>
        <item><p><emph>Early adopters:</emph> DataChannel, Vignette</p></item>
        <item><p><emph>Recent additions (after XML 1.0):</emph> IBM, Oracle,
	    Omnimark, Lotus, Xerox, SAP, Merrill Lynch, ...</p></item>
      </bulletlist>
    </slide>

    <slide>
      <slidehead>Where is XML going?</slidehead>
        <p>It is now generally agreed that Web content will be
            managed using standards based on XML.</p>
      <p>Key predictions for the future:</p>
      <numberedlist>
        <item><p>XML will certainly be the basis on which future Web standards
	    are built.</p></item>
        <item><p>XML will probably become the universal serial format
	    for structured data.</p></item>
        <item><p>XML will almost certainly become the basis for 
	    international publishing.</p></item>
        <item><p>XML may replace all existing word processing and
	    desktop publishing formats.</p></item>
      </numberedlist>
    </slide>
    
    <slide>
      <slidehead>The XML concept</slidehead>
      <image src='concept.gif'>[Presentation generated from
	content+structure]
      </image>
      <p>XML allows you to specify the content and structure of a document
	in a way that lets you generate particular presentations
	as needed.</p>
    </slide>
    
    <slide>
      <slidehead>Proof of concept: this presentation</slidehead>
      <p><i>(These are links in the online version.)</i></p>
      <bulletlist>
        <item><p>The <hlink href="show.xml">XML source</hlink> from which this
	      presentation was produced</p></item>
        <item><p>The optional <hlink href="slide.dtd">XML DTD</hlink>
	    used to validate the XML source</p></item>
        <item><p>The <hlink href="hslide.dsl">DSSSL style
	      sheet</hlink> for the HTML used in the online
	    version</p></item>
        <item><p>The <hlink href="pslide.dsl">DSSSL style
	      sheet</hlink> for the RTF used in the printed
	    version</p></item>
        <item><p>The <hlink href="http://www.jclark.com/jade">Jade
	      DSSSL engine</hlink> used to produce both the HTML and
	    RTF files</p></item>
        <item><p>A <hlink href="show.ps">PostScript version</hlink> of this
	      presentation produced from Jade's RTF output</p></item>
      </bulletlist>
    </slide>

    <slide>
      <slidehead>Why take this approach?</slidehead>
      <p>Separating content and structure from presentation and
	behavior makes possible</p>
      <bulletlist>
        <item><p>Reusable information</p></item>
        <item><p>Media-independent publishing</p></item>
        <item><p>One-on-one marketing</p></item>
        <item><p>Intelligent downstream document processing</p></item>
        <item><p>Large-scale information management</p></item>
      </bulletlist>
    </slide>
  </slidemodule>    

  <slidemodule>
    <modhead>XML Internationalization</modhead>

    <moduletitleslide>
      <title><line>XML Internationalization</line></title>
    </moduletitleslide>

    <slide>
      <slidehead>XML and Unicode</slidehead>
      <bulletlist>
        <item><p>XML has been based on Unicode from Day One</p>
	  <bulletsublist>
	    <item><p>There is nothing in an XML file but Unicode
                characters</p></item>
	    <item><p>Unicode is used for both content and
		markup (can mix languages, even in tag names)</p></item>
	    <item><p>XML tools <emph>must</emph> support both UTF-8 and
		UTF-16</p></item>
	  </bulletsublist>
	</item>
        <item><p>The widespread adoption of XML for data management
	  and electronic commerce 
          will probably make Unicode support universal</p></item>
      </bulletlist>
    </slide>

    <slide>
      <slidehead>Unicode example: an international bookstore</slidehead>
      <image src='bcat.gif'>[Japanese book catalog]</image>
    </slide>

    <slide>
      <slidehead>With stylesheet for reader of Japanese</slidehead>
      <image src='bcat-j.gif'>[Catalog rendered for reader of
	Japanese]</image>
    </slide>

    <slide>
      <slidehead>With stylesheet for reader of English</slidehead>
      <image src='bcat-e.gif'>[Catalog rendered for reader of
	English]</image>
    </slide>

    <slide>
      <slidehead>Source files for the bookstore example</slidehead>
      <p><i>(These are links in the online version.)</i></p>
      <bulletlist>
        <item><p>The <hlink href="bcat.uc.txt">UTF-16 XML
	      source</hlink> from which the different versions were
	    produced</p></item>
        <item><p>The <hlink href="ssj-dsl.uc">UTF-16 DSSSL style
	      sheet</hlink> used to produce the version for the reader
	    of Japanese</p></item>
        <item><p>The <hlink href="sse-dsl.uc">UTF-16 DSSSL style
	      sheet</hlink> used to produce the version for the reader
	    of English</p></item>
        <item><p>The <hlink href="http://www.jclark.com/jade">Jade
	      DSSSL engine</hlink> used to produce RTF files from the
	    source and the style sheets</p></item>
        <item><p>The <hlink href="bcat-j.rtf">UTF-16 RTF file</hlink>
	    for the reader of Japanese (font association done in Word
	    7)</p></item>
        <item><p>The <hlink href="bcat-e.rtf">UTF-16 RTF file</hlink>
	    for the reader of English (font association done in Word
	    7)</p></item>
      </bulletlist>
    </slide>

    <slide>
      <slidehead>Lessons from the example</slidehead>
      <bulletlist>
        <item><p>The catalog example shows that the distinction
	    between data exchange and publishing is ultimately an
	    artificial one (the same source would also be used to
	    create the printed catalog)</p></item>
        <item><p>The rendition in each case occurs <emph>on the
	      web client</emph></p></item>
        <item><p>The database owner can publish <emph>a single data
	      stream</emph> to the entire world</p></item>
        <item><p>Consider the alternative:</p>
	  <bulletsublist>
	    <item><p>Generation of a different HTML output stream for
		<emph>every possible</emph> user and target
		platform</p></item>
	    <item><p>Much greater load on the server</p></item>
	    <item><p>No user autonomy</p></item>
	  </bulletsublist>
	</item>
      </bulletlist>
    </slide>

  </slidemodule>

  <slidemodule>
    <modhead>XML and Standardization</modhead>

    <moduletitleslide>
      <title><line>XML and Standardization</line></title>
    </moduletitleslide>

    <slide>
      <slidehead>The XML accomplishment</slidehead>
        <p>XML is a <emph>big open-standards victory</emph> for users.</p>
      <bulletlist>
	<item><p>Freely extensible</p>
          <bulletsublist>
	    <item><p>No tag name limitations</p></item>
	    <item><p>No language limitations</p></item>
          </bulletsublist>
	</item>
	<item><p>Human-readable</p>
	  <bulletsublist>
	    <item><p>Can maintain data using basic text
		tools like sed, awk, perl, Word macros</p></item>
          </bulletsublist>
	</item>
	<item><p>Open standard</p>
	  <bulletsublist>
	    <item><p>In theory, XML users can't be held hostage to
		vendor control</p></item>
          </bulletsublist>
	</item>
	<item><p>Easy to implement</p>
	  <bulletsublist>
	    <item><p>There will be many powerful, cheap, off-the-shelf
		commercial XML tools</p></item>
	    <item><p>There is already an ever-growing set of free XML
		tools (almost all of them Java-based)</p></item>
          </bulletsublist>
	</item>
      </bulletlist>
    </slide>

    <slide>
      <slidehead>Current W3C XML work</slidehead>
            <p>Work on the XML family of standards is taking place
            in six different
            W3C working groups:</p>
      <bulletlist>
        <item><p><emph>XML Linking WG:</emph> 
            Developing next-generation
	    hypertext mechanisms</p>
        </item>
        <item><p><emph>XML Schema WG:</emph> 
            Developing a DTD alternative that will support 
            inheritance and data typing
          </p>
        </item>
        <item><p><emph>XML Fragments WG:</emph> 
            Developing a method for transmitting XML fragments
          </p>
        </item>
        <item><p><emph>XML Information Set WG:</emph> 
            Defining the XML objects that are available
            to applications
          </p>
        </item>
        <item><p><emph>XML Syntax WG:</emph> 
            Investigating XML simplification and style sheet
            linking
          </p>
        </item>
        <item><p><emph>XSL (Extensible Stylesheet Language) WG:</emph> 
            Developing an industrial-strength internationalized
	    stylesheet language for XML uses on and off the Web</p>
        </item>
      </bulletlist>
    </slide>
  </slidemodule>
  
  <slidemodule>
    <modhead>XML and Global Publishing</modhead>
    
    <moduletitleslide>
      <title><line>XML and Global Publishing</line></title>
    </moduletitleslide>

    <slide>
      <slidehead>Beyond a narrow interpretation of XML</slidehead>
      <numberedlist>
        <item><p>XML is not just about exchanging data between
	    machines</p>
	  <bulletsublist>
	    <item><p>It's also about communication between humans</p></item>
	  </bulletsublist>
	</item>
        <item><p>XML is not just about the web</p>
	  <bulletsublist>
	    <item><p>It's about information in general</p></item>
	  </bulletsublist>
	</item>
        <item><p>XML is not just about technology</p>
	  <bulletsublist>
	    <item><p>It's also about the relationship between content
		creators and software vendors</p></item>
	  </bulletsublist>
	</item>
      </numberedlist>
    </slide>

    <slide>
      <slidehead>XML inherits the social agenda of SGML</slidehead>
      <p>The social agenda of SGML has always been about
	<emph>freedom from third-party control of content</emph>.</p>
      <bulletlist>
        <item><p>Freedom from proprietary data formats</p></item>
        <item><p>Vendor neutrality</p></item>
        <item><p>Platform neutrality</p></item>
        <item><p>Language neutrality</p></item>
      </bulletlist>
      <p>Its foundation on Unicode adds <emph>practical 
        internationalization</emph>
        to the implicit social agenda.</p>
    </slide>

    <slide>
      <slidehead>Next phase of the revolution: XSL</slidehead>
      <p>Large-scale cross-platform XML publishing demands that XML
	deliver on the <i>display-oriented</i> promises:</p>
      <bulletlist>
        <item><p>User-configurable views</p></item>
        <item><p>More powerful display-centric client-side
	    applications</p></item>
        <item><p>Media-independent publishing</p></item>
	<item><p>In particular, printed and online deliverables from
	    the same source</p></item>
	<item><p>Asian-language rendering support</p></item>
      </bulletlist>
      <p>XSL is intended to complete the internationalized
	media-independent publishing story.</p>
    </slide>

    <slide>
      <slidehead>Implications of XML+XSL for I18N</slidehead>
      <bulletlist>
        <item><p>Properly done (like DSSSL), XSL enables XML to serve
          as an international publishing format</p></item>

        <item><p>XML+XSL could allow the whole operation to be based
          end-to-end
          on a uniformly processible Unicode text stream</p>
	  <bulletsublist>
	    <item><p>Data</p></item>
	    <item><p>Stylesheets</p></item>
	    <item><p>Formatting objects</p></item>
	    <item><p>Instructions to the formatter</p></item>
	  </bulletsublist>
	</item>
        <item><p>A mass transition to XML+XSL
          will change the whole translation business</p>
	  <bulletsublist>
	    <item><p>Data human-readable and freely
              interchangeable</p></item>
	    <item><p>Tools cheap and freely interoperable</p></item>
	    <item><p>Qualified personnel available everywhere</p></item>
	    <item><p>A <emph>common infrastructure</emph></p></item>
	  </bulletsublist>
	</item>
      </bulletlist>
    </slide>

    <slide>
      <slidehead>The larger implications of XML+XSL</slidehead>
      <p>The combination of XML and XSL can replace all existing
	word-processing and publishing formats.</p>
      <bulletlist>
        <item><p>A single format for both print and online
	    publishing</p></item>
        <item><p>A single format across different products</p></item>
        <item><p>A single format for all languages</p></item>
      </bulletlist>
      <p><emph>What does this mean?</emph></p>
      <bulletlist>
	<item><p>Users no longer tied to a proprietary
	    format</p></item>
        <item><p>A change in the relationship between software vendors
	    and customers</p></item>
        <item><p>An end to control of the market by a few big
	    companies</p></item>
        <item><p>An end to control of the market by a few big
	    countries</p></item>
      </bulletlist>
    </slide>

    <slide>
      <slidehead>What can go wrong?</slidehead>
      <p>The complete implementation of XML and XSL means an end to
	control of users through proprietary formats.</p>
      <p><emph>Companies that have built their business models on
	  proprietary formats can be expected to resist this.</emph></p>
      <p>The most obvious ways to subvert the user-empowerment agenda
	of XML are:</p>
      <bulletlist>
        <item><p>Vendor control of standard schemas</p></item>
        <item><p>Incomplete Unicode and XSL support</p></item>
      </bulletlist>
    </slide>

    <slide>
      <slidehead>Getting to global publishing</slidehead>
      <bulletlist>
        <item><p>Beware of strategies that limit XML to the role of
	    middleware; it's more than that</p></item>
        <item><p>Support the vendor-independent development of
	    standard schemas (DTDs) and namespaces</p></item>
        <item><p>Beware of sleazy dodges: nonstandard extensions,
            incomplete UTF-8/UTF-16 support</p></item>
        <item><p>Insist on real XSL support: the ability to render
	    formatting objects, not just HTML tags</p></item>
        <item><p>W3C members: work within W3C to make sure that 
            the requirements for international publishing are met</p></item>
        <item><p>Support platform-independent tools vendors</p></item>
        <item><p>Support the only organization dedicated to
	    interoperable document standards in general: <hlink
	      href="http://www.oasis-open.org">OASIS</hlink>
	    (http://www.oasis-open.org)</p></item>
      </bulletlist>
    </slide>

    <slide>
      <slidehead>The payoff for XML and Unicode</slidehead>
      <bulletlist>
        <item><p>Interoperability of both content and style</p></item>
        <item><p>Freedom from vendor control of our data</p></item>
        <item><p>Creator control of markup depth</p></item>
        <item><p>User control of views into the data</p></item>
        <item><p>A level playing field for independent software
	    developers</p></item>
        <item><p>A common infrastructure for translation</p></item>
        <item><p>True international publishing across all media</p></item>
      </bulletlist>
    </slide>

  </slidemodule>
</slideshow>
