The web and the word processor

Be the first to comment | 1I like it!
April 13, 2007, 01:02 PM —  ITworld.com — 

In one form or another, I do words for a living. I spend part of my time writing words (as I am doing right now). I also spend part of my time writing complex sets of words (known as 'computer programs') to manage other complex sets of words (known as 'enterprise content').



On an almost daily basis I must address the question of how best to organize content so that it can be managed effectively. On an almost daily basis I find myself oscillating between two distinct-yet-closely related worlds...



On one hand there is the classic word processor world. These things generally provide cozy, self-contained mechanisms for managing words, tables, graphics up to about, say, 500 pages of stuff. Once the document size goes beyond that, things become more complicated. The good news is that most enterprise content can sensibly be managed in units of 500 pages or less. There seems to be two main reasons for this rule-of-thumb limit. First, if you are going to print something to paper you must worry about how to bind it. The thicker the book, the more complex that gets. Second, word processors tend to work by loading the entire document into memory and therefore the memory size of your computer dictates what constitutes a comfortable document size.



On the other hand, there is the less cozy but utterly compelling world of the web and web-oriented editing tools. Web editing tools have a completely different feel to them. A completely different comfort zone. Web pages get unwieldy well south of 500 pages, there is no self-contained environment for managing images along with words. Typographically speaking, they are rather limited compared to modern word processors...



Yet few would argue that the fundamentals of caring and feeding of content are present in both worlds. Why oh why is there such a dichotomy in the tools used? Why - in this day and age - do we have to 'convert' content to Web format? Why - in this day and age - do we have so many problems producing decent printed pages from web browsers?



In more technical language, why do we have XML-based markup languages like ODF and OOXML for managing words when we also have XML/SGML-based markup languages like XHTML/HTML for managing words? Does it make any sense to have two competing approaches? Are these approaches converging technologically or diverging?



Opinions differ of course. My take is that convergence is inevitable. I also have a suggestion for a key piece of the jigsaw puzzle that is currently missing. If it existed, convergence would - in my opinion - proceed faster than it currently is.



The missing piece is that the Web has no concept of a 'collection of small documents that can be edited/browsed/searched as a unit'. What is a 450 page user manual really? It is a collection of smaller documents that have been hooked together into a hierarchical and sequential ordering to create a full work. How do we know this? Because the table of contents makes the internal boundaries/structure

I like it!
Post a comment
The content of this field is kept private and will not be shown publicly.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
Resources
White Paper

Symantec Backup Exec 12 and Backup Exec System Recovery 8 deliver industry leading Windows data protection and system recovery. Download this whitepaper to find out the top reasons to upgrade and how to get continuous data protection and complete system recovery.

Webcast

Data and system loss — from a hard drive failure, malicious attack, natural disaster, or simple human error — can happen anytime. Don’t leave your business vulnerable. Make sure you have a secure recovery strategy in place. Symantec's latest backup and system recovery technology can efficiently restore critical applications, individual emails and documents and even restore your entire system in minutes in the event of a loss.

White Paper

Businesses face a growing challenge to ensure that the IT environment is properly protected. Backup Exec 12 integrates with other applications in the Symantec family of products, to complement your current data protection strategy, keep your data securely backed up and make it recoverable when you need it most.

Free stuff

Enterprise 2.0 Implementation
By Aaron C. Newman, Jeremy Thomas
Published by McGraw-Hill
Learn more!

Deploying Cisco Wide Area Application Services
By Zach Seils, Joel Christner
Published by Cisco Press
Learn more!

Featured Sponsor

AISO founders envisioned a Web hosting company that was environmentally friendly. While the company employed energy-efficient innovations like solar panels, its infrastructure produced unacceptable power and cooling requirements. Find out how AISO leveraged AMD technology to overcome their challenge in this case study white paper.

In this whitepaper, Scalar explores the opportunity to change the landscape with respect to mission critical databases built around Oracle. Leveraging technologies such as Linux, high-end commodity processing power and Oracle RAC technology to architect, design, build and maintain database infrastructure that delivers maximum availability, reliability and performance at a fraction of traditional cost.

On a typical day, weather.com, the Web site for The Weather Channel in Atlanta, serves up between 15 million and 20 million page views. But in September 2004, when back-to-back hurricanes ransacked Florida, the peak traffic on one day more than tripled: over 70 million page views by more than 7 million unique visitors. Read the full success story now.

More Resources