Technical Writing Quote of the MomentFew people have the imagination for reality. Who's onlineThere are currently 2 users and 33 guests online.
Online users
Latest Classified Ad |
IndividualsWhat’s All This Talk About Intelligent Content?By Ann Rockley, The Rockley Group Content has often been managed as documents. Metadata for search and retrieval has become more and more important as the amount of content has increased. In recent years with the increased interest in the use of Extensible Markup Language (XML) for content creation and the rising popularity of the Darwin Information Typing Architecture (DITA) XML standard, content has begun to move from unstructured documents to structured XML-based component-based content. And with the advent of XQuery, an XML query language that searches on the structure of content, then manipulates and renders content, we can do so much more than just full-text searching. We’ve gone from documents which are “black boxes” to intelligent content which is structurally rich and semantically aware, and is therefore automatically discoverable, reusable, reconfigurable and adaptable. Let’s look at the definition for intelligent content in more detail: Structurally Rich This means that the content is structured content and more importantly it is semantically structured content, content where the structure has meaning. We could look at something as simple as a whitepaper which could include a structure like (executive summary, introduction, discussion and conclusion) or a marketing brochure that could include a structure like (positioning statement, value proposition, features and benefits). Or we could have content which follows a standard like DITA or DocBook. But each of these structures enable us to understand what content we have so that we could search within positioning statements only, or pull out that statement and use it in another type of content. If we have a structure in our content we can manipulate it. For example we can automatically determine how to publish it to multiple channels (print, web, mobile) or we can filter out some content (e.g., tables may not work as well in the mobile environment). Also if it is structurally rich we can perform searches or narrow our search to the particular type of information we are interested in (e.g., look for all occurrences of the word like high definition in positioning statements). When it is structurally rich we can do so much more. Semantically Aware The word semantic means “meaning”. Semantically aware content is content which has been tagged with metadata to identify the kind of content it is. For example, you might tag your content with industry sector, role or audience, and product information. If it is tagged with semantic metadata it is possible to automatically build customized information sets based on audience or industry for example. As more organizations start to create personalized content (content which is dynamically assembled on demand that specifically matches a users need, behavior or user profile) this type of metadata becomes really important. In addition, as content is pushed to wikis, integrated through mashups or pipes it becomes even more important to ensure that our content is semantically tagged. Without semantic metadata it is very difficult to automatically, let alone manually, find the content we need. Discoverable If the content has semantic tags and is structurally rich it is a whole lot easier to find exactly what we are looking for. And when it is structurally rich, and assuming our content is in XML, we can use XQuery to question the structure of the content to find specific information. Then when we add semantic tagging to the content we have a great deal of information that will allow us to zero in on exactly the content we are looking for. You’ve heard of data mining? Well, now we can do content mining! Sure, we could have done this with regular old unstructured content, but we would have to develop (and maintain) highly complex algorithms to interpret the content; with intelligent content, discovering relevant content is much much easier. Reusable Reusable content, content which is created once and used many times throughout an information set, has been used for years in technical documentation, but its popularity is quickly moving into business documents like marketing materials, proposals, contracts, and policies and procedures. We can create modular structured content that can either be easily retrieved for opportunistic reuse (manual reuse) or automatically retrieved for systematic reuse (automatic reuse). Reconfigurable Structured content is content separate from format, in other words the look-and-feel of the content is not embedded in the content. That makes it very powerful. Knowing the structure of the content, we can output it to multiple channels, reconfiguring it to best meet the needs of the channel, or we can automatically mix-and-match content to provide us with the information our customers need. We can even transform content (reconfigure it) from one structure to another (mobile online help, print publication, website knowledge center), but only if we know what the structure is in the first place. Adaptable We frequently create our content for a particular need or audience, but content can be adapted (used in a different way), often without our knowledge, to meet a new need. Think of mashups, we don’t know how our content is being aggregated, but it can be easily reused by both humans and computers because we have structured and tagged it intelligently. A Few Scenarios The following are a few scenarios that illustrate intelligent content. Customization: Mobile Phones A major cell phone manufacturer and distributor produces over 100 different phones. They range from simple handsets to highly capable models designed to support email, video, and conferencing applications. While each handset has a unique market position, there are numerous features that are common across handsets, for example, texting. The description of texting is the same no matter what model of phone we examine, but each handset may require different key sequences, images, key names, and so on. In addition to handset configurations, there are regional differences that determine the features a particular handset will support, as well as region-specific safety standards and language requirements. Some of the handsets are made available through specific carriers (mobile phone sales partners, for instance) requiring different contact information and branding. One component of information could have as many as 500 variations!
To reach their goal they make the content intelligent to facilitate automatic builds and content filtering by:
By utilizing intelligent content, they reduced translation costs, and optimized processes. And, they are actually making more money because they can now provide highly-relevant, personalized content, which has led to an increase sales. Personalization of Healthcare Insurance Program People have been talking about personalized content for years, but often back off because creating personalized content is a lot of work with traditional content. Not so with intelligent content. A Health Maintenance Organization (HMO) provides health insurance for 100’s of companies with thousands of policyholders. The HMO builds a self-serve site to enable employees to review coverage, submit claims and get customized health and wellness information.
Personalization is supported by intelligent content through:
Dynamic Delivery of Investment Information A financial services company has been producing content for its investors for both the web and print. There are daily News and Notes to keep investors informed of key breaking news, weekly reports to summarize a an area of particular interest, monthly reports and annual reports. The reports have always been produced as PDF while the News and Notes are web-based. There is more than a decade worth of content squirreled-away on the web and in fileservers throughout the organization. This pattern of content delivery has been pretty effective until now. With the economic melt down, investors are clamoring for content daily—even hourly—and they want more than just the information the financial services company can provide. To calm investors and to provide as broad a swath of high-quality content as possible for investors who have decided to stay in for the long haul, they decide to change their paradigm and offer personalized dynamic content delivered automatically to their investors. They already have much of their content in XML and what they don’t they decide to convert to XML. They also set up RSS and other feeds to ensure that content can be gathered from multiple sources, incorporated with their own information, and automatically delivered to investors. They built a set of profiles of their investors and provide a simple way for investors to indicate the types of information they would like to receive. Some investors choose to see an historical perspective to see what has happened in previous periods of market instability, and all have asked for hourly updates. Investors can search based on specific criteria and assemble customized views of the information that meet their specific investment interests. New information coming into the investment services company is captured, converted to XML, searched and retrieved based on specific criteria, assembled and transformed, then delivered to investors.
Dynamic delivery is supported by intelligent content through:
Who is Using Intelligent Content? There are a number of industries that are making use of intelligent content. Companies whose product is content (publishing and media companies, for example) have begun to adopt intelligent content as a methodology for moving away from their traditional print products to a truly multi-channel, and often personalized content offering. Companies who produce huge volumes of content such as life sciences (pharmaceutical, medical device, and health and hospital organizations) and financial companies (insurance, investments, banking) use intelligent content to optimize access and retrieval. The high technology industry has been moving towards intelligent content for a number of years, but are not yet making a lot of use of metadata and personalization. Government is starting to use intelligent content to manage and deliver legislative content. Benefits of Intelligent Content
There are many benefits of intelligent content. We can:
And…
Learning More About Intelligent Content Creating intelligent content is an emerging practice—one that holds tremendous promise for streamlining business processes, improving service, reducing expenses, and empowering content consumers. If you’re interested in learning more about how to create, manage and deliver intelligent content, consider attending Intelligent Content 2009 in Palm Springs, CA. This intimate educational event—brought to you by The Content Wrangler and The Rockley Group, takes place January 29-30, 2009 at Le Parker Méridien Palm Springs. Early bird discount tickets are still available. Register today to reserve your seat! About Ann Rockley Ann Rockley is President of The Rockley Group, a consultancy that has an international reputation for developing customer-centric enterprise content management strategies and underlying information architecture. Rockley is a frequent contributor to trade and industry publications and a featured speaker at numerous conferences in North America and Europe. She has been instrumental in establishing the field in online documentation, single sourcing (content reuse), unified content strategies, and content management best practices. Rockley is co-chair of the OASIS DITA Enterprise Business Documents Subcommittee. Rockley led Content Management Professionals to a prestigious eContent 100 award in 2005. Rockley is a Fellow of the Society for Technical Communication and has a Master of Information Science from the University of Toronto. Rockley is the co-author of the best-selling book, Managing Enterprise Content: A Unified Content Strategy (New Riders Publishing ISBN 0-7357-1306-5). Categories: Individuals, Interesting and Relevant Sites
PowerXEditor Eases Online Collaborative Authoring and WorkflowBy Rahel Bailie, special to The Content Wrangler
As the market develops for XML tools that work for a growing number of user types in a variety of industries, we are starting to see creative products come on the market. One need that is becoming more common is for the average business author – that is, someone who is not a technical writer and hasn’t studied the craft of writing – to create documents that validate to an XML DTD.
The PowerXEditor authoring tool is meant for use in publishing environments where writing is more than an occasional activity. For an organization to invest in software that unifies the tasks of authoring with workflow that takes that content through its review/revision and approval stages, the writing is likely high-value content that is published to strict standards, and involves authors who are subject matter experts (SME), and a publishing process that includes roles such as editors, art directors, and so on.
The .NET framework, combined with the power of AJAX and Java, means that the browser-based application is cross-platform and cross-browser compatible. There is no software to download or install, and the controls are simple enough for any author to quickly learn. The content remains format-neutral – in other words, in XML – so there are none of the barriers posed by proprietary systems. PowerXEditor can integrate with a content management system or other workflow system through its application programming interface. Though a Microsoft SQL database controls content and workflow, the XML editor can be integrated with other systems.
The product is meant for multiple publishing channels, with a strong supports built in for publishers of print. In addition to standard features such as word count, PowerXEditor feeds high-end composition from InDesign and Quark to LaTeX. Its mini-castoff calculator provides a much more accurate idea of page layout restrictions early on, supporting editorial staff in their decisions about how to judge the amount of content in relation to the available number of pages.
For more information about PowerXEditor or the entire Aptara PowerSuite, contact one of the Aptara offices, at http://www.aptaracorp.com, or contact them at info@aptaracorp.com.
About Rahel Bailie
Categories: Individuals, Interesting and Relevant Sites
The Power of the Crowd: Finding DITA Resources and InformationYou can find a wide variety of groups that focus on DITA on The Content Wrangler Community, the global network for content professionals. Membership is free and requires online registration. Each group is moderated by an industry expert. Some of the most popular groups include:
Categories: Individuals, Interesting and Relevant Sites
DITA Metrics: Developing Cost MetricsBy Mark Lewis, special to The Content Wrangler Table of Contents
Introduction
You’ve read all the papers (and attended all the webinars) on return on investment (ROI) for XML and you get it. You’ve already concluded that moving to the Darwin Information Typing Architecture (DITA) will likely save you tons of time and money. But management says, “Prove it!”. This paper helps you determine the cost portion of the ROI calculation. What are my costs now? What will my new costs be with DITA? And what is the difference—my savings? This white paper is the first in the DITA Metrics series. The series will discuss cost metrics, reuse metrics, and a reuse strategy. This paper is the first in the DITA Metrics series. It describes one model for calculating the cost of a DITA project. After doing some content analysis on your own documentation set, you can customize this cost model to suit your documentation project needs. In the end, you should be able to speak the financial language of managers and prove to them in dollar signs the value of moving to DITA. To benefit from this article, you should have at least an intermediate level understanding of DITA including topic structure, elements, conrefs, child maps, and filtering/conditional processing. For your convenience, we’ve provided a downloadable PDF of this article. Cost Metrics OverviewThe cost to develop content and reuse percent values are standard components in many ROI calculations. We need the cost and reuse values for a DITA project to determine DITA ROI. This paper focuses on the cost of content creation and introduces various levels of reuse into the model. We’ll begin with a deep dive into the cost of creating DITA topics and then incorporate the cost of unique content, identical content and similar content. Over the course of the series, we will discuss the following components of our cost model:
Let’s show how our model is used to determine the cost of creating user guides for three models of a fictitious personal digital assistant (PDA). PDA One has a base set of features. PDA Two has all the features of PDA One, plus several additional features. PDA Three has all the features of PDA One and PDA Two, plus several features unique to PDA Three. This documentation project contains lots of identical content and unique content. The first version of our documentation project is relatively simple so that the associated cost model is also simple. Later, we’ll introduce more of the content reuse and conditional reuse features of DITA, and show how to incorporate these into the model. Gradually, we’ll increase the complexity of our project and our cost model. Cost of a TopicThe first step is design the content creation component of the model. Traditional cost metrics focus on the cost of a page [1]. Since pages are similar to topics, we will start our discussion with determining the cost o creating DITA topics. Costs are expressed in terms of content creator labor hours. Table 1 shows the cost of creating Task and Concept topics. For simplicity, we exclude Reference and other topic types, but you can easily customize the model to include them if needed. The scope of these estimates is a topic and does not include time for project/publication level activities such as designing the document outline, user task analysis, project management, implementing context-sensitive help, testing, status meetings or design meetings.
Table 1 (Cost of a Topic)
Now we have an approximate cost range in hours for creating Task and Concept topics that we can use in our model. Cost of User Guides Without Topic ReuseTable 2 through Table 5 shows the cost of developing a user guide for each of the three PDAs in DITA without taking advantage of topic reuse.
Table 2 (Cost of PDA One User Guide)
Table 3 (Cost of PDA Two User Guide)
Table 4 (Cost of PDA Three User Guide)
Table 5 (Total Cost All User Guides Without Topic Reuse)
This is really a worst case scenario that is not realistic because moving to DITA and not reusing any content is highly unlikely. But this simple project is a good starting point for the model and a base to which we can add reuse features of DITA. As you will see, the reuse features that you incorporate can be different for each documentation project. Cost of User Guides With Topic ReuseNow we’ll incorporate the cost of reusable topics (reusable in more than one user guide) in our cost model. Reusing topics is nothing new. This feature has been available in help authoring tools for more than 10 years. Table 6 through Table 10 shows the cost of developing a user guide for each of the three PDAs taking advantage of reusable topics. Some topics for the PDA One user guide may be reused verbatim in the PDA Two and PDA Three user guides because the topics are identical.
Table 6 (Cost of Reusable Topics)
Table 7 (Cost of Topics Unique to PDA One User Guide)
Table 8 (Cost of Topics Unique to PDA Two User Guide)
Table 9 (Cost of Topics Unique to PDA Three User Guide)
Table 10 (Total Cost All User Guides With Topic Reuse)
Although the scenario without topic reuse is highly unlikely and not realistic, just to be thorough, we are showing a cost comparison in Table 11 through Table 13.
Table 11 (Total Cost All User Guides Without Topic Reuse)
Table 12 (Total Cost All User Guides With Topic Reuse)
Table 13 (Savings)
For over a decade, significant savings have been achieved reusing topics in multiple publications. Cost of a Reusable Master TopicThe project is simple when topics can be reused verbatim in multiple publications. But what happens to our model when there is sufficient variation in our products that we cannot write a single topic to describe a given feature? Perhaps the product screen shots are different, an extra note or warning is needed, or a button has a different label. For all three user guides, some topics are similar. Most of a similar topic is the same for each PDA and can be shared. So, if all three versions of that content are included in one topic, then all versions of the user guides may be published from this topic. Using filtering metadata, content that is unique is marked as belonging to a specific PDA. When the user guide for a specific PDA is published, content that is specific to the other PDAs is filtered. The filtering feature in DITA is also known as conditional processing and it is what allows us to create and use reusable master topics. Table 14 shows the cost of creating reusable master Task and Concept topics.
Table 14 (Cost of a reusable master topic)
It would complicate our model to incorporate conrefs to a large variety of content types. Therefore, we are limiting the incorporation of conrefs in our model. We’ll discuss conrefs later, but for now a simple example is to reuse feature descriptions or screen shots that were created in Task topics by conref’ing them in your Concept topics. This would reduce the cost of the Concept topic by several hours. You will be able to customize the model to incorporate your use of conrefs in your project. Now we have the estimated cost for reusable master Task and Concept topics that we can use in our model. Cost of User Guides With Reusable Master TopicsNow we’ll incorporate the cost of reusable master topics into our cost model. Table 15 through Table 16 shows the cost of developing a user guide for each of the three PDAs taking advantage of reusable master topics.
Table 15 (Cost of Reusable Master Topics)
Table 16 (Total Cost All User Guides With Topic Reuse and Reusable Master Topics)
Table 17 compares the cost of creating unique topics to reusable master topics.
Table 17 (Cost of Unique versus Reusable Master Topics)
Reusable master topics cost significantly more to create, but with proper planning they can be used in multiple publications such that the overall cost of content creation for your publications drops dramatically. Cost Comparison: Topic Reuse Versus Reusable Master TopicsTable 18 through Table 20 shows a cost comparison of a project taking advantage of reusable topics to a project that is taking advantage of both reusable topics and reusable master topics. When filtering and reusable master topics are used, the number of unique topics and reusable topics that have to be created is drastically reduced. This is where the cost savings of using DITA really leaps ahead of conventional help authoring tools and technology.
Table 18 (Total Cost All User Guides With Topic Reuse and Reusable Master Topics)
Table 19 (Total Cost All User Guides With Topic Reuse)
Table 20 (Savings)
“What is the cost of DITA?” is the question that is the focus of this paper. Chances are that if you are reading this paper, then you’ve read the papers on the promise of XML, the ROI of DITA, the clear savings in translation costs alone, and you get it. However, you are being asked to justify the cost. How much is it going to cost to develop and publish content using DITA? Knowing percent reuse is important, but equally as important is the upfront cost? And, the savings are in more areas that just translation. There are savings in both content creation and content maintenance due to content reuse and conditional reuse. The cost to develop content and percent reuse are often discussed separately. But to know the cost of a DITA-based documentation project, both of these values need to be accurately incorporated into a cost model. In this paper, we focused on cost metrics. Later in this series we will focus on reuse metrics. For a given project, we must know the cost to create content without reuse and the cost with reuse. The difference is the savings. The cost to create content with reuse is equal to the cost to create the unique content and the reused content. That’s the upfront cost. Knowing this will allow us to more accurately calculate the cost of translation as well. What must be translated is reduced to unique content and reused content.
In Part 1 of this paper, we covered the following cost metrics:
The most important things we accomplished in this paper included determining the cost of creating a DITA-based topic rather than a traditional page. We then incorporated conditional reuse/filtering and determined the average cost of creating a reusable master topic. We observed that the biggest savings resulted when reusable master topics are incorporated. The flexibility and diversity of conditional reuse in DITA differentiate it from typical help authoring tool technologies and offer greater savings in not only content creation, but also content maintenance. The ultimate question.
In Part 2 of this series (coming soon), we will look at other costs that need to be accounted for in a DITA project:
Other papers in the DITA Metrics series (coming soon!):
Join the DITA Metrics Group
Mark Lewis works for YOU, the tech writer, and the information architect. Mark has received STC awards for Distinguished Chapter Service and the Florida Technical Communications Competition. Currently, he is the DITA Product Manager for Usability and a product Evangelist for Quark. He provides product direction and user experience designs for Quark’s structured authoring products allowing everyday authors to create reusable content without knowing the details of XML syntax. Mark presents at conferences and other industry events on topics including: object oriented design methodologies (for non-programmers to help jumpstart their understanding of structured authoring), information architecture, and the promise of DITA and XML. Mark manages The Content Wrangler Community Groups – WritingOBJECTively and DITA Metrics. Send questions or feedback to or hyperwriters@hotmail.com.
------------
Categories: Individuals, Interesting and Relevant Sites
DITA Metrics: Mark Lewis on Calculating The Cost of a DITA ProjectBy Mark Lewis special to The Content Wrangler You’ve read all the papers on return on investment (ROI) for XML and you get it. You’ve already concluded that moving to the Darwin Information Typing Architecture (DITA) will likely save you tons of time and money. But management says, “Prove it.” This paper was created to help you determine the cost portion of the ROI calculation. What are my costs now? What will my new costs be with DITA? And what is the difference—my savings? This is the first in the DITA Metrics series of white papers. The series will discuss cost metrics, reuse metrics, and a reuse strategy. This installment describes one model for calculating the cost of a DITA project. After doing some content analysis on your own documentation, you can customize this cost model to suit your documentation project. In the end, you should be able to speak the financial language of managers and prove to them in dollar signs the value of moving to DITA. [Note: To benefit from this article, you should have at least an intermediate level understanding of DITA including topic structure, elements, conrefs, child maps, and filtering / conditional processing.]
The DITA Metrics series:
Categories: Individuals, Interesting and Relevant Sites
Alfresco Is Not A Picnic: The Problem With Metaphors And Content Management SystemsBy Felice Bochman, special to The Content Wrangler “Humans are wired to put things in buckets. We have an innate need to create categories and sort things into them.”—Richard Hamilton on content delivery, The Content Wrangler, September 2008 Many editors and other language-oriented professionals I know look for metaphors as a way of figuring things out. We tend to see the world through a “this is like this” lens complemented by a “how does this fit into the big picture” manner of thinking. At its most elemental, this is simply a way of bucketing things. It’s a matter of perception not taken lightly—I hope. What “things” do we humans bucket? Anything, really, but for editors this usually applies to content of some kind. And, once we have content of some kind, it’s only a hop, skip, and a jump to content of a particular kind, as in content of one type or another. Alas, content doesn’t magically appear on web pages. We “webitors” must use a CMS to publish language to our websites. In and of itself, the idea of a CMS is no big deal—one usually needs some kind of delivery system to display or publish language pretty much anywhere in any medium—unless mental telepathy is involved. But, it’s how the CMS is structured that is at issue. In the content management system I currently use, I’ve noticed no less than nine metaphors, which are meant serve as organizing principles, but they don’t. Granted, the particular tool I use isn’t really meant for gobs and gobs of editorial work, but nonetheless its organization and structure were likely created by a developer within arm’s reach of a bottle of tequila. Really. Imagine the following. You’re brand new to the publishing tool you will use to do your work. You’re neither a stranger to web publishing, nor to any of several content tools you’ve used in the past. So, what’s so special about this one? This one is Alfresco. (Cue the choir.) The name connotes the outdoors, picnicking, of a la piscine, of spring—not to mention something about renaissance painting (that would be a fresco but still, close enough). You get the idea—pastels, a breeze, distant laughter, potato chips—all swirling about erroneously in this identity crisis we call a publishing tool (not that there’s anything wrong with that). Is it not true that when you encounter something brand new and unknown, the first thing that comes to mind is, “Do I know anything like this?” Or, based on what I do know or can relate to, “how should I get around this thing and make it work?” One faces the fear of the unknown by resorting to the familiar. It’s like driving in England—you know the deal—car, steering wheel, directional, seatbelt, traffic, horn, and round abouts, except that you’re on the wrong side of the street (and you could die instantly—unlike using Alfresco, though it feels like you want to). The reality is that there are complicated heuristics lurking behind our human ability to make the aforementioned leap of faith when faced with the unknown. I mean, there’s no proof that using something familiar can help you decipher the unfamiliar. Right? Enter Alfresco. See company home. Oh, good! Home. I know that! It’s a house, with a roof and rooms, and I live there with my family. Good! Let me get this content up on our site. I suppose I need to find a “door” or perhaps a “hallway” or “room” which would signify the next logical step in my trek though this “home.” Perhaps I’ll find a “kitchen” where I will cook up content, or maybe an “attic” or a “basement” where I will store the stuff. But no! No. No. No. I leave the “home” and go to a playground—something for children. Well…it must be a playground as I’m supposed to find a “sandbox” there. Ok, fine—maybe it’s in the backyard of the home. I’m sure that once I’m in this sandbox, I’ll see some other playground equipment that will help me get this content on the site. Or, I’ll see some other children and they will help me. Like a kid new to the neighborhood, I find my way to the fields I need—a mere eighteen clicks away (that’s not army chat for 1.6 miles in case you were wondering). This might be a slight exaggeration, but when you’re new to the ‘hood, your perception of distance may be skewed. I am armed with keywords and categories and conversion tools, which for some reason are tiny—even tinier than I am (just over five feet). But again, no! Quick, quick, close my eyes and click my heels together—there’s no place like home, there’s no place like home. Crap. I’m still in the root folder (potato, rutabaga, garden?). Maybe I’m not wearing the right shoes. The content seems to be in there somewhere, but there’s a sudden invasion—the military has arrived! I’m faced with a deployment. My thoughts turn to troops, war rooms, strategy, weapons, and uniformed men flicking their laser pointers at topographic charts. It all seems very far away from home or even from the sandbox where I once was. It’s okay, though. I’m smart and flexible and I can switch metaphors again. I’m in the army now. What modified items have to do with the army (perhaps a SNAFU or something gone FUBAR), I really don’t know, but I’m willing to give the military metaphor a chance in hell. I submit the chance in hell. Excellent. I’m ready for the next maneuver from my commanding officer, Colonel Alfresco. But, alas. He is gone. He went away and a photographer took his place. What the . . . ?! This photographer cannot possibly be a war correspondent with a camera. No—he’s just taking a snapshot. What happened to my war? The troops? The plan of attack? The battle? Gone, gone, gone. Sudden peacetime is a little distressing because I don’t have a camera or film or even subject matter to photograph. There are no lenses, f-stops, apertures, tripods, light meters or anything even remotely photographic other than the mysterious snapshots. Home. Sandbox. Root. Deployment. Snapshot. What do the above have in common? Nothing. Zero. The null set. And that wasn’t a trick question. You could force a commonality by saying, “a sandbox might be in the backyard of one’s home,” as I mentioned earlier in this article. But that’s a stretch. No. Actually, it’s a Hail Mary—and that would mean we’re talking about football. Does this seem kinda random to you? I pause to check my location and am hoping for something like a GPS metaphor (a.k.a. a map). Where the heck am I? Oh, I see. I’m in production. For real? I don’t know if I can squeeze in another metaphor without having to buy rush tickets, if you know what I mean. You don’t? That’s funny, I mean, you look so much like someone I know. Where is wardrobe when you need them? Anyway… I have apparently arrived on Broadway, during a deployment, with the snapshot I found in the sandbox in the backyard of my home. And the potato. The god of dynamic systems is laughing his ass off. I am possibly in a novel by Kundera. There is only one thing left to try. Space travel. You laugh, but it is indeed the truth. I’ve fallen off so many metaphors that I’m sure at least one snapshot will need fixing. In order to do that, space travel is involved. This is very lucky because I was just at Epcot Center where I rode this cool space ride. I was the virtual navigator of my own ship (my frightened flight team with barf-bags in tow). I see the navigator tool. And good heavens when I click it there is a shelf (hey, that makes ten metaphors, no wait, eleven if you include the potato, and twelve with the theater fiasco). Does this mean there’s a bookcase or library on my spaceship? No, of course not. That wouldn’t make any sense at all. Perhaps it’s a continental shelf, though I really hope it isn’t as I’d prefer not to introduce a geological or oceanographic metaphor to this affair (though I do like to swim in the ocean). By some miracle of semiotics, I launch the content I came to publish. I don’t know what else to call it. I was on a spaceship (or perhaps it was a boat), so the least I can do is launch…something. If the content is a rocket, then I have just launched it from a…sandbox? Houston, we have a problem. I’m fresh out of metaphors, save one. Though there is no chocolate factory in sight, there is a Charlie and a Chuck (two genius tech dudes in my office who, lucky for me, are very kind and often bemused)—and they are worth their weight in golden tickets. HEY YOU GUYS!!! Are there Oompa-Loompas in here too? I’m hopelessly, hopelessly literary, but that doesn’t make me a technophobe. In fact, I crave meaning within systems, pathways to usage that make sense, and sense which is based on experience of the empirical kind. Build a tool. Pick a metaphor to represent the structure and functions—extend the metaphor as needed—but use only one. Trust this editor—that metaphor will contain all the buckets you need.
As for the picnic—I said it before—there’s no place like home, in your backyard, in the sandbox, potato salad and all.
Categories: Individuals, Interesting and Relevant Sites
Microsoft, Welcome to the SaaS World (and See You in a Year)By Rodrigo Vaca, Director Marketing, Zoho Microsoft confirmed recently a widely circulated rumor and announced, with gran fanfare, that next year they will be announcing a web-based version of their Office product. Yes, you read it right, Microsoft announced that they will be announcing… You can read the full story in PC Magazine, ComputerWorld and many other publications. The question for many will be… what does this announcement of an announcement means for Zoho and other SaaS vendors? It’s simple: it means two things. First—it means fantastic news! Microsoft had been pooh-poohing the whole SaaS world… even going as far as denying the inevitable and creating its own Software-plus-Services trend-of-one. But Microsoft took one big step forward, and added some extra validation to the whole concept of productivity applications delivered using nothing but a browser. Second—and particularly to Zoho—it means business at usual. Will there be increased competition in the on-line productivity space? You betcha. But it’s not like we had a monopoly on that market to start with. We thrive on competition. We have multiple competitors for each and every one of the 19 (and counting!) different services we provide. That only makes us better. But beyond that, our users get value from having so many tightly-integrated SaaS applications. Zoho is much more than the on-line productivity suite. We have the most comprehensive portfolio of on-line productivity and business applications. Maybe the real question is—what will this mean for Microsoft? What will this mean for their business model and their uses? For their business model—I wonder if they’ll charge the same for the on-line version as they charge for the old, dinosauric version? Are they still going to be able to collect the absurdly high CAL fees for office users? They surely risk loosing a grip on the desktop, as well, you don’t need Windows to run applications on a browser. In any case, and beyond the business implications—let’s see how this works for the most important folks on earth: users. If Microsoft MSN Live Search, Microsoft MSN Live HotMail and a host of other Microsoft on-line products are a proof of Microsoft’s Internet prowess… Microsoft, welcome to the SaaS world. See you in a year (or so). P.S. I’m taking bets on the simple and elegant name the Microsoft on-line office will get. I’m betting on: Microsoft Office Live 2010 Standard Web Edition.
About the Author
Categories: Individuals, Interesting and Relevant Sites
Information Visualization: A Look At U.S. Newspapers And Their Picks For PresidentThis interactive map from the folks at 1000 Words is a collective representation of the endorsements made by US newspapers for the 2008 presidential candidates. As of the date of this posting, Senator Barack Obama (D) leads Senator John McCain (R) in the number of endorsements he has received from major newspapers. To view individual newspaper endorsements, click on the corresponding red or blue balloons. Use the zoom feature to target the papers you’re interested in viewing in markets in which multiple major papers exist. According to 1000 Words, the newspapers represented on the map were gathered from the 100 largest U.S. newspapers by circulation. In states where no newspaper ranks in the top 100, the paper with the highest daily circulation in the state—according to data provided from the Audit Bureau of Circulation was included. National newspapers like USA Today were not included. In some larger markets in which more than one major news outlet exists, multiple newspapers may be included Cities with more than one paper may overlap (i.e. The New York Times, New York Post, and Daily News). The creators of the map like to remind everyone that the map “is not an endorsement of any particular candidate” and that is will be updated to reflect new endorsements as they become available.
Categories: Individuals, Interesting and Relevant Sites
Content Reuse: Is It Harmful?By Richard Hamilton, special to The Content Wrangler A place for everything and everything in its place—Isabella Mary Beeton, The Book of Household Management, 1861 For a number of years it has been a matter of faith that the more content a technical documentation team reuses, the more efficient they are presumed to be. Vasont Systems, a content management system (CMS) vendor, claims its users average 71% content reuse. That is a bold claim, but I suspect that if you could show even 30% or 40% content reuse, you would earn bonus brownie points with nearly any manager. But, are you really more efficient? Let’s take a deeper look.
Terminology
[Note: Any given piece of content can be reused or single sourced or both. Defining single sourcing and reuse separately may seem to be nitpicking, but the distinction is important.] Why Minimize Reuse? I doubt anyone would argue against minimizing duplication. The benefits are clear and the exceptions are relatively few. I also agree wholeheartedly that single sourcing makes very good sense. But, I differ with the mainstream regarding reuse. I believe you should minimize reuse, not maximize it.
There are two main reasons for minimizing reuse:
Driving out Duplication
Most efforts to maximize reuse start by looking for and driving out duplication. The search for duplication typically identifies places where there is an exact or close match between two or more pieces of content. For each of those matches, you have three choices:
All three of these choices cost something. Choice one abandons reuse, giving you more content to maintain. That is not always a bad choice; if there are enough differences in the content to make maintaining one version more expensive than maintaining two versions, this might be a valid option. Choice two is classic reuse; you will have some additional work making the module work in multiple contexts, and you will have some additional work over time maintaining that independence. But, you will usually save effort over maintaining separate versions. Choice three eliminates duplication and reuse. If you can eliminate all but one of the situations that used the content, you have not only eliminated the duplication, you have reduced the overall size of your deliverables. When it works, this choice is the most efficient of the three. The Bias Towards Maximizing Reuse
I see a bias towards choice two in most of what I have read about content reuse; in fact, often choice three is nowhere in sight. While structure is given its due, I see little discussion about structuring content to minimize the need for reuse. Several factors fuel this bias:
If unchecked, these biases can leave you with a lot of unnecessary reuse. You can argue that’s not a big deal, but even when well structured, a heavily reused module will take more maintenance than one that is used in just one place. In addition, it will needlessly increase the bulk of your deliverables. Both of these factors decrease efficiency. If you are serious about maximizing your efficiency, you need to structure your documentation with a bias against both duplication and reuse. Implications for Modular Documentation So, am I arguing against modular documentation? No. Consistent structure and style help people use your documentation. And good methodologies give your authors the guidelines they need to produce consistent structure and style. Where things go off the rails is when you try to treat your documentation as a set of modules that can be indiscriminately mixed and matched to create whatever deliverables you want. Content that is central to your message deserves a context within which it can live. If it is pulled out of context, it will either be confusing, or it will require additional information to provide that context, either as part of the module itself, or in the including document. Jon Bosak summed up the problem nicely in his Closing Keynote at the XML 2006 conference: Another ancient subject that seems to be popping up again is the idea of modular document creation. This is one of those concepts that comes through about once a decade, seduces all the writing managers with the prospect of greater efficiency, takes over entire writing departments for a couple of years, and then falls out of favor as people finally realize that document reuse is not a solvable problem in document delivery but rather an intractable problem in document writing – which is, how to retain any sense of logical connection between pieces of information while writing as if your target audience consisted entirely of people afflicted with ADD. While I do not have quite as pessimistic a view of modular documentation as Bosak expresses here, I do think that maximizing reuse without considering context and structure yields documentation that is difficult to use. Even if your structure allows you to easily reuse modules, there is benefit in doing so only when you have a compelling reason. What I advocate is to first build a structure that minimizes the need to reuse content, then judiciously choose where, within that structure, you will reuse. Obvious cases include glossary entries, legal boilerplate, and repeated procedures. And you will find other places where it simply makes sense to include content rather than sending the user off somewhere else to find it. If you start with Isabella Beeton’s words in mind, you will end up with less reuse, better structured documentation, a more efficient process, and maybe customers who do not feel you are inflicting ADD upon them.
About the Author
Categories: Individuals, Interesting and Relevant Sites
Economic Woes Signal Content Industry Job Losses: It Could Happen To You!By Maxwell Hoffmann, special to The Content Wrangler
It was just another beautiful autumn day in Portland, Oregon. Since it was October 1st, I swiped my credit card and bought my monthly MAX train pass for $86. If I’d been psychic, I could have bought the $4.75 one day pass. I wouldn’t need the train for awhile.
“Although my instincts are to paint a pretty picture, I’m sharing my raw experience because this could happen to you. If it doesn’t, it will happen to someone you know before the end of the year.”
Now I have a mortgage. Now I have no income. The woman at my grocery check out stand has a job; I don’t. How could this be happening? “It’s the economy.” Since this happened on October 1st, before the biggest stock drop in history, one can only imagine how many others have heard those words across the country, in every segment of the workplace. So, before someone in your company HQ decides to change your row in a salary-sorted Excel spreadsheet to pink, read on for some useful tips.
New tools for tough times
How do you get noticed with 500 competitors for the same job? Avoid “canned” or boilerplate responses to job listings. Write each cover letter from scratch (even the on-line ones) and always mention several points addressed in the job description. Have multiple resumes prepared ahead of time that highlight a variety of strengths for different careers you may be focusing on. I have cover letters and resumes for five different job searches, (1) Translation/Localization, (2) Desktop Publishing/Production (3) XML and Content Management (4) Author/edit content and (5) Course Development and Training. I have created a resume for each discipline which highlights appropriate career accomplishments and mentions relevant former customers or business partners. So, what if you’re not laid off yet and you want it to stay that way?
There are a variety of things you can do to increase your value, especially if you are involved in content creation:
What not to say to a laid off friend
But enough about you and your potential traumas. What about my pain and suffering? Having never been laid off before, I hadn’t realized how stinging the handful of “insensitive” replies from friends can be. I’m sure that I’m guilty of having said at least one of the phrases below in the past to a laid off pal. Here are some actual quotes from responses I got to a mass e-mail that made it clear I had been laid off, have a mortgage and need a job soon:
If you don’t know of any possible leads or advice just be say so. That gets you off the hook, and you avoid a silence that says “I don’t care.”
Categories: Individuals, Interesting and Relevant Sites
Effective Content Reuse: Storing Paragraphs, Not Topics, Is Key to Content Management SuccessBy Paul Trotter, CEO, Author-it Software Corporation Various reports have shown that knowledge workers spend about 30% of their time looking for content that has already been created. If that sounds like a colossal waste of time and money, it is. But in terms of waste, time and money is only the tip of the iceberg. The more pressing problem is that by continually creating corporate documents from scratch, companies run the risk of producing external and internal communications that are inconsistent in style, appearance, and - even worse - message. The ramifications of these shortcomings can be disastrous, particularly with respect to industry-specific compliance issues. Consider a financial institution that generates vast amounts of investment offers for its customer base. Disseminating the most up-to-date information quickly and accurately to end users is critical. However, if a law is changed or the restrictions on an investment vehicle are revised, and the material reflecting these changes is either inaccurate or not distributed in a timely manner, an institution leaves itself vulnerable to potentially massive fines for non-compliance – perhaps even a litigious situation. Clearly, such examples, as well as less dramatic ones, highlight the need for content management within most organizations. However, many of these entities are relying on content management systems (CMS’s) that are woefully insufficient or no systems at all! Frankly, many organizations that are authoring and managing huge volumes of content are using nothing more than simple desktop products. What they’ve realized, whether it’s triggered by compliance issues, lack of resources to manage the problem, fear of litigation because of inconsistency in the content, or the increasing cost of content translation (localization) is, “We’re not doing this very efficiently.” At its core, the primary reason people consider component content management is that there is duplication of content and they realize that they can save time and money, as well as increase the consistency of their internal and external communications, by simply reusing previously approved information. Further, it holds the promise of improving the speed and productivity of the people producing these communications. There are “trickle down” effects as well: editing is easier and translation to multiple languages is simpler, to name just two. When company management reaches the conclusion that reuse is not feasible using the current tool set, they explore the idea of reuse with increased diligence; what they find is a fundamental difference at the very core of competitive systems. A Topical Approach Most content management organizations promote the concept that in order to reuse content you must segment content into topics. This approach works well for technical information because with technical content you are describing concepts, asking people to perform tasks or follow steps, or providing reference material. Consequently, you can reasonably and easily create topics that represent concise ideas, and ultimately, small chunks of content. However, while people might comprehend the benefits that topic-oriented documentation provides, they generally don’t grasp the downsides of such an approach. One of the first requirements that need to be fulfilled in order to utilize the topic-based method is for people to start examining how they write. They must figure out how they’re segmenting topics. They also have to write in a style that is consistent, so that when other personnel are assembling documents, the documents sound and read like they’ve been written by the same group of people. But this is not always possible from document to document. For instance, a user document is typically written in a “second-person” format, i.e., “you will do this or that.” Conversely, a sales proposal, or a response to an RFP would normally be developed in a “third-person” format, i.e., “the user will do this or that.” This minor difference in the way diverse types of documents are written presents a formidable challenge in whether topic-based reuse will be practical or not. Secondly, topics must be crafted in a way that makes them reusable so they can be slotted into any number of documents without causing problems of context. Then, people have to be disciplined enough to go out and actually seek the information they need and place it in their documents. Obviously, it is essential to have access to tools that will support this task, but it’s even more crucial to have the required discipline in order to gain the benefits that the topic-based content management promises. Few content management companies truly understand the problems behind topic-based content reuse. In the end, all of the problems actually come down to one factor: people. Either people are unwilling to take the time to seek out content similar to what they’re writing, or they don’t even know it exists, so they don’t think to look for it. How can these problems be addressed? If the information is made even more granular, then it’s even harder to work with. Part of the reason topics became difficult for people to work with is that when they’re used to working with a 100-page document, a two-paragraph piece of information seems too small, too granular to deal with. What’s more, the time saved through the reuse of content – by reducing the time spent on other parts of the writing and editing process downstream – is negated by the time and management overhead required to deal with these minute pieces of information. And the more granular the information, the more burdensome it is to manage. If you can’t save money, there’s no point in doing it. Even when a writer finds the topic that he or she is looking for, making it usable to the next person is a job in itself. Let’s say you’re writing some marketing content for a website. You know someone has written something similar, but you’re not sure where it is. Using traditional search methods, you find the piece you were after, but it was inside a larger topic describing the subject in far greater detail than you needed; two of the paragraphs were exactly what you wanted to use, but the rest of it is superfluous. In order to reuse just those two paragraphs using a topic framework, you would first have to turn those two paragraphs into a separate topic. Then you would nest that topic in the original place you found it; you would then have that smaller topic to reuse where you wanted it in the new document. What this all means is that if you’re creating topics, not only do you have to create a topic that makes sense for your current purpose, you must be prescient enough to guess or foresee the ways people might use that content in the future, possibly in more granular form. Clearly, this is not a practical exercise; the amount of effort to break up the desired content and “re-reference” it is substantial - to the point where you’re simply going to copy and paste it. In a topic-oriented world, that creates duplication, the very thing you’re trying to avoid. Paragraphs Preferred The obvious question, then, is, “Why not just save the paragraphs and forget the topics?” In other words, go to the paragraph level, but do it in a way in which the user doesn’t have to really do anything. There are a few products that do store content in paragraphs rather than topics. As a result, if a writer was to copy and paste a paragraph into another topic and save it, the system would say, “I’ve got that paragraph; I’m going to reference it.” In this way, content is never duplicated, and all identical content residing in the company’s database is instantly consolidated. Even fewer of these products can go to the next level. When the developers of these products started looking at paragraphs lined up next to each other, they saw many that were very similar. They may have very tiny differences that were not obvious to the user, but the computer could easily identify them. So the products were equipped with a background algorithm that compares every paragraph to every other paragraph in the database and generates a “matrix of similarity.” Through visual highlighting, the user is shown similar paragraphs to the one that is being written; most of the products use a color to show paragraphs that exhibit similarity of 95% or higher. This affords the user the opportunity to consolidate similar content into one way of saying it. Sometimes, the only differences between paragraphs are white space, punctuation, or capitalization. These elements are placed into a special category called “Exact Match,” while the others are called fuzzy match. Of course, it is also important to prevent people from creating these differences in the first place – that is one of the primary tenets of successful content reuse. Consequently, it is optimal if these differences can be circumvented at the most effective time possible: the actual typing process. As the writer types, similar content is needed, so the writers can choose to reuse content at the point of content creation. If you analyze the value and investment of a CMS, the only opportunity you have to save financial and human resources is at the time the user is typing; once content has been typed, time has been used, and the opportunity evaporates. There is an ancillary benefit to an effective CMS. Not only are users prompted with suggestions of available, matching content, it also becomes obvious to users how certain documents are written. The writer might be composing a piece that is slightly different from the corporate “norm,” but as soon as suggestions of content are presented, it is quickly apparent what the proper style is for that type of document. The writer might well create a unique manuscript, but it is still going to be written and structured the way other content is written and structured within the company, department or division - not from a language or technical point of view but from the perspective of actual sentence structure. Thus, there will be consistency not only in reuse of content but also in style. It should be noted that while it is preferable for a CMS to operate on a paragraph level, the ideal CMS can function on a topic level as well. Once a solution has offered content suggestions at the paragraph level, it will optimally allow the user to view all of the contexts, or topics, in which the information appears. The Human Factor As pointed out earlier, even with the most powerful content management tools offering content suggestions to the user at the paragraph level – without the user being burdened by the need to search or even ask for it – there are human factors that can prevent a successful system implementation. The net effect is that many of these systems never reach their true potential.
The range of human factors is almost as diverse as the humans who use them. Most are thinly veiled excuses that, unfortunately, mask some writers’ insecurities, biases, or inability to adapt to new technology, or even change in general, such as:
Clearly, the goals of the organization, at least in the area of document creation, are sometimes at odds with the goals – or at least the approach - of the people who work there. There are a number of reasons this can occur, as stated above. But in the end, there is usually one overriding factor that serves as the foundation of all the adoption issues: employees don’t view content as a corporate asset that takes time and resources to create. And because few people consider it an asset, few people track it and manage it the way they would manage, say, the company’s financial resources. In the final analysis, the best content management tool will not succeed unless it is easy to use and people are willing to use it. What’s more, there has to be a set of initial guidelines set down at the corporate or department level that people will actually follow. If they don’t adhere to them, the entire exercise is a waste of time. Granted, this will involve some work, and it might seem that the initial effort to integrate the system into the corporate culture will make the overall task of creating documents harder than before. But if done correctly, it will be the classic case of taking one step back to take 10 steps forward.
About the Author
Categories: Individuals, Interesting and Relevant Sites
Editing Modular Documentation: Some Best PracticesBy Michelle Corbin and Yoel Strimling This article presents eight guidelines that we consider to be best practices for editing modular documentation. These guidelines are based on both the key concepts of modular documentation and on our combined experiences of editing modular documentation. We also make three concrete suggestions about how editors can be involved in the process of creating modular documentation. We believe that these guidelines and suggestions can help both writers and editors work together to create clear, consistent, and usable modular documentation. Much has been said about the creation of modular documentation – from content management systems, to information architecture, to delivery forms, to the usability of modular content (content being easier to use, easier to understand, and easier to find, and so on. However, not much has been said about the editing of that content, and what the editor’s role is in such an environment.
Illustration by: Leo Blanchette - Fotolia.com
The Mechanics of Modularization
The main goal of technical documentation is to provide access to technical content in an easy, efficient, and logical way. By separating descriptive from procedural information, organizing this information into discrete, standalone modules called topics, and then linking related topics to each other, writers can help readers quickly and easily find and use the information they need. Writing in a topic-based modular manner also helps writers better organize, construct, and write their documentation because it forces them to think about how to present the information in a clear and succinct way.
Chunk Text into Logical Standalone Topics
Label Topics with Clear and Meaningful Titles
Labeling is the process of creating unique, clear, concise, and accurate headings that correctly identify the content included in a topic. This structured format further separates descriptive from task-oriented information and provides immediate visual clues to orient readers, enabling them to easily search for and access the information they need.
Linking is the process of connecting topics to other related or relevant topics, which enables readers to easily jump back and forth between related subject matter in a document and to find the information they need. While the hierarchy represents the structure and flow of the different standalone topics, the links are the glue that holds all of the different topics together to provide meaningful content. Best Practices for Editing Modular Documentation
Based on these three main concepts of modularization, as well as on our combined editorial experiences, we have identified the following guidelines that we consider to be best practices for editing modular documentation:
Topic Types Must Not Be Mixed
As stated previously, modular documentation must be chunked so that descriptive information (concept and reference topics) is clearly separated from task-oriented information (task topics). Readers who are looking for information about how to do something do not want to wade through too much descriptive information to get to what is relevant to them. Similarly, readers who want to know detailed background information about what a product or service does do not necessarily want to see step-by-step procedures about how to use it.
Topics Must Be Standalone
Because modular documentation is made up of chunked topics that are not read in any particular sequence, each one must be standalone. Readers must be able to understand the topic they are reading without having to read something else, that is, all the information they need is located in this topic.
Introductory Information Must Be Clear and To-The-Point
Chunking documentation into meaningful standalone topics requires the information in the topics to be organized in a logical and usable order. This chunking of information means that the first paragraph of a topic is the most important paragraph because it states the purpose of and summarizes the information presented in that topic. Procedures especially require standard introductory wording. This introductory information helps readers know if they are in the right place for the right information they need. The first sentence of the topic must also be clearly and directly related to the title, so readers can immediately see the connection between it and the topic content.
Topics Can |