Resurgent text-mining technology can greatly increase your firm's 'intelligence' factor

By Maggie Biggs, InfoWorld |  Software 7 comments

CHANCES ARE you're probably already analyzing the information housed in your company's relational databases. Doing so can provide you with vital statistics regarding sales, inventory, customer buying trends, and much more.

But what of the data housed outside your relational databases? We face a serious glut of information today, and unstructured data forms a large part of your corporate information assets.

Documents on your intranet, e-mail that contains internal research, and groupware applications that house customer comments are all examples of unstructured data sources you may already have. Moreover, your operations may also need to tap external, nonrelational data.

Enter text-mining technology -- part of the knowledge management category. Text mining is similar to data mining; both uncover information relationships. However, text mining focuses on nonrelational data -- whether inside or outside your company.

The idea behind text mining is not new; many companies have used it for some time as the technology that drives custom data-analysis solutions. The Gartner Group estimates that companies have already spent nearly $1.5 billion on consulting services to tap into a wider array of corporate data, and the figure is expected to jump to $5 billion by 2001. However, the market for text-mining tools is also expected to expand greatly in the next several months.

Text-mining tools are ideal for information "discovery." The cost and types of text-mining tools vary greatly today because the general-use market is still emerging, but they usually include analysis and search facilities. As one who is always discovering business and technology data, I believe text mining will be a positive investment for those who employ it.

What can you expect to gain from implementing text mining? Obviously, you'll see an increase in the value of your existing data assets. Your staff will be able to quickly draw upon a much wider assortment of information, which should help increase your productivity. Moreover, you can potentially step ahead of your competition by having more complete information to proactively make better-informed decisions.

Because the text-mining tool market is still evolving, those evaluating the technology will need to consider a few key elements before making a purchase. Chief among these should be choosing a tool that does not require a huge first-time categorization, tagging, or integration effort.

The tool you select should be capable of automatically identifying and indexing unstructured data concepts. You should also expect some form of graphical interface that supports a high-level view of the data as well as the capability to drill down to a very detailed level. At this time, the types of interfaces differ by product.

If you want to get a feel for one type of high-level interface and its drill-down capabilities, check out the text-mining demonstration of WEBSOM (a Web Self-Organizing Map), which brings together more than a million documents from 80 Internet newsgroups (see websom.hut.fi/websom/milliondemo/html/root.html).

Text-mining tools are not supposed to be static in nature, given the changing data involved. The tool you select should let users explore data paths that form new relationships. And, the facilities should support ongoing mining activities.

You should alsoo expect that a text-mining solution should integrate with your collaboration tools. After all, much of your unstructured data gems are actually stored inside the heads of many employees. Moreover, you should be able to tap internal and external data -- only some of the tools now available support the latter.

The types of unstructured data formats supported by text-mining tools also vary greatly. You'll want to carefully inspect which formats are supported (such as text, spreadsheets, graphics, presentations, and compressed files). If you're careful, you can locate text-mining tools that are also capable of mining relational data formats.

Some of the text-mining tools available now include Semio's SemioMap (www.semio.com), IBM's Intelligent Mining for Text (www.ibm.com/software/data/iminer/fortext), Autonomy's Agentware (www.autonomy.com), and Megaputer's TextAnalyst (www.megaputer.com).

7 comments

    archieprescott
    archieprescott 15 weeks ago
    I believe that the technology of text-mining can bring about innovation in any firm.
    lanceruch
    lanceruch 1 year ago
    I found this technology something new to my ear. I work in sales before and have some statistical study manually for there are no system like what we have today.Beautiful Naked Woman
    Anonymous 1 year ago
    Nice blogpost, acceptable attractive website, added it to my favorites.[ mobile spy phones ] [ spy mobile phone ] [ mobile spy ] [ mobile phone tracking ] [ cell phone tracking ] [ spy phone ] [ mobile tracker ] [ Cell Phone GPS Tracking ] [ phone tracking ] [ phonetracker ] [ spy phone ] [ spy on a phone ]
    Anonymous 1 year ago
    "Resurgent text-mining technology can greatly increase your firm's 'intelligence' factor" its one of my favorite post. Thanks for informative sharing.
    Anonymous 2 years ago
    There are always good information with regards to data analysis of your sales report. essay writing
    essay writing service
    essay writing service
    Anonymous 3 years ago
    This technology will definately become an cost effective solutions of text mining in enterprises solutions. SEO Services Web Designing SSL Certificate Custom Web Design

      Add a comment

      Post a comment using one of these accounts
      Or join now
      At least 6 characters

      Note: Comment will appear soon after you have activated your account.
      Obscene/spam comments will be removed and accounts suspended.
      The information you submit is subject to our Privacy Policy and Terms of Service.

      ITworld LIVE

      SoftwareWhite Papers & Webcasts

      White Paper

      Best Practices Guide: Microsoft Exchange 2010 on VMware

      This guide provides best practice guidelines for deploying Exchange Server 2010 on vSphere.

      White Paper

      Free Trial: vRanger, the Powerful VMware Recovery Solution

      When disaster strikes, don't waste hours and dollars recovering critical data. vRanger delivers blazing-fast speed and granular recovery for your VMware applications and data. Get your free trial today.

      White Paper

      Executive Guide to Business and Software Requirements

      This paper is designed as an executive briefing on the issues surrounding business and software requirements. It features a wealth of statistics and tactics to help you get requirements right, and includes a tear-out single page summary.

      White Paper

      How to Launch a Successful IT Automation Initiative

      Corporations across all industries are under increasing pressure to cut costs and work more efficiently. In the race to meet both of these requirements, many organizations turn to technology, often purchasing and installing disparate pieces of software in hopes of achieving efficiencies not afforded by manual systems.

      White Paper

      Why Corporations Need to Automate IT Systems Management

      With corporate budgets being slashed and leaders expecting more out of their employees, companies are forced to do more with less, yet are still expected to provide the highest quality experience to customers. This is pushing them to make better use of their IT assets without breaking the budget. Companies are under more pressure than ever, thanks to data management regulations; increasingly complex security threats; and growing demand from management and end users for 24/7 uptime and high performance. These hurdles require a strategic investment in technologies that boost efficiency, save money and position IT as an integral part of the entire firm's operations. IT systems management is helping corporations fill these gaps.

      See more White Papers | Webcasts

      Ask a question

      Ask a Question