IBM, Mayo form open-source health IT consortium

By Grant Gross, IDG News Service |  Open Source, health care, IBM 2 comments

Biomedical informatics researchers at IBM and the Mayo Clinic have launched a new open-source consortium focused on natural language processing (NLP), in an effort to help doctors share diagnosis and treatment information.

The Open Health Natural Language Processing Consortium, announced Thursday, will focus on technology to allow for large-scale data aggregation, allowing doctors to mine medical records in their specialties to find similar cases to study before making difficult diagnoses or before determining treatment.

Doctors will be able to review any physician notes on similar cases, but no personally identifiable patient information will be available in the database, IBM and Mayo said.

With the launch of the consortium, the two organizations have released two projects under open-source licenses, one focused on clinical notes and one on pathology reports. The consortium is using the Apache license, version 2.0.

The organizations are inviting others to help develop NLP tools. "By making it an open-source initiative, we hope to enable wide use of these NLP tools so medical advancements can happen faster and more efficiently," Dr. Christopher Chute, a Mayo Clinic bioinformatics expert and senior consultant on the project, said in a statement.

Two other health care organizations, Seattle Group Health and the U.S. Department of Veterans Affairs Boston Healthcare System, plan to participate in the consortium, and other participants are welcome, IBM and Mayo said.

As more health care providers adopt electronic health records, it will become increasingly important to be able to search those records, the organizations said. Mayo and IBM have developed a system for extracting information from more than 25 million text-based clinical notes based on IBM's open-source Unstructured Information Management Architecture, or UIMA, they said.

The two organizations have also developed a system to extract cancer diseases characteristics from pathology reports, allowing for the computation of cancer stage.

"Large-scale information extraction from the clinical narrative is a vital component in advancing translational research and patient care," Guergana Savova, a medical informatics specialist and Mayo's lead on the project, said in a statement. "It 'unlocks' the clinical textual data that resides in huge repositories. Such technology would allow for large-scale data aggregation, analyses and usage -- just imagine the power of data from millions of patients."

The organizations have not yet determined what NLP projects to work on next, an IBM spokeswoman said. "The goal is to first get feedback from participating institutions on the initial project, and then expand," she said.

2 comments

    trinitycarpet
    trinitycarpet 2 years ago
    Such steps for medical industry can prove much effective, specially such Giant Names working on it. Carpet Tampa
    Olivaruskin
    Olivaruskin 2 years ago
    Appreciative steps taken for HealthCare and Medical Industry, specially focusing on natural language processing, it will be proven great for Doctors and Healthcare organization. Sarasota Home Healthcare

      Add a comment

      Post a comment using one of these accounts
      Or join now
      At least 6 characters

      Note: Comment will appear soon after you have activated your account.
      Obscene/spam comments will be removed and accounts suspended.
      The information you submit is subject to our Privacy Policy and Terms of Service.

      ITworld LIVE

      Open SourceWhite Papers & Webcasts

      White Paper

      Consolidating SAP Applications to Linux on Power by IDC

      IDC studied a group of enterprises that had deployed SAP applications on IBM Power Systems servers running Linux server operating environments and had been working with those systems for several years. Learn about the results...

      White Paper

      An Interactive eGuide: Open Source

      By now, enterprises are well aware of the benefits of open-source software, which boasts a clean design, reliability, and maintainability, as well as support for standards and community values. But perhaps the biggest benefit is quality; since open-source software users have access to source code, bug fixes and enhancements come from multiple sources, often resulting in superior software.

      See more White Papers | Webcasts

      Ask a question

      Ask a Question