• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar
  • Skip to secondary sidebar
  • Skip to footer

  • Opinion
  • Health IT
    • Behavioral Health
    • Care Coordination
    • EMR/EHR
    • Interoperability
    • Patient Engagement
    • Population Health Management
    • Revenue Cycle Management
    • Social Determinants of Health
  • Digital Health
    • AI
    • Blockchain
    • Precision Medicine
    • Telehealth
    • Wearables
  • Startups
  • M&A
  • Value-based Care
    • Accountable Care (ACOs)
    • Medicare Advantage
  • Life Sciences
  • Research

Leveraging the AI Opportunity for Healthcare Unstructured Data

by Krishna Subramanian, Co-Founder at Komprise 03/20/2024 Leave a Comment

  • LinkedIn
  • Twitter
  • Facebook
  • Email
  • Print
Krishna Subramanian, Co-Founder at Komprise

Unstructured data – such as medical images, scans and lab reports – opens the door to a wide range of exciting new possibilities in the world of healthcare. By deriving insights from unstructured data, healthcare organizations can deliver better patient care, as well as unlock powerful capabilities – like using AI models to streamline medical note-taking, reshape hospital billing and improve medical coding. 

To do these things, however, you need to be able to track and access all your data. This can be challenging, given that unstructured data comes in many forms and is often scattered across disparate locations. But with the right strategy and tools in place, leveraging unstructured data to power healthcare innovations is possible. 

What is unstructured data, and why is it important in healthcare? 

Unstructured data consists of any data or files that may be organized using folders and directories and is not housed in a database where it can be systematically tracked, secured and protected. Common examples of unstructured data in healthcare include: 

  • X-rays, MRIs and CT scan images. 
  • Text documents containing medical notes or patient evaluations. 
  • Lab reports. 
  • Audio recordings taken during evaluations of patients. 

Healthcare providers routinely need to access data assets like these to deliver quality care to their patients. In addition, researchers may analyze unstructured healthcare data to help answer questions like how effective a given procedure is in mitigating the impact of an illness, or how many patients respond effectively to a new medication. 

In addition, the ongoing AI revolution is opening varied opportunities for leveraging unstructured healthcare data to train AI tools and services. Most AI tools and services work by scanning large volumes of data. The more data you feed them and the more representative that data is of the real-world conditions you want your AI solutions to understand, the more effective they will be. To take full advantage of AI, being able to access and train on all relevant unstructured data is paramount. 

The challenges of managing unstructured healthcare data 

Unfortunately, ensuring that providers and researchers can access data quickly is often deeply challenging, for several reasons. 

One is the sheer size of some types of healthcare data. Text files are typically small, but files such as X-ray and CT images can consume as much as 30 megabytes each. If a facility takes just a few dozen images each day, they quickly add up to fill many gigabytes’ worth of space each month. Specialized data, such as genomic sequencing results, are larger still. Sequencing just one person’s genes requires as much as 200 gigabytes of storage. 

The large size of unstructured healthcare data presents a challenge because the larger your data, the more it costs to store it, especially if you keep it in the same location where it was generated instead of taking advantage of lower-cost storage options where feasible. Healthcare organizations may also be reluctant to back up the data given the high costs of storing backups. And they may be tempted to delete data earlier than they would like in a bid to save on storage costs. 

Healthcare regulations present another challenge. Different types of data are subject to different rules and regulations, such as retention mandates that require providers to store data for a certain period. When you have a large volume of unstructured files to work with, keeping track of which regulations apply to which ones can become quite difficult. 

The fact that healthcare data is often generated by complex, siloed systems makes efficient data management even harder. Within a single organization, there may be dozens of different software systems and platforms collecting healthcare data. This distributed environment makes it hard to track, secure and protect data in a centralized way. 

The cost of poor unstructured data management 

Failing to address challenges like this doesn’t only mean that healthcare providers and researchers will struggle to work efficiently. It also has serious consequences from a business perspective. 

For one, unstructured data that organizations fail to track effectively because it is scattered across too many locations can bloat storage budgets. Instead of consolidating the data into centralized storage repositories – such as “cold” data storage tiers in the cloud – where monthly storage fees are just fractions of a penny per gigabyte, enterprises may be left paying many times that amount due to their inability to easily move data to a more cost-effective storage solution. 

Inadequate data management can also lead to compliance violations. Sensitive data may end up residing in locations without the access controls mandated by frameworks like HIPAA, for example, because the business simply didn’t know it was storing sensitive information in the wrong place. Given that a single HIPAA violation can cost as much as $68,000, the financial impact of noncompliance gets onerous quickly. 

On top of this, inefficient management of unstructured data slows down innovation and time to value. Being able to access and process data quickly is important for initiatives like AI-powered analytics. Healthcare organizations that can’t centrally track or manage their data will struggle to move faster than competitors. 

Putting unstructured healthcare data to better use 

The key to avoiding these risks is to implement a comprehensive strategy for managing unstructured healthcare data. Your strategy should allow you to: 

  • Locate all data assets, no matter which system produced them or where they reside. 
  • Tag and label unstructured data so that you know where it originated, its purpose and which compliance or other special requirements relate. Tagging can also accelerate the process of locating data that is relevant for AI training for a particular use case. 
  • Implement effective access controls, backup routines and other data security and protection measures based on each asset’s requirements. 
  • Identify data that could be moved to a different storage location or tier to reduce costs. 

When you can do these things, you turn unstructured healthcare data from a cost and compliance liability into an asset for long-term value and AI. In turn, you improve outcomes for patients, medical research and business stakeholders and inform the creation of new value-added products and services that patients need. 


About Krishna Subramanian 

Krishna Subramanian is the COO, president, and co-founder of Komprise. In her career, Subramanian has built three successful venture-backed IT businesses and was named a “2021 Top 100 Women of Influence” by Silicon Valley Business Journal.

  • LinkedIn
  • Twitter
  • Facebook
  • Email
  • Print

Tagged With: Artificial Intelligence

Tap Native

Get in-depth healthcare technology analysis and commentary delivered straight to your email weekly

Reader Interactions

Primary Sidebar

Subscribe to HIT Consultant

Latest insightful articles delivered straight to your inbox weekly.

Submit a Tip or Pitch

Featured Insights

2025 EMR Software Pricing Guide

2025 EMR Software Pricing Guide

Featured Interview

Paradigm Shift in Diabetes Care with Studio Clinics: Q&A with Reach7 Founder Chun Yong

Most-Read

Medtronic to Separate Diabetes Business into New Standalone Company

Medtronic to Separate Diabetes Business into New Standalone Company

White House, IBM Partner to Fight COVID-19 Using Supercomputers

HHS Sets Pricing Targets for Trump’s EO on Most-Favored-Nation Drug Pricing

23andMe to Mine Genetic Data for Drug Discovery

Regeneron to Acquire Key 23andMe Assets for $256M, Pledges Continuity of Consumer Genome Services

CureIS Healthcare Sues Epic: Alleges Anti-Competitive Practices & Trade Secret Theft

The Evolving Role of Physician Advisors: Bridging the Gap Between Clinicians and Administrators

The Evolving Physician Advisor: From UM to Value-Based Care & AI

UnitedHealth Group Names Stephen Hemsley CEO as Andrew Witty Steps Down

UnitedHealth CEO Andrew Witty Steps Down, Stephen Hemsley Returns as CEO

Omada Health Files for IPO

Omada Health Files for IPO

Blue Cross Blue Shield of Massachusetts Launches "CloseKnit" Virtual-First Primary Care Option

Blue Cross Blue Shield of Massachusetts Launches “CloseKnit” Virtual-First Primary Care Option

Osteoboost Launches First FDA-Cleared Prescription Wearable Nationwide to Combat Low Bone Density

Osteoboost Launches First FDA-Cleared Prescription Wearable Nationwide to Combat Low Bone Density

2019 MedTech Breakthrough Award Category Winners Announced

MedTech Breakthrough Announces 2025 MedTech Breakthrough Award Winners

Secondary Sidebar

Footer

Company

  • About Us
  • Advertise with Us
  • Reprints and Permissions
  • Submit An Op-Ed
  • Contact
  • Subscribe

Editorial Coverage

  • Opinion
  • Health IT
    • Care Coordination
    • EMR/EHR
    • Interoperability
    • Population Health Management
    • Revenue Cycle Management
  • Digital Health
    • Artificial Intelligence
    • Blockchain Tech
    • Precision Medicine
    • Telehealth
    • Wearables
  • Startups
  • Value-Based Care
    • Accountable Care
    • Medicare Advantage

Connect

Subscribe to HIT Consultant Media

Latest insightful articles delivered straight to your inbox weekly

Copyright © 2025. HIT Consultant Media. All Rights Reserved. Privacy Policy |