Terminology model discovery using natural language processing and visualization techniques

Li Zhou, Ying Tao, James J. Cimino, Elizabeth S. Chen, Hongfang Liu, Yves A. Lussier, George Hripcsak, Carol Friedman

Research output: Contribution to journalArticlepeer-review

9 Scopus citations


Medical terminologies are important for unambiguous encoding and exchange of clinical information. The traditional manual method of developing terminology models is time-consuming and limited in the number of phrases that a human developer can examine. In this paper, we present an automated method for developing medical terminology models based on natural language processing (NLP) and information visualization techniques. Surgical pathology reports were selected as the testing corpus for developing a pathology procedure terminology model. The use of a general NLP processor for the medical domain, MedLEE, provides an automated method for acquiring semantic structures from a free text corpus and sheds light on a new high-throughput method of medical terminology model development. The use of an information visualization technique supports the summarization and visualization of the large quantity of semantic structures generated from medical documents. We believe that a general method based on NLP and information visualization will facilitate the modeling of medical terminologies.

Original languageEnglish (US)
Pages (from-to)626-636
Number of pages11
JournalJournal of Biomedical Informatics
Issue number6
StatePublished - Dec 2006


  • Information visualization
  • Natural language processing
  • Terminology model

ASJC Scopus subject areas

  • Health Informatics
  • Computer Science Applications


Dive into the research topics of 'Terminology model discovery using natural language processing and visualization techniques'. Together they form a unique fingerprint.

Cite this