LFXtractor: Text chunking for long form detection from biomedical text

Min Song, Hongfang Liu

Research output: Contribution to journalArticlepeer-review

Abstract

In this paper, we propose a novel method to detect the corresponding long forms (LFs) of short forms (SFs) from biomedical text. The proposed method is differentiated from others as follows: • it incorporates lexical analysis techniques into supervised learning for extracting abbreviations • it utilises text-chunking techniques to identify LFs of abbreviations • it significantly improves recall. The experimental results show that our approach outperforms the leading abbreviation algorithms, ExtractAbbrev, ALICE and Acrophile and a collocation-based approach at least by 4.8, 6.0, 9.0 and 6.0%, respectively, in both precision and recall on the Gold Standard Development corpus.

Original languageEnglish (US)
Pages (from-to)89-102
Number of pages14
JournalInternational Journal of Functional Informatics and Personalised Medicine
Volume3
Issue number2
DOIs
StatePublished - 2010

Keywords

  • Abbreviation extraction
  • Text chunking
  • Text mining

ASJC Scopus subject areas

  • Clinical Neurology

Fingerprint

Dive into the research topics of 'LFXtractor: Text chunking for long form detection from biomedical text'. Together they form a unique fingerprint.

Cite this