TY - GEN
T1 - Finding Difficult-to-Disambiguate Words
T2 - 3rd IEEE International Conference on Healthcare Informatics, ICHI 2015
AU - Torii, Manabu
AU - Fan, Jung Wei
AU - Zisook, Daniel S.
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/12/8
Y1 - 2015/12/8
N2 - In the biomedical and clinical domain, valuable information is frequently represented in free-text documents. Natural language processing (NLP) is a powerful tool that can extract structured information from theses documents. Word sense disambiguation (WSD) is a critical component in an NLP pipeline that increases the accuracy of the extracted information. However, WSD is expensive to apply for all known ambiguous words. Given limited time and resources, one practical strategy is to prioritize easy-to-disambiguate words and efficiently maximize the coverage of disambiguation. To aid prioritization efforts, we studied two quantitative indicators that are associated with how easy/difficult it is to disambiguate any given word.
AB - In the biomedical and clinical domain, valuable information is frequently represented in free-text documents. Natural language processing (NLP) is a powerful tool that can extract structured information from theses documents. Word sense disambiguation (WSD) is a critical component in an NLP pipeline that increases the accuracy of the extracted information. However, WSD is expensive to apply for all known ambiguous words. Given limited time and resources, one practical strategy is to prioritize easy-to-disambiguate words and efficiently maximize the coverage of disambiguation. To aid prioritization efforts, we studied two quantitative indicators that are associated with how easy/difficult it is to disambiguate any given word.
KW - Medical Informatics
KW - Natural Language Processing
KW - Word Sense Disambiguation
UR - http://www.scopus.com/inward/record.url?scp=84966277133&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84966277133&partnerID=8YFLogxK
U2 - 10.1109/ICHI.2015.66
DO - 10.1109/ICHI.2015.66
M3 - Conference contribution
AN - SCOPUS:84966277133
T3 - Proceedings - 2015 IEEE International Conference on Healthcare Informatics, ICHI 2015
SP - 448
BT - Proceedings - 2015 IEEE International Conference on Healthcare Informatics, ICHI 2015
A2 - Fu, Wai-Tat
A2 - Balakrishnan, Prabhakaran
A2 - Harabagiu, Sanda
A2 - Wang, Fei
A2 - Srivatsava, Jaideep
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 21 October 2015 through 23 October 2015
ER -