Detecting Serendipitous Drug Usage in Social Media with Deep Neural Network Models

Boshu Ru, Dingcheng Li, Lixia Yao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Serendipitous drug usage refers to unexpected relief of comorbid diseases or symptoms when patients take a drug for another common or known indication. In the history of drug discovery, serendipity has contributed significantly to new and successful indications for many drugs. Our previous research has identified patient reported serendipitous drug usage in social media. If such information could be computationally identified in social media, it could be helpful for generating and validating drug-repositioning hypotheses. In this study, we framed detection of serendipitous drug usage in social media as a binary classification problem and investigated deep neural network models as a solution. We constructed word-embedding features from drug-review posts in the patient forum of WebMD, using the word2vec algorithm. We adopted the convolutional neural network (CNN), long short-term memory network (LSTM), and convolutional long short-term memory network (CLSTM) and redesigned them by adding contextual information that we extracted from drug-review posts, information filtering tools, medical ontology, and medical knowledge. We trained, tuned, and evaluated our deep neural network models on a gold standard dataset containing 15,714 sentences, of which 447 contained serendipitous drug usages. Additionally, we compared our deep neural networks to support vector machine, random forest, and AdaBoost.M1 algorithms. The results showed that adding context information helped to reduce the false-positive rate of deep neural network models. In the presence of an extremely imbalanced dataset and limited instances of serendipitous drug usage, deep neural network models did not outperform other machine learning models with n-gram and context features. However, deep neural network models could more effectively utilize word embedding in feature construction. This advantage made deep neural networks worthy of further investigation and improvement.

Original languageEnglish (US)
Title of host publicationProceedings - 2018 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2018
EditorsHarald Schmidt, David Griol, Haiying Wang, Jan Baumbach, Huiru Zheng, Zoraida Callejas, Xiaohua Hu, Julie Dickerson, Le Zhang
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1083-1090
Number of pages8
ISBN (Electronic)9781538654880
DOIs
StatePublished - Jan 21 2019
Event2018 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2018 - Madrid, Spain
Duration: Dec 3 2018Dec 6 2018

Publication series

NameProceedings - 2018 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2018

Conference

Conference2018 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2018
Country/TerritorySpain
CityMadrid
Period12/3/1812/6/18

Keywords

  • Data mining
  • drug discovery
  • drug repurposing
  • health informatics
  • social media

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics

Fingerprint

Dive into the research topics of 'Detecting Serendipitous Drug Usage in Social Media with Deep Neural Network Models'. Together they form a unique fingerprint.

Cite this