Proteogenomic Analysis to Identify Missing Proteins from Haploid Cell Lines

Seung Eun Lee, Jong Keon Song, Korbinian Bösl, André C. Müller, Dijana Vitko, Keiryn L. Bennett, Giulio Superti-Furga, Akhilesh Pandey, Richard K. Kandasamy, Min Sik Kim

Research output: Contribution to journal › Article › peer-review

8 Scopus citations


The Chromosome-centric Human Proteome Project aims to identify and characterize the protein products of all human protein-coding genes. As of early 2017, 19 837 protein-coding genes had been annotated in the neXtProt database, including 2691 missing proteins that have never been identified by mass spectrometry. Missing proteins may be of low abundance in many cell types, or expressed in only a few cell types in the human body, such as sperm in the testis. In this study, we performed expression proteomics of two near-haploid cell lines, HAP1 and KBM-7, to hunt for missing proteins. Proteomes from the two haploid cell lines were analyzed on an LTQ Orbitrap Velos, producing a total of 200 raw mass spectrometry files. After applying a 1% false discovery rate at both the peptide-spectrum match and protein levels, more than 10 000 proteins were identified from HAP1 and KBM-7, including nine missing proteins. Next, unmatched spectra were searched against protein databases translated in three frames from noncoding RNAs derived from RNA-Seq data, yielding six novel protein-coding regions after careful manual inspection. This study demonstrates that expression proteomics coupled with proteogenomic analysis can be employed to identify many annotated and unannotated missing proteins.
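The three-frame translation step mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names (`translate`, `three_frame_translate`), the `min_length` cutoff, and the choice to split translations at stop codons are all assumptions made for illustration. Only the forward strand is translated, on the assumption that the transcript sequence is already given in its transcribed orientation.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate a nucleotide sequence codon by codon ('*' marks a stop)."""
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA alphabets
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    # Codons containing ambiguous bases (e.g. N) become 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frame_translate(transcript, min_length=7):
    """Translate a transcript in the three forward frames.

    Returns {frame_number: [stop-free segments]}. Segments shorter than
    min_length (a hypothetical cutoff) are dropped, since very short
    sequences are unlikely to yield searchable peptides.
    """
    result = {}
    for frame in range(3):
        protein = translate(transcript[frame:])
        result[frame + 1] = [
            segment for segment in protein.split("*") if len(segment) >= min_length
        ]
    return result


if __name__ == "__main__":
    # Toy transcript, not real data.
    print(three_frame_translate("ATGGCCTAAGG", min_length=1))
```

Segments produced this way could then be written to a FASTA file and used as a custom search database for spectra left unmatched by the reference proteome search.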

Original language: English (US)
Article number: 1700386
Issue number: 8
State: Published - Apr 2018


Keywords

  • Haploid cell lines
  • Proteogenomics
  • RNA-Seq
  • lncRNA
  • missing protein

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology


