TY - JOUR
T1 - Correcting common errors in identifying cancer-specific serum peptide signatures
AU - Villanueva, Josep
AU - Philip, John
AU - Chaparro, Carlos A.
AU - Li, Yongbiao
AU - Toledo-Crow, Ricardo
AU - DeNoyer, Lin
AU - Fleisher, Martin
AU - Robbins, Richard J.
AU - Tempst, Paul
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2005/7
Y1 - 2005/7
N2 - 'Molecular signatures' are the qualitative and quantitative patterns of groups of biomolecules (e.g., mRNA, proteins, peptides, or metabolites) in a cell, tissue, biological fluid, or an entire organism. To apply this concept to biomarker discovery, the measurements should ideally be noninvasive and performed in a single read-out. We have therefore developed a peptidomics platform that couples magnetics-based, automated solid-phase extraction of small peptides with a high-resolution MALDI-TOF mass spectrometric readout (Villanueva, J.; Philip, J.; Entenberg, D.; Chaparro, C. A.; Tanwar, M. K.; Holland, E. C.; Tempst, P. Anal. Chem. 2004, 76, 1560-1570). Since hundreds of peptides can be detected in microliter volumes of serum, it allows to search for disease signatures, for instance in the presence of cancer. We have now evaluated, optimized, and standardized a number of clinical and analytical chemistry variables that are major sources of bias; ranging from blood collection and clotting, to serum storage and handling, automated peptide extraction, crystallization, spectral acquisition, and signal processing. In addition, proper alignment of spectra and user-friendly visualization tools are essential for meaningful, certifiable data mining. We introduce a minimal entropy algorithm, 'Entropycal', that simplifies alignment and subsequent statistical analysis and increases the percentage of the highly distinguishing spectral information being retained after feature selection of the datasets. Using the improved analytical platform and tools, and a commercial statistics program, we found that sera from thyroid cancer patients can be distinguished from healthy controls based on an array of 98 discriminant peptides. With adequate technological and computational methods in place, and using rigorously standardized conditions, potential sources of patient related bias (e.g., gender, age, genetics, environmental, dietary, and other factors) may now be addressed.
AB - 'Molecular signatures' are the qualitative and quantitative patterns of groups of biomolecules (e.g., mRNA, proteins, peptides, or metabolites) in a cell, tissue, biological fluid, or an entire organism. To apply this concept to biomarker discovery, the measurements should ideally be noninvasive and performed in a single read-out. We have therefore developed a peptidomics platform that couples magnetics-based, automated solid-phase extraction of small peptides with a high-resolution MALDI-TOF mass spectrometric readout (Villanueva, J.; Philip, J.; Entenberg, D.; Chaparro, C. A.; Tanwar, M. K.; Holland, E. C.; Tempst, P. Anal. Chem. 2004, 76, 1560-1570). Since hundreds of peptides can be detected in microliter volumes of serum, it allows to search for disease signatures, for instance in the presence of cancer. We have now evaluated, optimized, and standardized a number of clinical and analytical chemistry variables that are major sources of bias; ranging from blood collection and clotting, to serum storage and handling, automated peptide extraction, crystallization, spectral acquisition, and signal processing. In addition, proper alignment of spectra and user-friendly visualization tools are essential for meaningful, certifiable data mining. We introduce a minimal entropy algorithm, 'Entropycal', that simplifies alignment and subsequent statistical analysis and increases the percentage of the highly distinguishing spectral information being retained after feature selection of the datasets. Using the improved analytical platform and tools, and a commercial statistics program, we found that sera from thyroid cancer patients can be distinguished from healthy controls based on an array of 98 discriminant peptides. With adequate technological and computational methods in place, and using rigorously standardized conditions, potential sources of patient related bias (e.g., gender, age, genetics, environmental, dietary, and other factors) may now be addressed.
KW - Bias
KW - Biomarkers
KW - Mass spectrometry
KW - Patterns
KW - Peak alignment
KW - Peptides
KW - Sample processing
KW - Serum
KW - Specimen collection
UR - http://www.scopus.com/inward/record.url?scp=23944456398&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=23944456398&partnerID=8YFLogxK
U2 - 10.1021/pr050034b
DO - 10.1021/pr050034b
M3 - Article
C2 - 16083255
AN - SCOPUS:23944456398
SN - 1535-3893
VL - 4
SP - 1060
EP - 1072
JO - Journal of Proteome Research
JF - Journal of Proteome Research
IS - 4
ER -