In our research we use fresh or frozen (samples are referred to as fresh or frozen) salivary gland tissue samples, healthy or tumor. Peptides are extracted from the tissue samples using phosphate buffer (PBS extracts) or 0.1% trifluoroacetic acid solution in water (TFA extracts). A more detailed description of sample preparation is provided in the Methods section.
2.3. Bioinformatic Analysis
In searching for potential biomarkers of salivary gland cancer, peptides with overlapping sequences are of little interest. The peptides identified in each of the analyses (see
Section 2.2) were compared with each other for overlapping sequences, and then only those not contained in the sequence of other peptides were selected as unique peptides. In addition, stand-alone peptides were also identified that were not included in the sequence of others and did not contain any identified peptides themselves (see
Figure 2). As it is shown in
Figure 2 peptides do not mark * or ** are not consider in further analysis.
Table 2 provides a number of all peptides identified in each analysis, categorized into unique and stand-alone statuses. Less than half of all identified peptides could be classified as unique peptides, and 10-20% as stand-alone peptides. There are no significant differences between the content of unique and stand-alone peptides between tumor samples and controls and between different extraction methods. However, we noted noticeable differences between the peptide composition of samples from salivary glands and saliva. Unique peptides accounted for 46.9% of all peptides identified in salivary gland samples (4869 of 10380) and stand-alone 20% (2077 of 10380). For saliva samples, the values are much lower, at 35% for unique peptides (1490 of 4259) and 10% for stand-alone peptides (420 of 4259). Lower number of unique and stand-alone peptides in saliva samples probably are due higher proteolytic activity in saliva caused by presence of large number of microorganisms in oral cavity.
The next step was to look at peptides identified in saliva and compare them with peptide sequences from salivary gland extracts. We assumed that intense proteolytic enzyme activity occurs in human saliva, so we focused on looking for saliva peptides whose sequences are identical with peptides identified in salivary glands extracts (common peptides) or entirely contained within the sequences of longer peptides of salivary glands origin (overlapping sequences). The results of this analysis are presented in
Table 3.
We analyzed the peptide data sets obtained in the study (e.g., saliva patient) and the sets created by excluding peptides in common with the control (e.g., saliva patient only). The comparison of peptides specific to patients' saliva (Saliva patient only) with peptides specific to salivary gland tumors (SG tumor only) is particularly noteworthy. The peptides identified in this analysis may be potential biomarkers to characterize salivary gland tumors. We found 16 peptides that are common to both data sets compared (Saliva patient only and SG tumor only), but also 109 peptides identified in saliva whose sequences were contained within the sequences of longer peptides identified in salivary gland tumor extracts. Sequences of peptides identified in the group Saliva patient only - SG tumor only (see
Table 3) are shown in
Supplementary Table S4.
2.4. Characterization of Peptide Biomarkers of Salivary Gland Tumors
Our research focuses on peptidomic characterization of salivary gland tumors, but it is challenging to analyze peptides in isolation from the parent protein. Therefore, the next step was to look at the proteins from which the peptides were identified as observed only in patient saliva samples and extracts from tumor tissue (Saliva patient only - SG tumor only in
Table 3). All 109 peptides were assigned to 24 protein groups (see
Table 4). Typical saliva-specific proteins were the most heavily represented, accounting for nearly half of all identified peptides. Essential belongs to basic salivary proline-rich proteins (32 peptides) (marked white in
Table 4) and salivary acidic proline-rich phosphoprotein (17 peptides). In addition, one peptide belonging to the parotid secretory SPLUNC2 (short palate, lung and nasal epithelium clone 2) protein was identified. Among other proteins, relatively many peptides were identified for actin (9 peptides), histones (9 peptides), hemoglobin (5 peptides), and keratin (4 peptides) (marked white in
Table 4). These proteins, and consequently the peptides derived from them, are commonly found in many tissues, so, like saliva-specific proteins, they do not appear to be particularly interesting from a diagnostic point of view.
The situation is different for the remaining proteins: 17 of them, identified based on 30 peptides, have been linked in the literature to various malignancies. The studies that report those findings concern proteomics, not peptidomics, but they remain an exciting benchmark for considering the biological significance of the peptides found in this study.
At least 7 (marked in red in
Table 4) of these 15 proteins have previously been shown to have differential expression in salivary gland tumors: annexin A1 (ANXA1), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), fibrinogen beta chain (FBG), heat shock protein beta-1 (HSPB1), immunoglobulin gamma-1 chain C region (IGHG1), BPI fold-containing family A member 2 (BPIFA2), and vimentin (VIM). Donadio et al. [
13] observed that ANXA1 and GAPDH are up-regulated in pleomorphic adenomas; and FBG, HSPB1, and IGHG1 are up-regulated in Warthin's tumors. Also, Seccia et al. [
12] observed up-regulation of ANXA1 in pleomorphic adenomas and IGHG1 in Warthin's tumors, while Mutlu et al. [
15] observed up-regulation of VIM and down-regulation of FGB in pleomorphic adenomas patients. Pereira et al. [
24] suggested BPIFA2 as a potential biomarker for salivary mucoepidermoid carcinomas. González-Arriagada et al. [
25] showed that BPIFA2 level in head and neck cancer decreases after radiotherapy.
To our knowledge, the remaining proteins (marked yellow in
Table 4) have not been associated with salivary gland tumors but are associated with other types of cancer. Overexpression of alpha-1-antitrypsin (SERPINA1) promotes tumor progression in colorectal [
26] and gastric [
27] cancer, and its high level in plasma was observed in lung and prostate cancer patients [
28]. Apolipoprotein A-II (APOA2) was proposed as a potential marker for urinary bladder [
29] and pancreatic cancer [
30]. Cystatin B was proposed as a biomarker in various tumors like bladder and ovarian cancer [
31,
32]. Glutathione S-transferase P (GSTP1) is downregulated in prostate cancer and up-regulated in many cancer types, e.g., colorectal, thyroid, or breast cancer [
33]. Also, a high level of glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was found in various cancer types [
34]. AHNAK was proposed as a biomarker for bladder urothelial carcinoma [
35] and is strictly related to cell migration in mesothelioma [
36]. Calprotectin, a heterodimer formed from the combination of S100A8 and S100A9 proteins, is commonly up-regulated in many tumors and likely plays a critical (essential) role in inflammation-associated cancers [
37]. Protein-glutamine gamma-glutamyltransferase E (TGM3) was proposed as a marker for some head and neck cancer types [
38,
39]. Thymosin beta-4 (TMSB4X) is overexpressed in pancreatic cancer [
40] and was proposed as a biomarker for colorectal cancer [
41]. Triosephosphate isomerase (TPI1) is up-regulated in neoplasms like gastric or breast cancer and is related to cell migration and invasion [
42,
43].
Analyzing the peptidome of tumor tissue extracts, a complete absence of peptides derived from saliva-specific proteins was observed. As it is shown in
Figure 3 and
Supplementary Figure S2 extracts derived from tumor tissue do not contain peptides derived from statherin (STATH), submaxillary gland androgen-regulated protein 3b (SMR3B), histatin 1 (HTN1) and histatin 3 (HTN3) proteins. Peptides derived from these four proteins are abundantly represented in healthy tissue extracts and saliva samples (both patient and control). To ensure quality of identification we compare our results with reference data form Peptide Atlas database [
44]. As it is shown
Supplementary Table S5, for all mentioned proteins we detect larger number of peptides than is deposited in Peptide Atlas database [
44]. In the case of histatin 1 and histatin 3 we can identify in one extract sample from tumor tissue only one peptide belonging to those proteins, we suspect that such identification in caused by not perfect separation of healthy tissue form tumor (see panel B and C in
Supplementary Figure S2). It is very interesting that all four proteins mentioned above are specifically produced in salivary glands [
45] and their genes are located on chromosome 4 in very close proximity one to another (
www.ncbi.nlm.nih.gov/gene/3347;www.ncbi.nlm.nih.gov/gene/3346;www.ncbi.nlm.nih.gov/gene/6779; www.ncbi.nlm.nih.gov/gene/10879). The lack of identification of peptides from proteins STATH, SMR3B, HTN1, HTN3 in extracts from tumor tissues may suggest that there is a complete suppression of the production of these proteins in tumor tissues. It is of great interest that salivary gland tumors are extremely diverse [
3] and in our study samples from different types of tumors were used (see
Table S1). However, regardless of tumor type, the effect is the same lack of occurrence of peptides derived from proteins STATH, SMR3B, HTN1, HTN3. The fact that peptides from these proteins are present in the saliva of patients with detected cancer is related to the fact that not all salivary glands are occupied by cancer, so that the remaining healthy glands are able to produce the proteins in question.