|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 8:2595-2612, 2009.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
In solid tumors the angiogenic switch is thought to occur when the balance between proangiogenic and antiangiogenic molecules is shifted in favor of angiogenesis, permitting rapid tumor growth and subsequent development of invasive and metastatic properties (7). Thus, aggressive tumor growth depends on a successful adaptation of the tumor cells to the host microenvironment. In brain tumors no biomarkers are currently available that define different cell populations within human GBMs (for instance tumor cells that show infiltrative growth and those that trigger angiogenesis) or that predict the propensity of low grade (non-angiogenic) gliomas to develop into malignant angiogenic gliomas. We have recently generated a xenograft model for human GBM that displays a highly invasive phenotype and stem cell characteristics (8). By serial transplantation in nude rats new cell clones eventually develop that generate a more rapidly growing aggressive, angiogenesis-dependent phenotype. The transition to an angiogenic phenotype is accompanied by a reduced infiltrative growth (8). Thus, we are able to initiate two distinct phenotypes from human GBMs that classify their growth and progression. Our model is extremely useful for identifying mechanisms causing the switch from angiogenesis-independent to angiogenesis-dependent tumor growth.
This work was aimed at identifying cell membrane markers and molecular pathways that characterize the two phenotypes and may underlie the angiogenic switch. Such markers may represent potential therapeutic targets toward specific cellular subsets within GBMs. Here we applied iTRAQ peptide labeling on membrane-enriched tumor fractions followed by MALDI-TOF/TOF protein identification and bioinformatics analysis to quantify large scale species-specific protein expression over four consecutive generations of the glioma xenograft model.
In a search for disease biomarkers, there has been a rapid development of quantitative protein expression technologies including isobaric peptide tagging (iTRAQ) combined with multidimensional LC and MS/MS analysis (9). This approach allows for sample multiplexing (currently 4- or 8-plex at the time). iTRAQ is particularly powerful when applied on a subfraction of the proteome, thereby increasing the possibility of identifying less abundant proteins (10). Because more than a third of all known biomarkers as well as more than two-thirds of known and potential antitumor protein targets are membrane-related proteins (11–14), we focused on membrane-enriched fractions of the tumor xenografts. In four different iTRAQ experiments we were able to identify over 7000 (redundant) proteins of which 1460 proteins were extracted based on quantifiable and species-specific expression. Correspondence analysis and unsupervised cluster analysis confirmed consistent protein expression profiles in the different xenograft phenotypes generated from different patient samples. The expression of a selection of identified candidates was confirmed by immunohistochemical methods on tissue microarrays (TMAs) from a large number of xenograft tumors and patient gliomas. The differentially expressed proteins identified in the two phenotypes represent unique candidate biomarkers that may represent novel therapeutic targets in GBMs. The information generated also provides novel insight into the molecular networks governing the infiltrative and the angiogenic tumor properties and reveals new mechanisms involved in the angiogenic switch in GBMs.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
|
|
Sample Labeling
Sample labeling with isobaric tagging reagents was carried out according to the manufacturer's instructions with modifications (iTRAQ® Reagents Multiplex kit; Applied Biosystems/MDS Sciex, Foster City, CA). Briefly 100 µg of membrane suspensions was vacuum-dried, resuspended in 25 µl of Rapigest detergent solution (Waters) dissolved in 125 µl of Dissolution buffer from the iTRAQ kit per vial of detergent, and vortexed. 2 µl of reducing agent (Applied Biosystems/MDS Sciex) was added followed by incubation with vortexing for 2 h at room temperature. After centrifugation, cysteine-blocking agent was added and incubated for 10 min at room temperature followed by addition of 2 µg of trypsin (modified sequencing grade; Promega, Madison, WI) and incubation for 12 h at 37 °C. Digested samples were labeled with four different iTRAQ reagents dissolved in 80 µl of ethanol and vortexed at room temperature for 4 h. After the labeling, the four samples were pooled, 400 µl of 1% (v/v) TFA was added to cleave the Rapigest detergent, and samples were vortexed for 1 h at room temperature and centrifuged. The supernatant was removed and vacuum-dried. Four separate quadruplex iTRAQ experiments were performed. 1) In iTRAQ1 light membranes from early and late generations of two different GBM patients (P6 and P17) were analyzed with P6 first and last labeled with iTRAQ 114 and 115, respectively, and P17 first and last labeled with iTRAQ 116 and 117, respectively. 2) In iTRAQ2 plasma membranes of the same samples were analyzed. 3) and 4) In iTRAQ3 and iTRAQ4 plasma membranes from four consecutive generations of xenografts originating from the two GBM patients (P6 and P17, respectively) were analyzed. Samples were labeled as follows: iTRAQ 114, first (early); iTRAQ 115, second; iTRAQ 116, third; and iTRAQ 117, fourth (late) generation. The experimental flowchart is summarized in Fig. 2.
Two-dimensional Liquid Chromatography
The iTRAQ labeled peptide pool was resuspended in 170 µl of buffer A (10 mM KH2PO4, 20% (v/v) acetonitrile, pH 2.9) and separated on a strong cation exchange (SCX) column (PolySULFOETHYL Aspartamide, 100 x 2.1 mm, 3 µm, 300 Å (Poly LC, Columbia, MD)) under the following conditions: flow, 200 µl, 20-min gradient, 0–60% B (A + 500 mmol KCl); 5 min, 60–100% B; 4 min, 100% B. Minute fractions were collected in a fraction collector and vacuum-dried. 20 peak SCX fractions were dissolved in 30–60 µl of 0.1% (v/v) TFA and further separated on a microcapillary reverse phase column (ReproSil-Pur C18 Aqua 3-µm material, 100-µm-inner diameter, 15-cm column) under the following conditions: flow, 400 µl; buffer A, 5% (v/v) acetonitrile, 0.05% (v/v) TFA; buffer B, 80% (v/v) acetonitrile, 0.04% (v/v) TFA; 6 min, 5% B; 45 min, 5–45% B; 1 min, 45–90% B; 5 min, 90% B; 1 min, 90–5% B. The reverse phase fractions were premixed with the matrix (6 mg/ml
-cyanohydroxycinnaminic acid in 33.3% (v/v) acetonitrile, 1 mM ammonium citrate) and spotted on line onto a MALDI target plate using a Probot Microfraction Collector (Dionex, Sunnyvale, CA). 192 spots were spotted per fraction.
MALDI-TOF-TOF Mass Spectrometry
For all measurements, a 4800 MALDI-TOF/TOF Analyzer (Applied Biosystems/MDS Sciex) operated under 4000 Series Explorer v.3.5.1 software (Applied Biosystems/MDS Sciex) was used. The instrument was equipped with a 20-µJ neodymium-doped yttrium aluminium garnet (Nd-YAG) laser operating at 355 nm and 200 Hz and was used in reflector-positive ion mode. For calibration and tuning purposes, 4700 Cal Mix (Applied Biosystems/MDS Sciex) containing six calibrant peptides in the m/z range of 904–3658 were spotted on 13 dedicated calibration spot locations distributed over each MALDI target plate. Before each job run, the instrument was retuned (including deflector, mirror, time ion selector, and laser settings), and MS and MS/MS default calibration was updated. Automatic jobs were created for two 384-spot MALDI target plates at a time. In MS analyses, 1250 shots (25 subspectra at randomized locations over the whole spot, 50 shots/spectrum) were acquired and averaged over an m/z window of 800–4000. For each spot, up to 30 of the strongest precursor peaks with a minimal signal-to-noise ratio of 35 were automatically selected for MS/MS analysis using a job-wide interpretation method that excludes multiple analyses of the same precursor with a 200-ppm tolerance. In MS/MS analyses, acquisition was from the weakest to the strongest selected precursor for each spot. For each precursor, 2500 shots (50 subspectra at randomized locations over the whole spot, 50 shots/spectrum) were acquired and averaged. All MS/MS fragmentation (collision-induced dissociation) was performed with air at medium pressure (
1 x 10–6 torr) and using 2-keV collision energy.
Analysis of the Spectra and Extraction of Species-specific Proteins
MS/MS spectra were searched against databases with trypsin specificity, allowing one missed cleavage and fixed iTRAQ modifications at lysine residues and the N termini of the peptides, using GPS Explorer v.3.6. (Applied Biosystems/MDS Sciex), which incorporates the Mascot (v.2.1.) search algorithm (Matrix Science Inc., Boston, MA). Mass tolerance was set to 100 ppm for precursor ions and 0.5 Da for fragment ions. The Mascot confidence interval (CI) in percent was calculated by GPS Explorer (Applied Biosystems/MDS Sciex) based on the Mascot ion scores. Because peptides could have been derived from rat (host) or human (tumor) proteins, spectra were annotated twice using the peptides with the highest Mascot CI, once according to the rat Swiss-Prot database and once according to the human Swiss-Prot database (UniProtKB/Swiss-Prot Release 9.6 of February 6, 2007 (Swiss-Prot Release 51.6)). If a spectrum was not annotated using the well curated rat or human Swiss-Prot databases, Mascot searches were performed in the larger but more redundant NCBI rat and human databases (NCBInr release of March 29, 2005). Next the precursor protein sequences of all peptides were retrieved from the respective databases, and NCBI sequences that shared more than 90% similarity over 85% of the sequence length with a Swiss-Prot sequence were clustered together as a single protein cluster (blastclust). Clustering of Swiss-Prot sequences was not permitted, thereby preventing the clustering of human and rat sequences. Next the sequences of the peptides were compared with the sequence of the protein clusters. Peptides were automatically compared with a protein cluster using the VBA script in Excel. Using a text search, the peptide sequences were searched in the sequences of all proteins of a cluster. Peptides were only matched to a protein cluster if the protein sequence was preceded by an Arg or Lys (trypsin cleavage sites) or if the peptide sequence started with a Met and matched to the beginning of a protein sequence. Peptides that matched the sequence of more than one cluster were considered species- or isoform-nonspecific. These nonspecific peptides were not considered for protein identification and quantification. Peak areas for each iTRAQ signature peak (m/z 114.1, 115.1, 116.1, and 117.1) were obtained directly from the Oracle database of the mass spectrometer, corrected according to the manufacturer's instructions to account for the differences in isotopic overlap, and log2-transformed. Possible systematic differences in the starting amounts or labeling efficiencies between samples were normalized by subtracting the mean peak area of a sample from each individual iTRAQ signature peak. Low signature peaks generally have larger variation, which may compromise the quantitative analysis of the proteins. Therefore, peptides with iTRAQ signature peaks less than 10.97 (log2) were not considered for quantification. To compare the abundance of peptides across multiple iTRAQ experiments, within each experiment peptide quantity values were standardized to scores around zero by subtracting the mean peak area of all four samples. These normalized and standardized peptide quantity values were used to calculate mean protein quantity values.
Data Analysis in J-Express
The mean protein quantity values were imported into J-Express v.2.8 (18, 19) for further analysis. Only proteins quantified by at least two peptides in all four iTRAQ signature peaks and having at least one peptide with better Mascot CI value of 95% were imported for quantitative analysis. Note that because data sets from four different iTRAQ experiments were combined, one particular protein can be present up to four times in the final (redundant) data set. Except to calculate mean -fold changes, the separate entry points for one protein were not combined to be able to attribute each data point to the corresponding iTRAQ experiment from which it was derived.
Difference in Protein Abundance Levels
A protein quantity value matrix was formed having four columns representing the following xenograft samples: 1) early generation P6, 2) late generation P6, 3) early generation P17, and 4) late generation P17. Quality filtered data from iTRAQ1 and iTRAQ2 fitted directly into this matrix with different proteins on separate rows. From iTRAQ3 and iTRAQ4 (P6 and P17, respectively) the standardized early and late generation xenograft samples (labels 113 and 117, respectively) protein quantity values were added as additional rows in the protein quantity matrix. To identify consistent differences in protein abundance levels in both P6 and P17 between the early and late generation, we applied the RankProduct method (20) implemented in J-Express. RankProduct is a robust approach designed for experiments with few replicates and provides false discovery rate-equivalent q values as confidence estimates to counter for the issues of multiple testing in high throughput experiments.
Correspondence Analysis (CA) Plots
To visualize a global view of how the samples and proteins relate to each other, the CA approach (21) calculates the principal components of both the sample co-variance matrix and the protein co-variance matrix in such a manner that the resulting most significant principal components can be used to plot both samples and proteins in the same biplot. The biplot thus provides a 2D estimated global view of sample-sample, sample-protein, and protein-protein relationships found in the original higher order dimension data set.
Trend Analysis over Four Generations
The quality filtered data of iTRAQ3 and iTRAQ4, representing the four xenograft generations of P6 and P17, respectively, were imported as two separate data sets in J-Express. Unsupervised analysis approaches like self-organizing map clustering (22) and Profile similarity search were utilized to identify candidate trend profiles over the four generations in each of the two experiments. For biological interpretation of proteins sharing a trend profile, overlap sets were calculated, and annotations were manually inspected.
Immunohistochemistry
Immunostaining on Formalin-fixed Paraffin-embedded Tissue Sections
Immunohistochemical stainings were revealed with the Dako Envision+ System, HRP (anti-mouse K4006, anti-rabbit K4010; Dako, Glostrup, Denmark). Endogenous peroxidases were blocked with 0.03% hydrogen peroxide, and nonspecific binding was blocked with 2% fetal calf serum in 0.1% Triton X-100, Tris-buffered saline (T-TBS, pH 7.6). Sections were then incubated for 1 h at room temperature with primary antibodies (for Annexin A2, overnight incubation at 4 °C) followed by peroxidase-labeled polymer conjugate to anti-mouse or anti-rabbit immunoglobulins. After washing, sections were incubated for 5–20 min with the 3,3'-diaminobenzidine chromogen. For EGFR antibody, the staining was preceded by a demasking treatment: 5 min proteinase k (S3020; Dako) and 30 min retrieval solution (S1700; Dako) at 95 °C.
Primary Antibodies
The following primary antibodies were used: anti-von Willebrand factor/factor VIII, 1:400 (polyclonal rabbit, A0082, Dako); monoclonal mouse anti-Calnexin, 1:50 dilution (MAB3126, Clone C8.B6, Chemicon); monoclonal mouse anti-Annexin A2, 1:50 (610068, BD Biosciences); rabbit anti-Annexin A5, 1:200 (PA120A; Hyphen Biomed, Neuville-sur-Oise, France); monoclonal mouse anti-human EGFR, 1:50 (M7239, E30 Clone, Dako); and monoclonal mouse anti-human nestin, 1:200 (MAB5326, Chemicon). EGFR and nestin antibodies were human-specific. The nestin antibody was used as a control to identify tumor cells on the TMAs, allowing removal of the tumor-negative spots from the statistics.
Tissue Microarrays
Xenograft TMA
The xenograft TMA was prepared from six generations of rat xenograft brain tumors derived from seven different GBM patients. A total of 20 early generation xenografts and 10 late generation tumors were spotted in triplicates. Five samples of normal rat brain were used as control.
Glioma TMA
A high density tissue microarray of primary gliomas and normal human brain tissue (GL208, US Biomax, Inc., Rockville, MD) was used. Tumor sections (5-µm thickness) of four grades were present in three replicates: astrocytoma grade 1 (eight patients), astrocytoma grade 2 (22 patients), astrocytoma grade 3 (16 patients), and glioblastoma multiforme (15 patients).
Staining
Endogenous peroxidase activity was blocked with 0.03% hydrogen peroxide, and nonspecific binding was blocked with 2% fetal calf serum in T-TBS, pH 7.6. The sections were then incubated for 1 h at room temperature with primary antibody. Immunohistochemical stainings were revealed using the Envision+ System, HRP (K4010, Dako). After washing, sections were incubated for 15 min with the 3,3'-diaminobenzidine chromogen.
TMA Quantification
For quantification of the immunostaining, all pictures were taken in the same background with a 50x magnification. The eight-bit pictures were analyzed with Image J software. The rolling ball algorithm was used to subtract the background, and the mean pixel intensity was determined on a normalized gray scale between 0 and 255. Values were visualized as box plots, and statistical analysis was done using the Mann-Whitney Wilcoxon rank test (p value
0.001).
Ingenuity Pathway Analysis
Functional analysis of the data set was done using the Ingenuity Pathway Knowledge Base (Ingenuity Systems, Redwood City, CA). The functional analysis identified biological functions and/or diseases that were most significant to the data set. Proteins showing highest expression in the angiogenic phenotype with a false discovery rate (FDR) of 30% (rank product list, supplemental Table 1) as well as proteins identified from the two major profiles from the trend data set over the four tumor generations (supplemental Table 2) were considered for the analysis. The analysis was done against the Ingenuity Pathway Knowledge Base. Fischer's exact test was used to calculate a p value determining the probability that each biological function and/or disease assigned to that data set is due to chance alone. Canonical pathway analysis identified the pathways from the Ingenuity Pathway Knowledge Base that were most significant to the data set. The significance of the association between the data set and the canonical pathways was measured 1) by the ratio of the number of proteins from the data set that map to the canonical pathway divided by the total number of proteins that map to the canonical pathway and 2) by Fisher's exact test (see above). The focus genes from the two short lists (supplemental Tables 1 and 2) were overlaid onto a global molecular network developed from information contained in the Ingenuity Pathway Knowledge Base. This generated networks based on the connectivity of the individual proteins.
| RESULTS |
|---|
|
|
|---|
Species-specific iTRAQ Analysis of the Xenograft Brain Tumor Model
To identify membrane-associated proteins involved in the angiogenic switch we applied high throughput quantitative proteomics analysis on the xenograft tumor phenotypes. Isobaric peptide tagging coupled to LC and tandem MS was performed on membrane-enriched protein fractions of xenografted brain tumors derived from two different GBM patients (P6 and P17). Comparisons were done on samples from four generations of serially transplanted tumors and between first and late generation tumors (see Fig. 2 for experimental work flow). Membrane protein enrichment was confirmed by the accumulation in the plasma membrane fraction of EGFR, a tyrosine kinase receptor known to be up-regulated in a majority of human GBMs (Fig. 2C).
After protein digestion and peptide labeling, the samples were separated by two dimensions of LC: SCX columns (20 fractions) followed by reverse phase chromatography (192 fractions) yielding a total of 3840 fractions for each iTRAQ experiment that were spotted on MALDI plates and analyzed by mass spectrometry (a total of 15,360 spots for the four experiments). Over the four iTRAQ experiments this led to the identification of about 7500 redundant (non-unique) proteins (C.I.
95%) by GPS Explorer analysis in Swiss-Prot and NCBI rat and human databases. Taking advantage of the sequence dissimilarities between human and rat proteins, only peptides that could differentiate between the two species were considered in the subsequent analyses. Peptides with a sequence that did not discriminate between isoforms were also excluded resulting in about 3150 isoform- and species-specific proteins of which 1633 were of human/tumor origin and 1517 were of rat/host origin. This 1:1 ratio between human and rat IDs was constant throughout the data set, suggesting that there was no differential bias toward one or the other species in any of the samples. Of these, 1460 proteins were quantifiable based on identification with at least two peptides and a high annotation confidence (CI
95%), and 1038 thereof were present in both patients. Here again the distribution between human and rat remained similar (522 human, 511 rat).
Bioinformatics Analysis Identifies Different Protein Clusters between Invasive and Angiogenic Tumor Phenotypes
Correspondence Analysis
Using CA plots, global views of the different data sets were produced to reveal global relationships between the samples and between samples and proteins. Fig. 3A shows that the major global trend in the data set for analyzing differentially expressed proteins between early and late generations was indeed the separation of the two early and the two late generation samples along the first axis of the plot. Interestingly this effect was found to be larger in this estimated global view than the individual difference between P6 and P17, which was not even detected as the second largest major trend along the y axis. This suggests that the biological variation between human GBMs is less pronounced than the phenotypic variation observed between the different xenograft generations. Fig. 3B shows that the four consecutive tumor generations of P6 fall out in the plot in a sequential counterclockwise order around origo. The exact same trend was seen for P17 (Fig. 3C). The similar trends of P6 and P17 were further confirmed by applying CA to only the proteins found to overlap in the P6 and P17 trend data sets (Fig. 3D). We then performed unsupervised cluster analysis to generate a heat map of a global clustering (Fig. 3E), describing "pairwise sample similarity." This again shows that the highest similarity was found between late generation tumors of the two patients (P17-4 and P6-4), and the biggest difference was between early and late generations (independent of patient). The CA plots and the heat map are indicative of the confidence of the data obtained as the observed sample clustering clearly reflects the histological similarities and dissimilarities of the tumor phenotypes.
|
Protein Profiles in Trend Data Sets
Next we analyzed the global plot of the two trend data sets over the four tumor generations derived from P6 and P17 (iTRAQ3 and iTRAQ4). Visual inspection clearly identified common dominant protein profiles that appear to be present in both data sets (Fig. 4A). By adjoining the data from P6 and P17 for shared proteins, proteins co-regulated in both patients can be detected by clustering. Using the self-organizing map clustering algorithm these proteins were divided into nine clusters (Fig. 4B) from which two major common profiles were identified: the first one assembling proteins that were up-regulated in the late generation (derived from clusters 7, 8, and 4), and a second one showing peak protein expression in the middle generations (derived from clusters 1 and 2). Proteins with expression patterns similar to these global trends were extracted individually from the P6 and P17 data sets using Profile similarity search in J-Express, resulting in Profile I (red) and Profile II (dark green) highlighted for P6 and for P17 (Fig. 4C). Interestingly most proteins in the up-regulated Profile I were of human origin (Fig. 4, D–G). In P6 only one of 41 was of rat origin (Fig. 4D), and in P17 only one of 28 was of rat origin (Fig. 4F). The situation is exactly the opposite for Profile II where 48 of 70 proteins in P6 (Fig. 4E) and 26 of 28 in P17 (Fig. 4G) were of rat origin. The protein IDs and annotations from the profiles depicted in Fig. 4, D–G, are presented in supplemental Table 2.
|
|
|
|
Functional Analysis of Data Sets
To further interpret our findings in a biological context, the top 60 proteins from the rank product list (supplemental Table 1) and the proteins from the two main profile lists (supplemental Table 2) were analyzed through the Ingenuity Pathway Knowledge Base. This functional analysis identifies biological functions and/or diseases that are most significant to the data set. The rank product list, representing proteins up-regulated in the angiogenic phenotype, generated the following top networks: (i) cell-to-cell signaling and interaction (score, 47), (ii) amino acid and lipid metabolism (score, 32), and (iii) energy production/nucleic acid metabolism (score, 27). The proteins attributed to the networks are shown in Table I. Neurological disease and cancer were the main diseases connected to the list. The top canonical pathways associated with high significance were oxidative phosphorylation and mitochondrial dysfunction (see Table I for protein IDs and associated p values). Next we analyzed the proteins from the two major profiles from the trend data set (Fig. 4) that are of particular interest as they reveal differences between host and tumor responses. Profile I from the trend data set (supplemental Table 2), consisting largely of tumor proteins found to be increased in the angiogenic phenotype, generated the following major networks: (i) lipid metabolism/small molecule biochemistry (score, 49), (ii) cancer/cell morphology (score, 32), and (iii) energy production/nucleic acid metabolism (score, 29). The major diseases associated to this data set are genetic disorder and neurological disease. The top canonical pathways identified with high significance are oxidative phosphorylation and mitochondrial dysfunction. The proteins attributed to the networks, diseases, and canonical pathways are shown in Table II. Profile II from the trend data set(supplemental Table 2), consisting largely of host-derived (rat) proteins that showed slightly increased expression in the intermediate generations, generated the following major networks: (i) cellular function and maintenance (score, 68), (ii) energy production/lipid metabolism (score, 36), and (iii) energy production/nucleic acid metabolism (score, 34). The major diseases associated with this data set are again genetic disorder and neurological disease. The top canonical pathways identified with high significance are oxidative phosphorylation and mitochondrial dysfunction. The proteins attributed to these networks, diseases, and canonical pathways are shown in Table III.
|
|
|
| DISCUSSION |
|---|
|
|
|---|
The combination of subfractionation, iTRAQ-based labeling, 2D LC peptide separation, and MALDI-TOF/TOF mass spectrometry connected to database searches for human and rat proteins yielded several thousand protein IDs over four iTRAQ experiments. Taking advantage of the xenograft model, we focused on those proteins that were extracted as isoform- and species-specific based on their sequence identity, thus ensuring that the origin of the proteins (tumor- or host-derived) was unequivocal. Interestingly the number of human protein IDs was very similar to the number of rat IDs, suggesting that there was no bias in species contribution between different samples. Only proteins identified with at least two peptides/protein and a high annotation confidence (
95%) were considered, resulting in 1460 quantifiable proteins over the four experiments. Correspondence analysis and heat map plots revealed a high degree of similarity between tumor xenografts of the same generation although arising from different patient GBMs. This indicates that the xenograft animal model provides reproducible tumor specimens derived from different patients. It is also a strong indication of the validity of the overall data set because it clearly mirrors the histological findings. This is of major importance for studies with rare, clinical material and for this study in particular where the elaborate and time-consuming establishment of the animal model restricted the number of available GBM tumor xenografts. A similar trend was seen in the analysis of the trend data set comparing the expression profiles over four consecutive tumor generations. Here again the two major profiles extracted were identical between the two patients. The first most abundant profile encompasses the proteins that showed increased expression in the angiogenic, late generation tumor phenotype. Proteins from the second major profile showed a slightly increased expression in the second and third generation tumors (with intermediate phenotype) and dropped down in the late generation, angiogenic phenotype. Interestingly the large majority of proteins from Profile I were of human origin (tumor proteins), whereas those from Profile II were largely host-derived (rat proteins). The fact that the overall distribution of rat versus human IDs was 1:1 suggests that the species-specific protein regulation in these subsets is significant.
Obviously species- and isoform-specific quantification was achieved at the expense of the number of included peptides. In fact 76% of unique peptides were discarded in the present analysis focusing on species- and isoform-specific proteins. However, these peptides may provide important additional information and will be used to further validate the current data. This is currently under investigation, in particular taking into account the comparison with a gene expression study carried out on the same tumor samples.2
The expression pattern of Calnexin, AnxA2, AnxA5, and EGFR was confirmed by immunostaining on tissue microarrays of a large number of invasive and angiogenic xenografts as well as on glioma TMAs. The proteins were chosen from the rank product list on the basis of (a) their strong regulation (Calnexin), (b) the representation of a protein family (Annexins), or (c) their known role in GBM biology (EGFR). The validation of the iTRAQ result for all four proteins suggests that the combined proteomics and bioinformatics analysis applied has generated valid candidates that may be further evaluated for their role in the progression from invasive to angiogenic brain tumors. Obviously the validation of a handful of proteins cannot be extrapolated to the full protein list.
Calnexin is a calcium-binding, endoplasmic reticulum-associated protein that is involved in protein folding and assembly as well as the quality control of protein folding (25). It has an additional function in the assembly of MHC class I proteins as part of the antigen processing machinery (26), which plays a crucial role in the interaction between malignant cells and the immune system of the host. Calnexin is highly expressed in many human cancers, but an up-regulation in malignant gliomas has not been reported previously. Its functional role in angiogenesis and glioma progression remains to be addressed.
Annexins belong to a large family of calcium-dependent phospholipid-binding molecules with Annexin repeats, which are involved in several cellular processes including cell growth, signal transduction, apoptosis, inflammation, and differentiation. In addition to AnxA2 and AnxA5, the present screen identified Annexins A1, A3, and A6, all of which showed increased expression in the angiogenic tumors. AnxA3, one of the proteins that was increased in the host compartment of angiogenic tumors, is known to promote endothelial cell migration and proliferation possibly via induced VEGF production (27). Interestingly AnxA3 has recently been identified in two proteomics screens in the injured central nervous system with its expression being limited to activated microglia (28, 29). The cellular origin of AnxA3 in our brain tumor model has not been investigated yet, but the fact that it is host-derived may point to microglial production, suggesting that activated microglia in malignant glioma contribute to tumor angiogenesis.
AnxA2 is classified as a candidate cancer marker (protein atlas), and its expression in gliomas correlates with malignancy (30). Indeed we confirmed increased expression of AnxA2 in grade III and IV gliomas on a large glioma TMA, suggesting that AnxA2 may be a novel biomarker for malignant gliomas. It has an important role in neoangiogenesis during development (31) and is involved in tumor angiogenesis presumably through its role in plasmin and metalloproteinase activation, which leads to increased VEGF production (32, 33). It has therefore been proposed as a potential antiangiogenic cancer target (32). Clearly the Annexin protein family deserves further attention with regard to glioma angiogenesis and progression.
GBM frequently involves amplification and alteration of the EGFR gene, resulting in overexpression of wild type and/or mutated forms of the receptor, including the most common mutation, EGFRvIII (34). EGFR gene status and expression is a negative prognostic factor for patients with GBM, especially in younger patients (35). EGFR signaling promotes proliferation, migration, and invasion and inhibits glioma cell apoptosis (36). In our xenograft model, the expression of this receptor appears to be independent of the tumor phenotype but is likely linked to the original patient biopsy from which the xenograft was derived. This supports the notion that the genetic mutations (in this case EGFR amplification) are maintained in our xenograft model.
Although the bioinformatics and functional analysis should be considered as hypothesis-generating, important trends can be identified in the data set. GO annotations and initial functional pathway analysis revealed that many proteins with increased expression in angiogenic tumors are associated with metabolic processes (energy production and lipid and nucleic acid metabolism) as well as cell-to-cell signaling and interaction. The data suggest that the angiogenic phenotype is characterized by an aerobic metabolism for energy production (oxidative phosphorylation) and a more active general cellular metabolism compared with the non-angiogenic phenotype. The canonical pathway of mitochondrial dysfunction suggests that alternative metabolic pathways may additionally be recruited in tumor cells for efficient energy production. These data are consistent with the higher growth rate of the angiogenic tumors. Conversely the infiltrative, non-angiogenic tumor may display reduced oxygen tension thereby forcing tumor cells to infiltrate the surrounding brain parenchyma. This is also supported by recent data from our laboratory showing a significant increase in lactate levels in the infiltrative tumor by NMR spectroscopy (24). The cell-to-cell signaling and interaction network extracted from the angiogenic tumor suggests an increased cross-talk between tumor cells and/or between tumor and host cells. This may reflect a better adaptation of the cancer cells to their microenvironment in the angiogenic phenotype, thereby favoring the induction of new blood vessels and improved tumor growth. Based on the protein IDs of this network (AnxA3, Calnexin, interferon
, MHC class I, and NF
B), an interaction between tumor cells and the host immune system is also conceivable. Thus our data suggest that enhanced tumor host interaction may lead to a better adaptation of the tumor to its microenvironment, which may form the basis for the angiogenic switch in solid tumors. The precise analysis of the candidate proteins in this cross-talk will provide clues on the role of the environment on tumor progression.
In conclusion, based on a novel clinically highly relevant xenograft brain tumor model, we have generated an extensive and unique data set of protein expression profiles that recapitulates the change from invasive to angiogenic tumor growth. Based on glioma TMA expression analysis, we propose Calnexin, AnxA2, and AnxA5 as novel biomarkers for high grade gliomas. Pathway analysis suggests the importance of increased metabolic activity and tumor host interactions to drive tumor growth in the angiogenic phenotype. The evaluation phase of the candidate proteins should reveal novel biomarkers that can be further exploited as therapeutic targets for malignant gliomas.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
* This work was supported by the European Commission 6th framework program (Angiotargeting project, Contract 504743); by CRP-Santé and the Research Ministry (Ministère de la Culture, de l'Enseignement Supérieur et de la Recherche) of Luxembourg; by the Norwegian Cancer Society; by the Norwegian Research Council Bergen, Norway; and by The VU Medical Cancer Center Amsterdam (to J. C. K., T. V. P., and C. R. J. and for the proteomics infrastructure). ![]()
The on-line version of this article (available at http://www.mcponline.org) contains supplemental Tables 1 and 2. ![]()
2 K. Petersen and S. P. Niclou, unpublished data. ![]()
1 The abbreviations used are: GBM, glioblastoma multiforme; iTRAQ, isobaric tags for relative and absolute quantitation; TMA, tissue microarray; EGFR, epidermal growth factor receptor; SCX, strong cation exchange; CI, confidence interval; NCBI, National Center for Biotechnology Information; CA, correspondence analysis; 2D, two-dimensional; FDR, false discovery rate; VEGF, vascular endothelial growth factor; S, supernatant; P, patient; ID, identification; Anx, Annexin; GO, gene ontology; MHC, major histocompatibility complex..![]()

To whom correspondence should be addressed: Norlux Neuro-Oncology Laboratory, CRP-Santé, 84 Val Fleuri, L-1526 Luxembourg, Luxembourg. E-mail: simone.niclou{at}crp-sante.lu.
| REFERENCES |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |