Jennifer CHARLES, Makdi MASNODDIN, Farhan NAZAIE & Nur Athirah YUSOF
Biotechnology Research Institute, Universiti Malaysia Sabah, Jalan UMS 88400 Kota Kinabalu, Sabah, Malaysia
Abstract Cold shock domain (CSD)-containing proteins are one of the groups of the evolutionarily conserved nucleic acid-binding proteins in all three domains of life consisting of an ancient beta-barrel fold that serves to bind nucleic acids. The cDNA of a novel protein-coding gene containing CSD was cloned from Glaciozyma antarctica designated as Ga16676. The full length of Ga16676 gene with the size of 1335 bp encodes for an N-terminal CSD with conserved nucleic acids binding motif RNP1 and RNP2. The Ga16676 gene was cloned in pET30 Ek/LIC, sequenced, expressed and its resistance towards cold was characterized. Recombinant protein expression of Ga16676 showed overexpressed soluble expression in both supernatant and pellet forms at 20 . The effects of recombinant CSD protein overexpression on colony formation shows that E. coli cells were able to grow at 37 and 20 but not at 4 while E. coli_Ga16676 cells were able to grow at all temperatures tested. In addition, E. coli_Ga16676 cells showed higher growth rate compared to empty E. coli cells at 10 . Structural analysis of Ga16676 reveals some interesting findings such as more aromatic interactions for efficient binding in low energy environment, a longer loop that may contribute to structural flexibility and clustering of charged amino acids on the protein surface that is important for protein stability and flexibility.
Keywords cold sh ock protein (CS P), cold ad aptation, cold s tress, s tress resp onse, s tress toler ance, Glaciozyma antarctica, psychrophilic yeast
More than three-quarters or approximately 80% of the Earth’s surface is covered by cold-oceans with a temperature below 5 , including areas such deep oceans, glaciers, and polar regions (Russell, 1990; Margesin et al., 2007). The extremely low temperature in the Antarctic made it almost impossible for the survival of most organisms (Rogers et al., 2012). Indeed, temperature remains a critical environmental factor controlling the evolution and biodiversity of life on earth. The microorganisms that occupy these regions are identified as psychrophiles which are able to proliferate faster at 15 or below but unable to proliferate above 20(Margesin et al., 2007). Due to the extreme environments, psychrophiles meet a variety of stressful and changing environmental conditions that challenge their optimal development.Glaciozyma antarticais an obligate psychrophilic yeast that strives in such harsh environments (Hashim et al., 2013). It was isolated from the sea ice near Casey Station, Antarctica (Firdaus-Raih et al., 2018). This yeast lives in the Antarctic marine waters where temperatures range from -2.2 to 4 . According to Boo et al. (2013),G. antarcticais classified as an obligate psychrophilic yeast based on its optimum temperature, reported as 12 , and tolerance of temperatures up to 20 . One of the main challenges for organisms in cold environments is low temperatures that decrease the rates of biochemical reactions and thereby affect cell growth (D’Amico et al., 2006). Temperature downshift causes the cell membrane fluidity to decrease which affects protein translocation and secretion hence, cells produce cold shock proteins (Csps) to help overcome these challenges (Phadtare and Severinov, 2010).
Experimental studies also have shown that cold-shock protein is responsible for the stabilization of membrane fluidity caused by cold stress. This involved the increasing production of unsaturated fatty acids, which involves fatty acid desaturase activity (Nakagawa et al., 2002). Cold shock proteins are small nucleic acid-binding proteins ranging from 65 to 75 amino acids in length (Graumann and Marahiel, 1996; Czapski and Trun, 2014). Cold shock proteins act upon harmful effects of temperature downshift and thus help the cells to adapt (Phadtare, 2004). Interestingly, studies have shown that Csps not only are important during temperature downshift but also play a wider role in stress tolerance of bacteria (Michaux et al., 2012; Sch?rer et al., 2013; Wang et al., 2014; Derman et al., 2015). The cold shock protein A (CspA) inE.colifunction as RNA chaperone which aid in low affinity for RNA binding, low sequence specificity, and ability to destabilize the secondary structures in RNA resulted in them to be more susceptible to RNases, therefore preventing formation of secondary structure which is responsible to facilitating the efficient translation of mRNAs (Vasina and Baneyx, 1996). Cold shock proteins were initially identified in bacteria, a sudden drop in temperature (37 to 10 ) induced a 200-fold increase in CspA, which was independent of transcriptional activity (Jones and Inouye, 1994; Gottesman, 2018). The cold shock proteins are the most evolutionary conserved proteins (Jones and Inouye, 1994; Lindquist and Mertens, 2018) which can be found in psychrophilic, mesophilic, thermophilic and even hyperthermophilic bacteria (Phadtare, 2004; Jin et al., 2014).
In contrast with the various studies conducted with heat shock in yeast, the mechanisms of the cold stress response are still poorly understood. Cold-shock proteins have been identified in yeast and recently, protein with thermotolerance and molecular chaperone properties has been studied inG.antartica(Yusof et al., 2016; Firdaus Raih et al., 2018). Cold-shock response has been studied in eukaryotes, but no well-defined conservation of cold-shock proteins has been found (Panadero et al., 2006; Yang et al., 2012). Similar to heat shock, cold-stress was reported to be regulated by classical stress pathways involving the high-osmolarity glycerol mitogen-activated protein kinase Hog1, and the transcriptional factors Msn2 and Msn4 (Panadero et al., 2006). However, there is still very limited information about the signaling events resulting from changes in temperature. Furthermore, the mechanism of how cells react with environmental stress varies between species. Thus, in order to understand howG.antarcticaadapts to cold, harsh environments, one must understand every aspect of its physiology and biology. In this study, we present the first exploration of the role of cold shock domain in a novel protein from an obligate psychrophilic yeast.Ga16676is a novel protein-coding gene with uncharacterized function, also known as hypothetical protein, which has been identified in our study (unpublished data) as having significant expression patterns. The gene contains a CSD with the size of 207 bp which has sequence variants that confer thermal adaptation. We aimed to shed light on the cold adaptation strategies acquired by this cold shock domain containing protein. For this purpose, we have cloned the whole gene ofGa16676with the size of 1335 bp and proceeded to functional and structural analysis. We found out several structural features that allowed this protein to adapt to the low temperatures. The elucidation of the molecular adaptations that rendered this protein cold active activities expends our current understanding about protein cold adaptation and for biotechnological applications.
G.antarcticacells were cultured in yeast peptone dextrose broth (10% (w/v) yeast extract, 20% (w/v) peptone, and 20% (w/v) dextrose) at 12 until the OD600reached approximately 0.6-0.8. The cells were harvested and stored at -80 until further usage.
The following materials were treated overnight with diethylpyrocarbonate (DEPC) 0.1% (v/v) and subsequently autoclaved: pestle and mortar, microcentrifuge tubes, pipette tips, and spatula. The RNA extraction was performed using TRIzol? reagent (Invitrogen, USA). The concentration and purity of total RNA were measured using a Nanodrop spectrophotometer at 260 nm (ThermoScientific, USA). The RNA quality was determined after running total RNA on a 1% (w/v) agarose gel at 120 V for 1 h. Total RNA was stored at -80 .
Ga16676complementary DNA (cDNA) was amplified using specific primers that promoted the ligation- independent cloning of target genes into the pET30 Ek/LIC (Merck-Millipore, Germany) vector (Table 1). The direct ligation of amplifiedGa16676cDNA with the pET30 Ek/LIC vector was prepared according to the manufacturer’s instructions. Then, the ligation mix was transformed into anE. coliBL21(DE3) expression host. The positive transformants were identified using sequencing analysis. TheGa16676sequence was deposited in GenBankTM under accession number MH719220.
2.3.1 Expression of Ga16676 in E. coli BL21(DE3)
The expression of recombinant Ga16676 was assessed in LB media containing 50 μg·mL-1of kanamycin. Induction was performed using 1 mmol·L-1isopropyl β-D- thiogalactopyrosidase (IPTG) after reaching an OD600of ~0.6 and overproduction of recombinant protein was induced for 16 h, at 16 ℃, 20 ℃ and 37 ℃ at 180 rpm respectively. Overexpression of the Ga16676 was analyzed using 15% SDS-PAGE. The protein bands were visualized by Coomassie brilliant blue R250 staining.
2.3.2 Colony formation of Ga16676 in E. coli BL21(DE3)
The Ga16676 culture was incubated at 37 ℃ until OD600of ~0.6 and induced with 1 mmol·L-1IPTG. An amount of 10 μL of the culture was plated on LB plates supplemented with 50 μg·mL-1of kanamycin and streaked using sterile inoculation loop. The plates were incubated in 4 ℃, 20 ℃ and 37 ℃ until the formation of the streaked colony was observed.E.coliBL21(DE3) consisting plasmid withoutGa16676insert was used as control.
The Ga16676 culture was incubated at 37 ℃ until OD600of ~0.6 and induced with 1 mmol·L-1IPTG. The cells were transferred to 10 ℃, 20 ℃ and 37 ℃ and incubated at 180 r·min-1respectively. The optical density of the culture (OD600) was observed and recorded until stationary phase in the respective temperatures.E.coliBL21(DE3) consisting plasmid withoutGa16676insert was used as control.
The three-dimensional Ga16676 structure was modelled to theBacillus caldolyticus(PDB entry 1HZA) using the SWISS-MODEL program. The structure quality was evaluated using PROCHECK (Laskowski et al., 1993), Verify3D (Eisenberg et al., 1997) and ANOLEA (Melo et al., 1997). The superimposed Ga16676 CSD model and template and comparative analysis were performed using CHIMERA USCF (Pettersen et al., 2004).
The Ga16676 sequence analysis revealed that this protein-coding gene was a conserved hypothetical protein containing CSD with the size of 49 kDa. The cold shock domain was 47.83% identical toBacillus caldolyticuscold shock protein, 48.57% identical toListeria monocytogenescold shock CspB, 49.28% identical toGeobacillus stearothermophiluscold shock CspB and 50% identical toBacillus cereuscold shock protein CspD (Table 2). The full length of Ga16667 was 445 amino acids while the cold shock domain contained 69 amino acids. The cold shock domain was found to be conserved in all prokaryotic and eukaryotic single-strand nucleic-acid binding proteins. The amino acid sequence of Ga16676 CSD showed the presence of 5 β-strands which created a common β-barrel of a cold shock domain structure. The sequence also contains RNA binding motifs that were highly conserved among living organisms. Interestingly, residue substitutions of Ile to Val were observed in RNA binding motif-1 (RNP1) while Asp to Glu in RNA binding motif-2 (RNP2) (Figure 1). The RNA binding motifs play an important role in binding both single-stranded RNA and DNA (Manival et al., 2001). A total number of seven aromatic residues (Trp-11, Phe-12, Phe-18, Phe-20, His-33, Phe-34 and Tyr-42) that interacted with RNA or ssDNA were found in the sequence. Ga16676 CSD retained all of the aromatic residues except Phe-12 and His-33. In this study, the Phe-18 and Phe-31 residues that play a critical role in nucleic acid binding activity were conserved in Ga16676 CSD (Jung et al., 2010). A total of 8 unique residue substitutions was found in Ga16676 CSD. These were substitutions of Val to Cys, Trp to Phe, Asp to Glu, Lys to Arg, Gln to Glu, Phe to Tyr, Iso to Leu and Pro to Tyr. Phylogenetic tree analysis showed Ga16676 CSDhad high similarity at the protein level with other CSD from bacteria to eukarya (Figure 2). Since Ga16676 CSD was unclustered with other organisms, it was possible that the sequence evolved to acquire cold adaptation strategy.
Figure 1 Multiple sequence alignment of Ga16676 CSD (amino acid position 1-114) with other cold shock domain proteins from E. coli, B. caldolyticus, B. subtilis and T. maritime. The RNA binding motifs RNP1 and RNP2 were boxed. The β-strands were shown using top line. The aromatic residues that interacted with RNA or ssDNA were labelled with asterisks (*) while the hydrophobic residues forming hydrophobic core were labelled with black box (?).
Recombinant Ga16676 cells showed different protein expression profile when cells were grown at different temperatures of 10, 20 and 37 and IPTG concentrations of 0.5 mmol·L-1and 1 mmol·L-1respectively (Figure 3). At 37 , recombinant Ga16676 expressed target protein in supernatants both induced with 1 or 0.5 mmol·L-1IPTG. Interestingly, at 20, the recombinant Ga16676 was overexpressed in supernatants both in 1 and 0.5 mmol·L-1IPTG and in pellet form in 1 mmol·L-1IPTG. At 10 , the recombinant Ga16676 was expressed in supernatant induced with 1 mmol·L-1IPTG but none was found in pellet. Since the amount of protein loaded in each well was standardized, Ga16676 was found to be overexpressed at 20 both in the supernatant and pellet forms.
The ability of Ga16676 inE.coliBL21(DE3) to grow at different temperatures was observed by its colony formation (Figure 4). In this study,E.coliBL21(DE3) with empty vector was used as control. At 37 after overnight incubation, both control and Ga16676 cells grew. The control grew better compared to Ga16676 cells. Incubation at 20 for 3 d yielded the same result where both cells grew on the tested LB plate. Contrast to incubation at 4 for 8 d, only Ga16676 cells grew but not the control.
The cell growth of Ga16676 and control were observed at different temperatures of 37 , 20 and 10(Figure 5). In this study,E.coliBL21(DE3) with empty vector was used as control. At 37, control cells grew better compared to Ga16676. Incubation at 20 showed the same result as 37 where the control cells grew better compared to Ga16676. However, at 10 incubation, Ga16676 cells grew better compared to control. After 2 h exposure to 10 , Ga16676 cells started to increase in its growth density until 11 h of incubation. However, as for the control, exposure to 10 for 2 h had slowed down the cell growth where the cells increment started to become stationary after 10 h of exposure.
Ga16676 CSD domain had the highest similarity with CSD structure fromB.caldolyticus(PDB: 1HZA) with 48.57% identity. The structure of 1HZA was used as a template where the tertiary structure of Ga16676 CSD was modelled using SWISS-MODEL. The model was evaluated using PROCHECK, Verify3D and ANOLEA. PROCHECK analysis showed that the build model had 100% amino acids in favoured and allowed regions. Furthermore, model verification using Verify3D showed that the constructed model obtained a positive score of more than 80%. Analysis using ANOLEA showed acceptable energy calculations at the atomic level in the protein model structure.
Figure 2 Phylogenetic analysis of Ga16676 CSD with other CSD from bacteria to eukarya at protein level.
Figure 3 15% SDS-PAGE analysis of recombinant Ga16676 protein expression at different temperatures and IPTG concentration. From far left, lane-1 was loaded with protein marker (PM), lane-2 contained Ga16676 supernatant after incubation at 37 induced with 1 mmol·L-1 IPTG, lane-2 contained Ga16676 supernatant after incubation at 37 induced with 0.5 mmol·L-1 IPTG, lane-3 contained Ga16676 supernatant after incubation at 20 induced with 1 mmol·L-1 IPTG, lane-4 contained Ga16676 supernatant after incubation at 20 induced with 0.5 mmol·L-1 IPTG, lane-5 contained Ga16676 supernatant after incubation at 10 induced with 1 mmol·L-1 IPTG lane-6 contained Ga16676 pellet after incubation at 37 induced with 1 mmol·L-1 IPTG, lane-7 contained Ga16676 pellet after incubation at 20 induced with 1 mmol·L-1 IPTG, lane-8 contained Ga16676 pellet after incubation at 10 induced with 1 mmol·L-1 IPTG and lane-9 contained expression of BL21(DE3) harboring expression vector as control.
Figire 4 Colony formation of Ga16676 cells and control on LB plates supplimented with ampicillin at different incubation temperatures.
Superimposing of Ga16676 CSD model and its template gave an RMSD value of 0.168 ?. Comparative analysis of Ga16676 CSD model and 1HZA showed that the constructed model contained a longer loop which connected the RNP1 and RNP2 (Figure 6a). Besides that, the residue substitutions as mentioned in sequence analysis showed an increment of aromatic interactions at the RNA binding motifs. Some selective residues substitutions like Val to Cys, Asp to Glu, Gln to Glu, Phe to Tyr, and Pro to Tyr substituted uncharged/less charged amino acids to more charged amino acids, non-polar to polar and hydrophobic to less hydrophobic amino acids. Interestingly, these substitutions increased the distance between residues interactions at the loop regions which connected the RNP1 and RNP2. Analysis of the protein surface revealed that Ga16676 CSD model had a higher solvent accessible surface area (4902.63 ?) compared to its counterpart, 1HZA (4442.95 ?). Moreover, there was no distinct clustering of charged amino acids on Ga16676 CSD model as compared to 1HZA. The protein surface of Ga16676 CSD model had considerable clustering of charged amino acids (Figure 6b).
Figure 5 Cell growth comparison between Ga16676 cells and control at different temperatues of 37(a), 20(b) and 10(c).
Figure 6 a, Comparative structural analysis between Ga16676 CSD model and B. caldolyticus CSD (1HZA); b, The charged surface of 1HZA and Ga16676 CSD model. The solvent accessible surface area (SASA) of Ga16676 CSD model showed that it has no distinct patches of charged amino acids compared to 1HZA protein surface.
Most studies on Csps were done fromE.coliand a number of homologues from psychrophiles have been reported but mostly through genomic DNA studies. However, to our knowledge, there is no functional study that has been performed on Csps from an obligate psychrophilic yeast. In order to have a better insight into the molecular mechanism that a psychrophilic yeast uses to overcome the temperature downshift, we studied the role of its cold shock domain containing protein. In the present study, our sequence analysis showed that Ga16676 protein fromG. antarcticacontained cold shock domain where the sequence of nucleic acid-binding proteins, RNP1 and RNP2 were highly conserved in Ga16676 (Figure 1). Residue substitutions of Ile to Val was observed in RNA binding motif-1 (RNP1) while Asp to Glu in RNA binding motif-2 (RNP2) showed interesting residue substitutions of hydrophobic to less hydrophobic and less polar to more polar residues. In addition, a total of 8 unique residue substitutions in Ga16676 CSD sequence which were substitution of Val to Cys, Trp to Phe, Asp to Glu, Lys to Arg, Gln to Glu, Phe to Tyr, Iso to Leu and Pro to Tyr showed favorable substitutions from non-polar to polar residues, less polar to more polar residues, and less charged to more charged residues. The residue substitution P67Y (Proline to Tyrosine) on the β-strand may contribute to the aromatic-aromatic interactions in strand-strand stabilization of β-sheets. The interactions between aromatics are believed to occupy a crucial role in protein folding, stability, and protein-protein or protein-ligand (Waters, 2002; Chakrabarti and Bhattacharyya, 2007). Therefore, the substitution of amino acid residues with aromatic residue in Ga16676 cold shock domain enables the stabilization of structure of a barrel without H-bonding which allows the larger strand-strand distance to exist stably (Budyak et al., 2013). In addition, the high thermostability ofThermotoga maritimacold shock protein is reported due to the electrostatic interactions among the residues (Motono et al., 2008). Another report on an experimental mutational study onBacillus caldolyticuscold shock protein shows that when favourable charged residues inThermotoga maritimacold shock protein were grafted intoBacillus caldolyticuscold shock protein, the mutational effects significantly improved the thermostability ofBacillus caldolyticuscold shock protein without changing its two-state folding mechanism (Su et al., 2016). It was possible that the residue substitutions in Ga16676 CSD sequence which favoured hydrophilic, charged and polar residues helped to promote thermostability in the protein structure as a cold adaptation mechanism. Moreover, the phylogenetic analysis of the CSD obtained by evolutionary distance showed that the Ga16676 CSD was closely related to eukaryotes compared to the bacteria counterparts. Further, the analysis of sequence distribution on evolutionary scale also suggested sequence variation in Ga16676 CSD that possibly made the protein functionally more diverse compared to bacteria. Besides that, we also tested whether the recombinant Ga16676 protein could be expressed at 37 , 20 and 10 . As described previously, the recombinant Ga16676 protein was expressed at 10 in supernatant form and overexpressed in both supernatant and pellet forms at 20 after 16 h incubation. It has been reported that cold-shock inducible system produced high cold-active protein expression levels as well as reducing host toxicity (Bjerga and Williamson, 2015). Our result showed that the cold shock domain in Ga16676 may promote protein production at temperatures lower than 37 . Furthermore, the colony formation ofE.coliBL21(DE3) containing Ga16676 showed that the cells could grow at 4 after 8 d of incubation but not forE.coliBL21(DE3) cells with no insertion of Ga16676. Therefore, it is possible that the Ga16676 CSD involved in the acquisition of cold tolerance allowed the cells to grow at a low temperature of 4 . Production of cold shock proteins is part of the mechanisms that permit low-temperature growth of microorganisms (Wouters et al., 2000). Besides that, further analysis on the cell growth showed thatE.coliBL21(DE3) cells with Ga16676 were able to grow better at 10 compared toE.coliBL21(DE3) cells with no insertion of Ga16676. After 2 h of exposure to 10 , the cell growth started to increase with higher doubling time until hour-11 compared toE.coliBL21 (DE3) cells with no insertion of Ga16676. Hence, the colony formation and cell growth experiment suggested that recombinant Ga16676 protein might play a protective role inE.coliBL21(DE3) against cold shock stress which allowed the cells to grow at low temperatures. A similar result was reported by Yusof et al. (2016), where a recombinant GaSGT1 protein was able to prevent protein aggregation during temperature assaults as low as 0 . The importance of cold shock protein in cold adaptation was also documented in the gene expression of early cold response inSaccharomyces cerevisiae. When the budding yeast was exposed to a rapid temperature shift from 30 to 10 , early cold response genes were induced during the first 2 h of cold treatment while late cold response genes were induced after 12 and 60 h (Schade, 2004). The 3D structure analysis was important for the elucidation of a protein structure and function. Ga16676 CSD model contained a longer loop which connected the RNP1 and RNP. The loop which connected the two most important nucleic acid-binding domains was possibly linked to higher flexibility in cold-adapted proteins. This finding has been reported in several studies of cold-adapted enzymes (Michetti et al., 2017). Selective residues substitutions which involved uncharged/less charged to a more charged amino acid in Ga16676 was also observed inB.caldolytiscold shock protein and its mesophilic homologB.subtiliscold shock protein where this is important for protein stability (Delbrück et al., 2001). Based on the charged surface of 1HZA and Ga16676 CSD model, the value of the solvent-accessible surface area was higher than the 1HZA. These finding contrasted with the proposed theory of cold-adapted proteins contain less charged exposed residues, more exposed hydrophobic regions and less solvent accessibility to be more flexible for efficient activity at cold environments (Gianese et al., 2001; Zanphorlin et al., 2016). Hence, we proposed that accessibility to the solvents may be important to cold shock protein inG.antarcticain order to stabilize its structure during extreme temperature decrease. Another interesting finding was the proportional clustering of charged amino acids on Ga16676 CSD. In thermophiles, the high number of charged residues on a protein surface is important for thermostabilisation (S?lensminde et al., 2009).InB.caldolyticuscold shock protein, the clustering of charged amino acids is important for thermostability of the thermophilic protein (Delbrück et al., 2001). In our study, Ga16676 CSD protein surface has a considerable distribution of charged residues instead of being clustered. This finding suggested that the distribution of charged amino acids was equally important potentially for balanced stability and flexibility for protein to function in cold. Our analyses suggested that stability is important in cold shock protein structure where the rigidity may be ‘softened’ with flexible loops or linkers and considerable distribution of charged amino acids which allow conformational change for nucleic acid-binding.
AcknowledgmentsWe are grateful to the Ministry of Higher Education Malaysia for funding our project (Grant no. FRG0463-2017). We thank Prof. Clemente Michael Wong Vui Ling from Biotechnology Research Institute who has assisted us with his advice, material support and helpful discussions. We also thank those who involved in the whole genome sequencing and protein structural project ofGlaciozyma antarcticain Malaysia Genome Institute and Universiti Kebangsaan Malaysia for their relentless efforts. We appreciate very much two anonymous reviewers, and guest editor for their helpful and constructive comments on the manuscript of this paper.
Advances in Polar Science2020年2期