Biodiversity of organisms and their genomic content is a valuable source of enzymes, some of which can be isolated and turned into biocatalysts, useful for more sustainable and efficient industrial processes. Organisms thriving in constantly cold environments produce enzymes that may be more efficient in the cold and more thermolabile than enzymes from other organisms, and that display interesting features for the catalysis of several processes that require or are better at low temperature. In the first part of this thesis, two glycoside hydrolases of family 19 (GH19), named LYS177 and LYS188, were identified in the genome of an Antarctic Pseudomonas strain and characterized. Even though most of the characterized GH19 are chitinases, LYS177 and LYS188 showed no chitinolytic activity, but were active as lysozymes with an optimum temperature of 25-35°C, and retained 40% of their highest activity at 5°C. The temperatures of midpoint unfolding transition were estimated to be 20°C higher than their optimum of activity. Based on these features and sequence analysis, LYS177 and LYS188 can be considered cold-active phage endolysins integrated in prophagic regions of the bacterial host. Moreover, the best performing of the two, LYS177, was active and structurally stable over several days only at 4°C, indicating it as a candidate for potential application on the preservation of food and beverages during cold storage. In protein families, enzymes can rapidly acquire new specializations. Therefore, best practices should be implemented to select optimal candidates with the activity of interest and new, potentially promising, features. Characterized GH19 enzymes showed an enhanced in vivo crop defence against chitin containing pathogens and antimicrobial potentialities. In the second part of this thesis, the sequence space of the GH19 family was explored and a database was created to highlight non-described sequences potentially endowed with interesting variants. Based on global pairwise sequence identity of all proteins available in public databases, GH19s were assigned to two subfamilies, the chitinases and the endolysins. Subfamilies were further split into homologous families, which differ in the n° of characterized enzymes they harbour, in the taxonomical distribution, in the presence of accessory domains and loop insertions. Despite this heterogeneity, a core consisting of 27 amino acids around the active site, including important substrate binding residues, was inferred to be conserved between GH19 subfamilies. Thus, this shared core is suggested to be associated to the GH19 capacity to bind sugars containing N-acetyl-glucosamine. Moreover, specifically conserved positions in each subfamily alignment were identified to be a “signature” useful for predicting the substrate specialization of chitinases and endolysins, and to indicate possible outliers with different features. The GH19 evolution was also investigated through molecular phylogeny to explain the observed sequence and structural plasticity: despite endolysins were divided in an higher number of homologous families, they remained in phages and their bacterial hosts, contrary to chitinases, which spread to both prokaryotic and eukaryotic taxa, and acquired at least four loop insertions; moreover, the GH19 chitinase catalytic domain passed from plants to bacteria by horizontal gene transfer in at least two cases. In conclusion, the second part of this thesis shows how bioinformatic tools can be used to analyse the sequence space of a glycoside hydrolase family and extract information to help both experts and non-experts to optimize the discovery of new biocatalysts potentially applied in the field of human health and nutrition.
Biodiversity of organisms and their genomic content is a valuable source of enzymes, some of which can be isolated and turned into biocatalysts, useful for more sustainable and efficient industrial processes. Organisms thriving in constantly cold environments produce enzymes that may be more efficient in the cold and more thermolabile than enzymes from other organisms, and that display interesting features for the catalysis of several processes that require or are better at low temperature. In the first part of this thesis, two glycoside hydrolases of family 19 (GH19), named LYS177 and LYS188, were identified in the genome of an Antarctic Pseudomonas strain and characterized. Even though most of the characterized GH19 are chitinases, LYS177 and LYS188 showed no chitinolytic activity, but were active as lysozymes with an optimum temperature of 25-35°C, and retained 40% of their highest activity at 5°C. The temperatures of midpoint unfolding transition were estimated to be 20°C higher than their optimum of activity. Based on these features and sequence analysis, LYS177 and LYS188 can be considered cold-active phage endolysins integrated in prophagic regions of the bacterial host. Moreover, the best performing of the two, LYS177, was active and structurally stable over several days only at 4°C, indicating it as a candidate for potential application on the preservation of food and beverages during cold storage. In protein families, enzymes can rapidly acquire new specializations. Therefore, best practices should be implemented to select optimal candidates with the activity of interest and new, potentially promising, features. Characterized GH19 enzymes showed an enhanced in vivo crop defence against chitin containing pathogens and antimicrobial potentialities. In the second part of this thesis, the sequence space of the GH19 family was explored and a database was created to highlight non-described sequences potentially endowed with interesting variants. Based on global pairwise sequence identity of all proteins available in public databases, GH19s were assigned to two subfamilies, the chitinases and the endolysins. Subfamilies were further split into homologous families, which differ in the n° of characterized enzymes they harbour, in the taxonomical distribution, in the presence of accessory domains and loop insertions. Despite this heterogeneity, a core consisting of 27 amino acids around the active site, including important substrate binding residues, was inferred to be conserved between GH19 subfamilies. Thus, this shared core is suggested to be associated to the GH19 capacity to bind sugars containing N-acetyl-glucosamine. Moreover, specifically conserved positions in each subfamily alignment were identified to be a “signature” useful for predicting the substrate specialization of chitinases and endolysins, and to indicate possible outliers with different features. The GH19 evolution was also investigated through molecular phylogeny to explain the observed sequence and structural plasticity: despite endolysins were divided in an higher number of homologous families, they remained in phages and their bacterial hosts, contrary to chitinases, which spread to both prokaryotic and eukaryotic taxa, and acquired at least four loop insertions; moreover, the GH19 chitinase catalytic domain passed from plants to bacteria by horizontal gene transfer in at least two cases. In conclusion, the second part of this thesis shows how bioinformatic tools can be used to analyse the sequence space of a glycoside hydrolase family and extract information to help both experts and non-experts to optimize the discovery of new biocatalysts potentially applied in the field of human health and nutrition.
Biochemical and biophysical analysis of two Antarctic lysozyme endolysins and in silico exploration of glycoside hydrolase 19 sequence space
ORLANDO, MARCO
2020
Abstract
Biodiversity of organisms and their genomic content is a valuable source of enzymes, some of which can be isolated and turned into biocatalysts, useful for more sustainable and efficient industrial processes. Organisms thriving in constantly cold environments produce enzymes that may be more efficient in the cold and more thermolabile than enzymes from other organisms, and that display interesting features for the catalysis of several processes that require or are better at low temperature. In the first part of this thesis, two glycoside hydrolases of family 19 (GH19), named LYS177 and LYS188, were identified in the genome of an Antarctic Pseudomonas strain and characterized. Even though most of the characterized GH19 are chitinases, LYS177 and LYS188 showed no chitinolytic activity, but were active as lysozymes with an optimum temperature of 25-35°C, and retained 40% of their highest activity at 5°C. The temperatures of midpoint unfolding transition were estimated to be 20°C higher than their optimum of activity. Based on these features and sequence analysis, LYS177 and LYS188 can be considered cold-active phage endolysins integrated in prophagic regions of the bacterial host. Moreover, the best performing of the two, LYS177, was active and structurally stable over several days only at 4°C, indicating it as a candidate for potential application on the preservation of food and beverages during cold storage. In protein families, enzymes can rapidly acquire new specializations. Therefore, best practices should be implemented to select optimal candidates with the activity of interest and new, potentially promising, features. Characterized GH19 enzymes showed an enhanced in vivo crop defence against chitin containing pathogens and antimicrobial potentialities. In the second part of this thesis, the sequence space of the GH19 family was explored and a database was created to highlight non-described sequences potentially endowed with interesting variants. Based on global pairwise sequence identity of all proteins available in public databases, GH19s were assigned to two subfamilies, the chitinases and the endolysins. Subfamilies were further split into homologous families, which differ in the n° of characterized enzymes they harbour, in the taxonomical distribution, in the presence of accessory domains and loop insertions. Despite this heterogeneity, a core consisting of 27 amino acids around the active site, including important substrate binding residues, was inferred to be conserved between GH19 subfamilies. Thus, this shared core is suggested to be associated to the GH19 capacity to bind sugars containing N-acetyl-glucosamine. Moreover, specifically conserved positions in each subfamily alignment were identified to be a “signature” useful for predicting the substrate specialization of chitinases and endolysins, and to indicate possible outliers with different features. The GH19 evolution was also investigated through molecular phylogeny to explain the observed sequence and structural plasticity: despite endolysins were divided in an higher number of homologous families, they remained in phages and their bacterial hosts, contrary to chitinases, which spread to both prokaryotic and eukaryotic taxa, and acquired at least four loop insertions; moreover, the GH19 chitinase catalytic domain passed from plants to bacteria by horizontal gene transfer in at least two cases. In conclusion, the second part of this thesis shows how bioinformatic tools can be used to analyse the sequence space of a glycoside hydrolase family and extract information to help both experts and non-experts to optimize the discovery of new biocatalysts potentially applied in the field of human health and nutrition.File | Dimensione | Formato | |
---|---|---|---|
phd_unimib_823497.pdf
Open Access dal 01/05/2020
Dimensione
19.76 MB
Formato
Adobe PDF
|
19.76 MB | Adobe PDF | Visualizza/Apri |
I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/20.500.14242/170601
URN:NBN:IT:UNIMIB-170601