ABSTRACT INTRODUCTION Genetic diversity is generated by a combination of different evolutionary processes, including mutation, genetic drift, migration, and natural selection.It is well known that natural selection acts on a specific locus\variant, whereas demographic effects act on all loci in the same way; also the selection is expected to be focused on genomic positions that have a functional role. Importantly, the selected variants targeted by selection may not only have a functional role but can correlate with predisposition or protection to some specific diseases. They can therefore be prioritized in screenings for association with diseases and infections; indeed, genetic variants that are advantageous tend to increase in frequency in the population, while deleterious mutations tend to be eliminated. To identify selection, intra and inter species approaches are usually applied; comparing orthologous genes among different species is a successful approach to detect positive selection acting over long evolutionary timescales; on the other hand, comparing genetic variation within human populations may underline more recent adaptive events. Genes related to immune system are among the most studied genes from an evolutionary point of view: it is now established that infections have been acting as a major selective pressure on humans and, most likely, on all living organisms. Thus, the interactions between hosts and pathogens have shaped the genetic diversity over time on both sides: moreover the continuous arm race between hosts and pathogens creates a competition of co-evolving genes that develop adaptations and counter-adaptations against each other. Therefore, it is important to evaluate the level of genetic variation that determines advantageous phenotypic traits to identify genomic regions/positions underlying diversity and adaptation. My first study was focused on molecules involved in the regulation of T-cell activation. The activation of T lymphocytes is a complex phenomenon that is mediated by the interaction of a number of proteins expressed on the surface of T lymphocytes and antigen presenting cells (APC).Several pathogens have evolved strategies that specifically target these genes to either invade the host or to reduce the response of the immune system. Thus, on one hand,these genes have been engaged in a constant conflict with a large number of pathogens and play a fundamental role during infections; on the other hand, genetic variation at these loci has a potential impact on the development of autoimmune and inflammatory conditions. In the second study I focused on molecules involved in the antigen processing and presentation pathway (APP).Whatever the nature of the presenting molecule, the limited dimension of its cleft makes it impossible for macro molecules to be presented: only fragments (antigens) derived from the lysis of such molecules can be nested in the cleft. This antigenic repertoire is generated by the antigen processing and presentation pathway. Another study focused on the contact system and the molecules involved in this pathway. In particular this pathway represents a link between the coagulation and inflammatory responses, two systems central to host survival in the face of tissue damage and infection. Finally, the last molecules analyzed were the proteins responsible for nucleic acids recognition and the activation of the immune response against virus and bacteria, as the RIG-I like proteins and the AIM-2 like proteins AIM OF THE WORK The aim of all the studies was to investigate the evolutionary history of the genes involved in different pathways, both at the inter- and intra- specific level, and to exploit this information to provide novel insight into the functional role of these molecules in human health and disease. Also i wanted to evaluate the history of autoimmune risk alleles and how they spread in human populations. MATERIALS AND METHODS Mammalian coding sequences were retrieved from the Ensembl website and aligned using the The RevTrans 2.0 utility. For detecting the action of positive selection I used the PAML software with different evolutionary models. Sites under selection were identified using Bayes empirical Bayes and Mixed Effects Model of Evolution analyses.Genotype Data from the Pilot 1 phase of the 1000 Genomes Project were retrieved from the dedicated website for the genes analyzed and for 1,200 randomly selected RefSeq genes (control set) for three populations with different ancestry: European, African and East Asian. These data were used to calculate nucleotide diversity parameters as well as some site frequency spectrum-based statistics. Data from the control gene set were used to calculate empirical distributions of these parameters. FST, a measure of population genetic differentiation, and the DIND test, a test based on haplotype homozygosity, were calculated for all SNPs analyzed. I calculated statistical significance of these tests by obtaining an empirical distribution for variants located within the control genes. RESULTS AND DISCUSSION In my analyses I found that genes involved in the regulation of T cell activation have represented selection targets both along mammalian evolution and during the history of human populations; I also found that variants in these genes related to human diseases to be preferential targets of pathogen-driven selection. These results showed that an allele can spread in a population because it confers higher protection against some infectious agent, indicating adaptation to infection as the underlying explanation for the maintenance of a set of autoimmune risk alleles. These result has a relevance for the hygiene hypothesis, and support the idea that human adaptation to an environment with reduced presence of pathogens has determined the spread of some risk alleles for autoimmune and inflammatory diseases. We then presented a comprehensive analysis of the selective events acting on the antigen processing and presentation pathway across different evolutionary timescales, revealing a high proportion of genes under positive selection in mammalian species. Data also indicate a continuum in selective pressure acting on different timescales for some of these genes analyzed, and we also demonstrated that the selected variants in human populations were always located within regions with regulatory function and can have a role in modulating human phenotypes. The evolutionary analysis we performed about contact system genes indicated that mammalian kininogen has been a target of long-lasting and strong selective pressures. In particular our results reinforced the possibility that kininogen plays a central role in the modulation of immune response and is a target of different pathogen species. Finally our study of two different families of nucleic acid receptors showed that a proportion of these genes have been engaged in host-virus genetic conflict leading to a continuous host–pathogen arms race scenario, and again our results provide functional information about variants that might affect immunologic phenotypes. CONCLUSIONS Results in all these studies showed how natural selection shaped diversity in different pathway involved in the immune response, and selected sites involve positions of fundamental importance to the protein function. These novel data give rise to a number of experimentally testable hypothesis concerning the role of specific sites or regions as modulators of immunological phenotypes; they also suggest caution when extrapolating results from specific experiments in model organisms, as a considerable portion of genetic diversity in these molecules has accumulated not as a result of neutral processes but in response to adaptive events.

HOW NATURAL SELECTION SHAPED DIVERSITY AT IMMUNE RESPONSE GENES AND AUTOIMMUNE RISK ALLELES DURING MAMMALIAN EVOLUTION

FORNI, DIEGO
2015

Abstract

ABSTRACT INTRODUCTION Genetic diversity is generated by a combination of different evolutionary processes, including mutation, genetic drift, migration, and natural selection.It is well known that natural selection acts on a specific locus\variant, whereas demographic effects act on all loci in the same way; also the selection is expected to be focused on genomic positions that have a functional role. Importantly, the selected variants targeted by selection may not only have a functional role but can correlate with predisposition or protection to some specific diseases. They can therefore be prioritized in screenings for association with diseases and infections; indeed, genetic variants that are advantageous tend to increase in frequency in the population, while deleterious mutations tend to be eliminated. To identify selection, intra and inter species approaches are usually applied; comparing orthologous genes among different species is a successful approach to detect positive selection acting over long evolutionary timescales; on the other hand, comparing genetic variation within human populations may underline more recent adaptive events. Genes related to immune system are among the most studied genes from an evolutionary point of view: it is now established that infections have been acting as a major selective pressure on humans and, most likely, on all living organisms. Thus, the interactions between hosts and pathogens have shaped the genetic diversity over time on both sides: moreover the continuous arm race between hosts and pathogens creates a competition of co-evolving genes that develop adaptations and counter-adaptations against each other. Therefore, it is important to evaluate the level of genetic variation that determines advantageous phenotypic traits to identify genomic regions/positions underlying diversity and adaptation. My first study was focused on molecules involved in the regulation of T-cell activation. The activation of T lymphocytes is a complex phenomenon that is mediated by the interaction of a number of proteins expressed on the surface of T lymphocytes and antigen presenting cells (APC).Several pathogens have evolved strategies that specifically target these genes to either invade the host or to reduce the response of the immune system. Thus, on one hand,these genes have been engaged in a constant conflict with a large number of pathogens and play a fundamental role during infections; on the other hand, genetic variation at these loci has a potential impact on the development of autoimmune and inflammatory conditions. In the second study I focused on molecules involved in the antigen processing and presentation pathway (APP).Whatever the nature of the presenting molecule, the limited dimension of its cleft makes it impossible for macro molecules to be presented: only fragments (antigens) derived from the lysis of such molecules can be nested in the cleft. This antigenic repertoire is generated by the antigen processing and presentation pathway. Another study focused on the contact system and the molecules involved in this pathway. In particular this pathway represents a link between the coagulation and inflammatory responses, two systems central to host survival in the face of tissue damage and infection. Finally, the last molecules analyzed were the proteins responsible for nucleic acids recognition and the activation of the immune response against virus and bacteria, as the RIG-I like proteins and the AIM-2 like proteins AIM OF THE WORK The aim of all the studies was to investigate the evolutionary history of the genes involved in different pathways, both at the inter- and intra- specific level, and to exploit this information to provide novel insight into the functional role of these molecules in human health and disease. Also i wanted to evaluate the history of autoimmune risk alleles and how they spread in human populations. MATERIALS AND METHODS Mammalian coding sequences were retrieved from the Ensembl website and aligned using the The RevTrans 2.0 utility. For detecting the action of positive selection I used the PAML software with different evolutionary models. Sites under selection were identified using Bayes empirical Bayes and Mixed Effects Model of Evolution analyses.Genotype Data from the Pilot 1 phase of the 1000 Genomes Project were retrieved from the dedicated website for the genes analyzed and for 1,200 randomly selected RefSeq genes (control set) for three populations with different ancestry: European, African and East Asian. These data were used to calculate nucleotide diversity parameters as well as some site frequency spectrum-based statistics. Data from the control gene set were used to calculate empirical distributions of these parameters. FST, a measure of population genetic differentiation, and the DIND test, a test based on haplotype homozygosity, were calculated for all SNPs analyzed. I calculated statistical significance of these tests by obtaining an empirical distribution for variants located within the control genes. RESULTS AND DISCUSSION In my analyses I found that genes involved in the regulation of T cell activation have represented selection targets both along mammalian evolution and during the history of human populations; I also found that variants in these genes related to human diseases to be preferential targets of pathogen-driven selection. These results showed that an allele can spread in a population because it confers higher protection against some infectious agent, indicating adaptation to infection as the underlying explanation for the maintenance of a set of autoimmune risk alleles. These result has a relevance for the hygiene hypothesis, and support the idea that human adaptation to an environment with reduced presence of pathogens has determined the spread of some risk alleles for autoimmune and inflammatory diseases. We then presented a comprehensive analysis of the selective events acting on the antigen processing and presentation pathway across different evolutionary timescales, revealing a high proportion of genes under positive selection in mammalian species. Data also indicate a continuum in selective pressure acting on different timescales for some of these genes analyzed, and we also demonstrated that the selected variants in human populations were always located within regions with regulatory function and can have a role in modulating human phenotypes. The evolutionary analysis we performed about contact system genes indicated that mammalian kininogen has been a target of long-lasting and strong selective pressures. In particular our results reinforced the possibility that kininogen plays a central role in the modulation of immune response and is a target of different pathogen species. Finally our study of two different families of nucleic acid receptors showed that a proportion of these genes have been engaged in host-virus genetic conflict leading to a continuous host–pathogen arms race scenario, and again our results provide functional information about variants that might affect immunologic phenotypes. CONCLUSIONS Results in all these studies showed how natural selection shaped diversity in different pathway involved in the immune response, and selected sites involve positions of fundamental importance to the protein function. These novel data give rise to a number of experimentally testable hypothesis concerning the role of specific sites or regions as modulators of immunological phenotypes; they also suggest caution when extrapolating results from specific experiments in model organisms, as a considerable portion of genetic diversity in these molecules has accumulated not as a result of neutral processes but in response to adaptive events.
20-gen-2015
Inglese
Natural Selection; positive selection; mammalian evolution; immune response genes; human evolution
COMI, GIACOMO PIETRO
CLERICI, MARIO SALVATORE
Università degli Studi di Milano
File in questo prodotto:
File Dimensione Formato  
phd_unimi_R09723.pdf

accesso aperto

Dimensione 34.8 MB
Formato Adobe PDF
34.8 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/85422
Il codice NBN di questa tesi è URN:NBN:IT:UNIMI-85422