The semi-automatic building of QM/MM models of rhodopsins has been recently proposed, by means of a new technology called Automatic Rhodopsin Model protocol. In its original version, here called original ARM protocol, published in 2016, the constructed QM/MM models were found to be useful for biophysical studies and for protein engineering, but had the disadvantage of being time-consuming to prepare, error-prone, and, also, difficult to replicate when the same model was independently constructed by different investigators. These issues were a consequence of the <> (i.e., not fully automated) nature of the protocol, since (i) the generation of its input was achieved through <> of the template structure and (ii) the code (i.e., the computer program) for the construction of the QM/MM model was written as a non homogeneous collection of bash scripts, not driven by a parent program. Such methodological and computational pitfalls impaired the possibility of comparatively studying hundreds of rhodopsins (i.e., light-sensitive proteins belonging to the same superfamily), as well as hoping that, in the future, a similar protocol could be generalized to other families of light-responsive proteins (e.g., Xanthorhodopsin, phytochromes or synthetic rhodopsin mimics) of interest for biological or biotechnological applications. In order to overcome the above drawbacks, this doctoral Thesis is devoted to the design of a substantially improved ARM methodological framework, characterized by a fully automated, rather than manual, construction of QM/MM models. Accordingly, I introduce the blueprinting of four different ARM-based fully automatic protocols for the QM/MM modeling of rhodopsin electronically excited states. Furthermore, I present their implementation into a new, user-friendly, Python-based software package, called ARM software package, conceived for allowing the use of each protocol via a ``one-click'' command given either at the command-line or, in certain cases, Web-interface levels. Finally, I report on the performance of the four ARM-based protocols, highlighting both their methodological and scientific capabilities as well as their current limitations. To do so, I have constructed and employed a benchmark set of about 150 wild-type and mutant rhodopsins, as well as carried out selected applications, directed to the prediction of trends in light-induced properties, including absorption and emission spectra, as well as excited-state molecular dynamics. Such trends unveil different mechanistic aspects of color tuning and fluorescence emission, as well as, more in perspective, the systematic prediction of photoisomerization quantum yields. In conclusion, the research carried out during my doctoral Thesis has generated and explored novel, automated, ARM-based research tools and, most importantly, a programming framework called the ARM package. I believe that these tools and package have the potential to be generalized, thanks to their characteristics that I will thoroughly describe. In other words, my hope is that the research line started with my thesis will not only be useful for achieving better performing QM/MM models of rhodopsins but be expanded to deal with other sets of light-responsive proteins useful, for instance, in optogenetic studies.
Blueprinting, implementation, and application of fully automatic protocols for the QM/MM modeling of photo-excited states of rhodopsin variants
PEDRAZA GONZÁLEZ, LAURA MILENA
2021
Abstract
The semi-automatic building of QM/MM models of rhodopsins has been recently proposed, by means of a new technology called Automatic Rhodopsin Model protocol. In its original version, here called original ARM protocol, published in 2016, the constructed QM/MM models were found to be useful for biophysical studies and for protein engineering, but had the disadvantage of being time-consuming to prepare, error-prone, and, also, difficult to replicate when the same model was independently constructed by different investigators. These issues were a consequence of the <> (i.e., not fully automated) nature of the protocol, since (i) the generation of its input was achieved through <> of the template structure and (ii) the code (i.e., the computer program) for the construction of the QM/MM model was written as a non homogeneous collection of bash scripts, not driven by a parent program. Such methodological and computational pitfalls impaired the possibility of comparatively studying hundreds of rhodopsins (i.e., light-sensitive proteins belonging to the same superfamily), as well as hoping that, in the future, a similar protocol could be generalized to other families of light-responsive proteins (e.g., Xanthorhodopsin, phytochromes or synthetic rhodopsin mimics) of interest for biological or biotechnological applications. In order to overcome the above drawbacks, this doctoral Thesis is devoted to the design of a substantially improved ARM methodological framework, characterized by a fully automated, rather than manual, construction of QM/MM models. Accordingly, I introduce the blueprinting of four different ARM-based fully automatic protocols for the QM/MM modeling of rhodopsin electronically excited states. Furthermore, I present their implementation into a new, user-friendly, Python-based software package, called ARM software package, conceived for allowing the use of each protocol via a ``one-click'' command given either at the command-line or, in certain cases, Web-interface levels. Finally, I report on the performance of the four ARM-based protocols, highlighting both their methodological and scientific capabilities as well as their current limitations. To do so, I have constructed and employed a benchmark set of about 150 wild-type and mutant rhodopsins, as well as carried out selected applications, directed to the prediction of trends in light-induced properties, including absorption and emission spectra, as well as excited-state molecular dynamics. Such trends unveil different mechanistic aspects of color tuning and fluorescence emission, as well as, more in perspective, the systematic prediction of photoisomerization quantum yields. In conclusion, the research carried out during my doctoral Thesis has generated and explored novel, automated, ARM-based research tools and, most importantly, a programming framework called the ARM package. I believe that these tools and package have the potential to be generalized, thanks to their characteristics that I will thoroughly describe. In other words, my hope is that the research line started with my thesis will not only be useful for achieving better performing QM/MM models of rhodopsins but be expanded to deal with other sets of light-responsive proteins useful, for instance, in optogenetic studies.File | Dimensione | Formato | |
---|---|---|---|
phd_unisi_076319_1.pdf
accesso aperto
Dimensione
5.49 MB
Formato
Adobe PDF
|
5.49 MB | Adobe PDF | Visualizza/Apri |
phd_unisi_076319_2.pdf
accesso aperto
Dimensione
6.96 MB
Formato
Adobe PDF
|
6.96 MB | Adobe PDF | Visualizza/Apri |
phd_unisi_076319_3.pdf
accesso aperto
Dimensione
5.58 MB
Formato
Adobe PDF
|
5.58 MB | Adobe PDF | Visualizza/Apri |
phd_unisi_076319_4.pdf
accesso aperto
Dimensione
3.78 MB
Formato
Adobe PDF
|
3.78 MB | Adobe PDF | Visualizza/Apri |
phd_unisi_076319_5.pdf
accesso aperto
Dimensione
9.53 MB
Formato
Adobe PDF
|
9.53 MB | Adobe PDF | Visualizza/Apri |
phd_unisi_076319_6.pdf
accesso aperto
Dimensione
9.19 MB
Formato
Adobe PDF
|
9.19 MB | Adobe PDF | Visualizza/Apri |
phd_unisi_076319_7.pdf
accesso aperto
Dimensione
10.16 MB
Formato
Adobe PDF
|
10.16 MB | Adobe PDF | Visualizza/Apri |
phd_unisi_076319_8.pdf
accesso aperto
Dimensione
5.95 MB
Formato
Adobe PDF
|
5.95 MB | Adobe PDF | Visualizza/Apri |
I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/20.500.14242/175394
URN:NBN:IT:UNISI-175394