TY - JOUR
T1 - Investigation of de novo totally random biosequences. Part II. On the folding frequency in a totally random library of de novo proteins obtained by phage display
AU - Chiarabelli, Cristiano
AU - Vrijbloed, Jan W.
AU - De Lucrezia, Davide
AU - Thomas, Richard M.
AU - Stano, Pasquale
AU - Polticelli, Fabio
AU - Ottone, Tiziana
AU - Papa, Ester
AU - Luisi, Pier Luigi
PY - 2006
Y1 - 2006
N2 - We present an investigation on theoretically possible protein structures which have not been selected by evolution and are, therefore, not present on our Earth ('Never Born Proteins' (NBP)). In particular, we attempt to assess whether and to what extent such polypeptides might be folded, thus acquiring a globular protein status. A library (ca. 109 clones) of totally random polypeptides, with a length of 50 amino acids, has been produced by phage display. The only structural bias in these sequences is a tripeptide substrate for thrombin: PRG, chosen according to the criteria described in the preceding Part I of this series. The presence of this substrate in an otherwise totally random sequence forms the basis for a qualitative experimental criterion which distinguishes unfolded from folded proteins, as folded proteins are more protected from protease digestion than unfolded ones. The investigation of 79 sequences, randomly selected from the initially large library, shows that over 20% of this population is thrombin-resistant, likely due to folding. Analysis of the amino acid sequences of these clones shows no significant homology to extant proteins, which indicates that they are indeed totally de novo. A few of these sequences have been expressed, and here we describe the structural properties of two thrombin-resistant randomly selected ones. These two de novo proteins have been characterized by spectroscopic methods and, in particular, by circular dichroism. The data show a stable three-dimensional folding, which is temperature-resistant and can be reversibly denatured by urea. The consequences of this finding within a library of 'Never Born Proteins' are discussed in terms of molecular evolution.
AB - We present an investigation on theoretically possible protein structures which have not been selected by evolution and are, therefore, not present on our Earth ('Never Born Proteins' (NBP)). In particular, we attempt to assess whether and to what extent such polypeptides might be folded, thus acquiring a globular protein status. A library (ca. 109 clones) of totally random polypeptides, with a length of 50 amino acids, has been produced by phage display. The only structural bias in these sequences is a tripeptide substrate for thrombin: PRG, chosen according to the criteria described in the preceding Part I of this series. The presence of this substrate in an otherwise totally random sequence forms the basis for a qualitative experimental criterion which distinguishes unfolded from folded proteins, as folded proteins are more protected from protease digestion than unfolded ones. The investigation of 79 sequences, randomly selected from the initially large library, shows that over 20% of this population is thrombin-resistant, likely due to folding. Analysis of the amino acid sequences of these clones shows no significant homology to extant proteins, which indicates that they are indeed totally de novo. A few of these sequences have been expressed, and here we describe the structural properties of two thrombin-resistant randomly selected ones. These two de novo proteins have been characterized by spectroscopic methods and, in particular, by circular dichroism. The data show a stable three-dimensional folding, which is temperature-resistant and can be reversibly denatured by urea. The consequences of this finding within a library of 'Never Born Proteins' are discussed in terms of molecular evolution.
UR - http://www.scopus.com/inward/record.url?scp=33748541979&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33748541979&partnerID=8YFLogxK
U2 - 10.1002/cbdv.200690088
DO - 10.1002/cbdv.200690088
M3 - Article
C2 - 17193317
AN - SCOPUS:33748541979
VL - 3
SP - 840
EP - 859
JO - Chemistry and Biodiversity
JF - Chemistry and Biodiversity
SN - 1612-1872
IS - 8
ER -