I currently work in the Institute of Genomics at Huaqiao University as an assistant professor. I am broadly interested in bioinformatics, genomics, structural biology, evolution and deep learning.
My recent researches focus on RNA structure, translation process and long non-coding RNA.
PhD in Bioinformatics, 2018
Technical University of Munich
MMed in Microbiology and Biochemical Pharmacy, 2012
East China University of Science and Technology
BEng in Bioengineering, 2009
East China University of Science and Technology
The speed of mRNA translation depends in part on the amino acid to be incorporated into the nascent chain. Peptide bond formation is especially slow with proline and two adjacent prolines can even cause ribosome stalling. While previous studies focused on how the amino acid context of a Pro-Pro motif determines the stalling strength, we extend this question to the mRNA level. Bioinformatics analysis of the Escherichia coli genome revealed significantly differing codon usage between single and consecutive prolines. We therefore developed a luminescence reporter to detect ribosome pausing in living cells, enabling us to dissect the roles of codon choice and tRNA selection as well as to explain the genome scale observations. Specifically, we found a strong selective pressure against CCC/U-C, a sequon causing ribosomal frameshifting even under wild-type conditions. On the other hand, translation efficiency as positive evolutionary driving force led to an overrepresentation of CCG. This codon is not only translated the fastest, but the corresponding prolyl-tRNA reaches almost saturating levels. By contrast, CCA, for which the cognate prolyl-tRNA amounts are limiting, is used to regulate pausing strength. Thus, codon selection both in discrete positions but especially in proline codon pairs can tune protein copy numbers.
Here, we developed a genome-wide approach to discovering self-cleaving ribozymes and identified a naturally occurring ribozyme in humans. The secondary structure and biochemical properties of this ribozyme indicate that it belongs to an unidentified class of small, self-cleaving ribozymes. The sequence of the ribozyme exhibits a clear evolutionary path, from its appearance between ~130 and ~65 million years ago (Ma), to acquiring self-cleavage activity very recently, ~13–10 Ma, in the common ancestors of humans, chimpanzees and gorillas.
Translation of consecutive prolines causes ribosome stalling, and thus they are under nagative evolutionary selection which is especially pronounced in proteins with high translational efficiency. However, the time gain caused by ribosome pausing at polyproline motifs might be advantageous in protein regions bracketing domains and transmembrane helices. Polyproline motifs might therefore be crucial for co-translational folding and membrane insertion.
In the coding regions of mRNAs, high Tm regions contain thermo-stable functionally important RNA structures, which impose relaxed evolutionary constraint on sequence as long as the base-pairing patterns remain intact. By contrast, low thermostability regions contain single-stranded functionally important conserved RNA sequence elements accessible for binding by other molecules.