Translation initiation is a critical rate-limiting step in protein synthesis. It plays a pivotal role in determining cellular protein concentrations by setting an upper bound on the overall protein synthesis from a specific transcript. Factors such as mRNA structural stability, coding sequence length, and specific motifs influence initiation rates and collectively regulate the in vivo translation of mRNA transcripts and thus the overall protein synthesis.
We developed a machine learning model that predicts the in vivo translation initiation rates in S. cerevisiae transcripts using mRNA sequence features as input. The model was trained using in vivo initiation rates computed from wild-type S. cerevisiae transcripts. It accurately predicts the initiation rates for both endogenous and exogenous transcripts. The model is deployed as a web server – TIR predictor. It requires mRNA sequence and start and stop codon positions in an Excel file. You can also provide these details manually to the TIR predictor web server.
We also developed a tool for gene optimization – TIR optimizer. This allows optimizing the gene sequence (i.e., the 5′ untranslated region and coding sequence) for a target initiation rate. You can access the TIR predictor and optimizer via the following button.
The algorithm optimizes any given sequences for a specified initiation rate, considering nucleotide substitutions in UTRs and integrating codon and UTR substitutions in CDS and UTR regions.
The link for the webserver is freely available in the link below.