mRNAutilus: Multi-Objective-Guided Discrete Generation of mRNA with Optimized Therapeutic Properties

01 — The Challenge

Why mRNA Design
Needs a New Approach

Messenger RNA therapeutics have transformed medicine[1]. The COVID-19 pandemic proved that mRNA vaccines can be developed, manufactured, and distributed in mere months[2]. But beneath this success lies a persistent challenge: mRNAs are inherently unstable, and designing one that is simultaneously stable, efficiently translated, and highly expressed remains a multi-objective optimization problem that current methods struggle to address systematically.

An mRNA molecule is more than its coding sequence. The 5' untranslated region (UTR) initiates translation, the coding sequence (CDS) determines the protein product through codon choice, and the 3'UTR governs stability and localization. Critically, these regions interact: inter-region base-pairing, secondary structures at the 5'UTR-CDS junction, and 3'UTR structural elements all influence the overall fitness of an mRNA. Yet most existing design methods optimize each component in isolation.

Traditional codon optimization considers codons independently, substituting them with synonymous alternatives based on species-specific usage tables. More recent approaches use separate language models for each mRNA component — one for the CDS, one for the 5'UTR, one for the 3'UTR — and combine them post hoc. This component-by-component strategy requires lab-in-the-loop screening and misses the cross-region dependencies that determine overall function. Additionally, mRNA expression is highly dependent on the gene in consideration and differ markedly between tissues due to variations in intracellular machinery, metabolism, and innate immune activity.

The Central Question

Can we generate a complete, functional mRNA transcript — UTRs and optimized coding sequence together — in a single generative process, guided simultaneously toward multiple therapeutic properties?

mRNAutilus is the first framework to do exactly this, generating full-length mRNA transcripts with Pareto-optimal therapeutic properties through simultaneous codon optimization and de novo UTR design.

02 — The Model

mRNAutilus: Masked Diffusion
Meets Tree Search

mRNAutilus — mRNA generation via Unrolled Trajectories and Informed Latent UpdateS — is a 150M-parameter masked discrete diffusion model[3,4] trained on 14.2 million full-length mRNA sequences from 342 vertebrate species[5]. It uses a hybrid tokenization scheme: codons in the CDS are tokenized as 3-mers, while UTR nucleotides are encoded individually, preserving the natural granularity of each region within a unified model.

The model starts from a fully masked sequence and progressively unmasks tokens, capturing bidirectional dependencies across the entire transcript. During conditional generation, a template protein coding sequence can be partially masked, allowing simultaneous codon optimization and de novo UTR generation. Importantly, biological constraints are enforced at every step: only synonymous codons (encoding the same amino acid) are permitted during CDS optimization, guaranteeing the designed mRNA produces the intended amino acid sequence.

Core Architecture

Figure 1: Overview of mRNAutilus — **Figure 1.** Overview of mRNAutilus. **(A)** Masked diffusion model pretraining and *de novo* mRNA sequence generation. **(B)** mRNA function and property prediction using XGBoost regression analysis from model embeddings. **(C)** Multi-objective codon optimization and UTR generation using Monte Carlo Tree Guidance, which leverages tree search to optimize a set of Pareto-optimal sequences. **(D)** *In vitro* experiments demonstrate that zero-shot mRNAs generated with mRNAutilus achieve heightened expression and half-life over those produced by other machine learning-driven methods and commercial mRNAs.

Masked Discrete Diffusion

Bidirectional generation captures long-range dependencies

Unlike autoregressive models that generate left-to-right, the masked diffusion framework unmasks tokens in any order, capturing interactions between distant UTR and CDS regions. Trained with a continuous-time NELBO objective on the Ensembl vertebrate mRNA corpus.

Hybrid Tokenization

Codons and nucleotides in one vocabulary

The CDS is tokenized by codon (61 coding + start/stop), while UTRs use single-nucleotide resolution with IUPAC codes. This 86-token vocabulary enables the model to jointly reason about codon identity and UTR sequence in a single forward pass.

Monte Carlo Tree Guidance

Multi-objective Pareto optimization during sampling

At inference time, Monte Carlo Tree Guidance (MCTG) explores the denoising trajectory as a search tree, expanding promising branches via Pareto dominance over multiple property regressors. The result is a set of non-dominated mRNA sequences optimizing half-life, translation efficiency, and protein abundance simultaneously.

Property Prediction from Embeddings

mRNAutilus embeddings serve two roles: they drive generation and enable lightweight property prediction. We train XGBoost regressors on mRNAutilus embeddings to predict three key mRNA properties from sequence alone, including half-life, translation efficiency, and protein abundance datasets used for guidance[9,10,11].

Half-life — 13,000 transcripts across 39 human samples Translation efficiency — ribosome profiling data from 1,282 human and 995 mouse datasets Protein abundance — mRNA-seq and ribosomal profiling in human B2 and U2OS cells

Despite their simplicity, these regressors are competitive with models up to 45× larger. On half-life and protein abundance, mRNAutilus (150M parameters) outperforms Helix-mRNA (6M), HyenaDNA (7M), and RiNALMo (150M), and matches the 7B-parameter Evo-2 model. These lightweight predictors serve as the reward functions that guide MCTG during generation — no gradient computation required.

Figure 2: Property Regressor Correlations — **Figure 2.** Correlation plots for property regressors. **(A)** Half-life, **(B)** ribosome profiling, and **(C)** protein abundance correlation for validation (left) and training (right) data. Optimization curves for **(D)** half-life, **(E)** translation rate, and **(F)** translation efficiency for conditional generation of *P. pyralis* luciferase mRNA via Monte Carlo Tree Guidance (MCTG). Shown are Pareto front scores with standard error (purple), WT *P. pyralis* luciferase score (navy), and median classifier property score from the entire dataset (teal). **(G)** Predicted structure for a single generated *P. pyralis* luciferase mRNA colored by sequence component (5'UTR: teal, CDS: purple, 3'UTR: navy).

03 — Monte Carlo Tree Guidance

Navigating the Pareto Front
Through Tree Search

Designing a therapeutic mRNA requires optimizing multiple competing properties simultaneously — half-life, translation efficiency, and protein abundance each pull the sequence in different directions. Rather than optimizing a single scalar objective, mRNAutilus frames mRNA design as a multi-objective optimization problem and searches for the set of Pareto-optimal solutions: sequences where no property can be improved without degrading another.

To navigate this trade-off landscape, we leverage Monte Carlo Tree Guidance (MCTG)[6], a gradient-free algorithm that steers the masked diffusion model's denoising process toward Pareto-optimal mRNA sequences. MCTG treats each step of the unmasking trajectory as a node in a search tree and uses Monte Carlo rollouts to evaluate which unmasking decisions lead to the best multi-property outcomes.

How MCTG Works

The MCTG algorithm operates in four steps, which iteratively optimize multi-objective rewards:

1 — Selection

Choose the most promising branch

From expanded nodes, compute a selection score balancing cumulative reward and exploration. Select from the Pareto-optimal set of children based on normalized scores, favoring both high-reward and under-explored branches.

2 — Expansion

Sample diverse unmasking candidates

At a leaf node, sample M distinct child sequences by applying Gumbel noise to the denoising distribution. Randomly unmask k positions per child, enforcing synonymous-codon constraints to preserve the intended protein.

3 — Rollout

Evaluate with property regressors

Complete each child via ancestral sampling to obtain a fully unmasked sequence. Score it across all property regressors. Compute a reward vector measuring Pareto dominance over the current optimal set.

4 — Backpropagation

Update the search tree

Propagate reward vectors from children back to the root, accumulating rewards and visit counts. Future selections are biased toward unmasking paths that consistently lead to Pareto-dominant sequences.

Biological Constraints During Search

At every expansion step, MCTG enforces hard biological constraints to guarantee valid mRNA output. Codon tokens cannot appear in UTR positions, and for any masked codon, only synonymous codons encoding the same amino acid as the template are permitted. This ensures the designed mRNA always encodes the intended amino acid sequence.

The result is a set of non-dominated mRNA sequences spanning the Pareto front across half-life, translation efficiency, and protein abundance. Researchers can then select from this front based on their application-specific priorities — for instance, favoring durability for a vaccine or peak expression for a reporter assay.

Why Tree Search?

Unlike gradient-based guidance that requires differentiable reward functions, MCTG works with any black-box predictor — including the lightweight XGBoost regressors trained on mRNAutilus embeddings. This makes multi-objective guidance practical even in low-data settings where training a differentiable surrogate would be unreliable.

04 — Unconditional Generation

Generated mRNAs Match
Natural Distributions

Before applying property guidance, we verified that mRNAutilus generates biologically plausible mRNA sequences. Libraries of unconditionally generated mRNAs were compared against vertebrate mRNAs from the Ensembl training corpus across multiple biophysical metrics.

Generated sequences exhibit higher GC content (a hallmark of efficient translation), lower minimum free energy (indicating greater thermodynamic stability), and natural Kozak consensus frequencies. Critically, the generated libraries maintain sequence diversity comparable to natural mRNAs, with nearly identical Shannon entropy distributions — ruling out degenerate repetitive sampling. The model learns to produce diverse, stable, translation-ready sequences without any explicit optimization.

Figure 3: In Silico mRNA Generation Benchmarking — **Figure 3.** Metric-driven evaluation of *in silico* full-sequence mRNA generation methods. A library of 250 luciferase mRNAs was produced using mRNAutilus, randomness, and GEMORNA. All three methods were prompted to design a luciferase mRNA encoding the GenScript F-Luc protein. **(A)** Coding sequence comparison using all three methods. Considered metrics are codon adaptation index (CAI), length-normalized minimum-free energy (MFE), optimal codon frequency, GC content, and uridine percentage. Metric distributions are also shown for all generated **(B)** 5′UTRs and **(C)** 3′UTRs.

We compared mRNAutilus-designed full-length luciferase mRNAs against sequences produced by GEMORNA[7] (an autoregressive component-by-component approach) and random sampling. Across codon adaptation index, minimum free energy, optimal codon frequency, and uridine percentage, mRNAutilus-designed CDS sequences achieved consistently improved metrics. Generated UTRs showed greater thermodynamic stability and diversity than both alternatives.

05 — Experimental Validation

Zero-Shot mRNAs That
Outperform the State of the Art

The true test of any generative model is the wet lab. mRNAutilus-designed mRNAs were synthesized and tested in vitro across three targets with distinct biological and therapeutic relevance — P. pyralis luciferase, SARS-CoV-2 spike glycoprotein, and PEMax (a prime-editor payload). All designs were generated zero-shot — without any iterative laboratory optimization — drawn from libraries jointly optimized for half-life and translation efficiency, and benchmarked against human alpha-globin (HAB) UTR composition as an in silico baseline. For each target, the top five designs by predicted fold-change over the HAB baseline were synthesized by in vitro transcription and evaluated in cell-based expression assays.

Target 01

P. pyralis Luciferase (Fluc)

Standard reporter protein for mRNA expression. Zero-shot designs vastly exceed wild-type across multiple human cell lines.

Zero-shot mRNAutilus designs produced luminescence signals comparable to or significantly greater than wild-type control in HEK293T cells, with designed sequences exhibiting approximately 400-fold higher expression than wild-type P. pyralis luciferase at 48h post-transfection. Across three additional cell lines (Jurkat, A549, HepG2), the same mRNAutilus designs consistently outperformed both zero-shot GEMORNA-designed luciferase mRNA (GMR-FL) and the composition combining GenScript F-Luc CDS with human alpha-globin UTRs.

Figure 4: In vitro mRNA evaluation — **Figure 4.** mRNAs designed by mRNAutilus achieve superior zero-shot performance over competing methods *in vitro*. **(A)** Schematic of the experimental validation workflow for mRNAutilus-designed sequences. **(B)** Absolute expression levels for *P. pyralis* luciferase mRNAs. **(C)** Normalized expression levels over time in HEK293T cells (n=3 biological replicates). Normalized expressions compared to a zero-shot GEMORNA-designed luciferase mRNA and a GenScript F-Luc mRNA with human alpha-globin UTRs in **(D)** Jurkat, **(E)** A549, and **(F)** HepG2 cells. **(G)** Zero-shot mRNAs generated for *SARS-CoV-2* Spike glycoprotein compared alongside several benchmarks. Displayed are normalized expressions in A549 cells.(H) PEMax mRNAs were designed using mRNAutillus and show more durable expression over a zero-shot GEMORNA PEMax mRNA and the commercially-available GenScript PEMax mRNA. **(I)** In particular, ABW-PEMax-2 achieves improved editing efficiency over the GenScript PEMax mRNA, with other designed sequences showing comparable efficacy.

~400×

vs. wild-type at 48h

3 cell lines

tested (Jurkat, A549, HepG2)

Outperformed

GEMORNA & commercial

Target 02

SARS-CoV-2 Spike Glycoprotein

Primary antigen in COVID-19 mRNA vaccines. Template: GenScript Spike mRNA v2 ORF. Exceeds clinically used and commercial benchmarks.

3 of 4

designs exceed BNT162b2

Match/Exceed

lab-in-the-loop GEMORNA

Improved

intracellular stability

Three zero-shot mRNAutilus designs (ABW-Spike-1, ABW-Spike-2, ABW-Spike-3) expressed above both BNT162b2 (Pfizer/BioNTech) and the commercial GenScript Spike v2 mRNA in A549 cells. Across all measured time points, these designs matched the expression profile of the lab-in-the-loop multi-shot GEMORNA sequence (GMR-CV-F2), with ABW-Spike-1 additionally exhibiting improved intracellular stability.

Target 03

PEMax Prime Editor

Programmable gene-editing payload. Zero-shot designs match commercial expression and exceed lab-in-the-loop baselines on editing efficiency.

Comparable

expression vs. GenScript PEMax

Higher

T→A editing (ABW-PEMax-2)

~10×

vs. GEMORNA zero-shot PEMax

mRNAutilus-designed constructs for prime editing (PEMax) similarly outperformed baselines, highlighted by more durable expression over commercial mRNAs and improved gene editing efficiency. Beyond reporters, antigens, and enzymes, we further demonstrated generality to functionally demanding intracellular payloads using ubiquibodies (uAbs) — synthetic E3 ligases where the substrate-binding domain of CHIP is replaced with peptide binders[12], enabling programmable degradation of disease-relevant targets. mRNAutilus-designed uAb mRNAs targeting β-catenin showed enhanced expression and functional efficacy, establishing the framework as broadly applicable across diverse therapeutic modalities.

Target 04

Ubiquibody (uAb) Degraders

Synthetic E3 ligases where the substrate-binding domain of CHIP is replaced with peptide binders[12], enabling programmable degradation of disease-relevant targets via the ubiquitin-proteasome system.

mRNAutilus-designed uAb mRNAs targeting β-catenin showed markedly higher transcript abundance than the human α-globin-based positive control at both 9 h and 24 h post-transfection in DLD-1 cells. Optimized constructs fully retained downstream degrader function in the TOPFlash Wnt/β-catenin reporter assay, and immunoblot analysis confirmed significant proteasome-dependent reductions in endogenous β-catenin for uAb1, uAb2, and uAb4 (reversed by MG132 co-treatment) — establishing mRNAutilus as broadly applicable to programmable intracellular therapeutic modalities.

Figure 5: Therapeutic uAb mRNA design and in vitro evaluation — **Figure 5.** Design of therapeutic uAb mRNAs with mRNAutilus and *in vitro* evaluation. **(A)** Architecture and mechanism of the uAb degradation system with sequences optimized by mRNAutilus. CHIP∆TPR fused to the C-terminus of β-catenin-specific peptides targets endogenous proteins for ubiquitin-mediated degradation via the proteasome following mRNA transfection. **(B)** Quantitative PCR analysis of uAb mRNA expression levels at 9 h and 24 h post-transfection, normalized to the untreated control. **(C)** TOPFlash luciferase reporter assay of Wnt/β-catenin transcriptional activity; FOPFlash served as negative control. **(D)** Degradation of endogenous β-catenin in cytosolic fractions of DLD-1 cells by immunoblotting (left) and densitometric analysis (right, n=3). *p≤0.05; ns, not significant.

06 — Embedding Benchmarks

Competitive with Models
45× Its Size

We benchmarked mRNAutilus embeddings for property prediction against leading nucleic acid language models, training identical XGBoost regressors on each model's embeddings. The key result from the manuscript is not a single translation-efficiency score, but the full tradeoff: mRNAutilus delivers the strongest half-life prediction and is effectively tied for best protein abundance while using far fewer parameters than Evo-2.

Validation R² by Task

Half-Life

mRNAutilus150M parameters

0.551

Evo-27B parameters

0.538

Helix-mRNA6M parameters

0.481

Translation Efficiency

Evo-27B parameters

0.794 (TE)

Helix-mRNA6M parameters

0.767 (TE)

mRNAutilus150M parameters

0.716 (TE)

Protein Abundance

Evo-27B parameters

0.566

mRNAutilus150M parameters

0.565

Helix-mRNA6M parameters

0.544

mRNAutilus is the top model on half-life prediction (R² = 0.551), exceeding Evo-2 and Helix-mRNA despite being far smaller than Evo-2. On protein abundance, it is essentially tied for best overall, trailing Evo-2 by just 0.001 (0.565 vs. 0.566). Evo-2 and Helix-mRNA embeddings are demonstrably better for predicting translation efficiency, yet mRNAutilus remains competitive while offering the strongest overall balance across all three tasks.

07 — Looking Forward

Programmable Methods for
Genetic Medicines

mRNAutilus establishes a new paradigm for mRNA design: a single generative process that jointly optimizes codon usage, UTR sequences, and their cross-region interactions toward multiple therapeutic properties. Unlike prior methods that design components in isolation or require iterative laboratory screening, mRNAutilus generates complete, ready-to-synthesize transcripts zero-shot.

Key Contributions

First multi-objective generative model for full-length mRNA: Simultaneous codon optimization and de novo UTR design in a single framework.
Masked diffusion + MCTG: A gradient-free approach that explores Pareto-optimal solutions across half-life, translation efficiency, and protein abundance.
14.2M mRNA pretraining corpus: Trained on the largest curated dataset of vertebrate mRNA from 342 species via Ensembl.
Competitive embeddings at 150M parameters: Property prediction matching or exceeding models up to 45× larger.
400× wild-type expression: Zero-shot Fluc mRNAs outperform commercial and ML-designed baselines across multiple cell lines.
Clinically relevant designs: Zero-shot spike mRNAs exceed BNT162b2 and commercial constructs, with improved durability.
Broad therapeutic applicability: Demonstrated on prime editing and ubiquibodies for proteome modulation.

Future directions include incorporating additional property regressors for immunogenicity and tissue-specific expression, integrating manufacturing constraints directly into the guidance procedure, exploring non-uniform reward weighting, and coupling with high-throughput screening for design-build-test-learn cycles. The framework is gradient-free and practical even in low-data settings, opening the door to rapid mRNA therapeutic design across diverse biological applications.

Key Insight

mRNAutilus is a significant step toward a unified framework for controllable, multi-objective-guided, full-length mRNA therapeutic design.

References

Pardi, N., Hogan, M. J., Porter, F. W. & Weissman, D. mRNA vaccines — a new era in vaccinology. Nat. Rev. Drug Discov. 17(4), 261–279 (2018). doi:10.1038/nrd.2017.243
Park, J. W., Lagniton, P. N. P., Liu, Y. & Xu, R.-H. mRNA vaccines for COVID-19: what, why and how. Int. J. Biol. Sci. 17(6), 1446–1460 (2021). doi:10.7150/ijbs.59233
Austin, J., Johnson, D. D., Ho, J., Tarlow, D. & van den Berg, R. Structured denoising diffusion models in discrete state-spaces. NeurIPS (2021). arXiv:2107.03006
Sahoo, S. S., Arriola, M., Schiff, Y., Gokaslan, A., Marroquin, E., Chiu, J. T., Rush, A. & Kuleshov, V. Simple and effective masked diffusion language models. NeurIPS (2024). arXiv:2406.07524
Dyer, S. C., Austine-Orimoloye, O., Azov, A. G., Barba, M., Barnes, I., Barrera-Enriquez, V. P., Becker, A., Bennett, R., Beracochea, M., Berry, A. et al. Ensembl 2025. Nucleic Acids Res. 53(D1), D948–D957 (2025). doi:10.1093/nar/gkae1071
Tang, S., Zhang, Y. & Chatterjee, P. PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion. Proc. 42nd Int. Conf. on Machine Learning (2025). arXiv:2412.17780
Zhang, H., Liu, H., Xu, Y., Huang, H., Liu, Y., Wang, J., Qin, Y., Wang, H., Ma, L., Xun, Z. et al. Deep generative models design mRNA sequences with enhanced translational capacity and stability. Science 390(6773), eadr8470 (2025). doi:10.1126/science.adr8470
Agarwal, V. & Kelley, D. R. The genetic and biochemical determinants of mRNA degradation rates in mammals. Genome Biology 23(1), 245 (2022). doi:10.1186/s13059-022-02811-x
Zheng, D., Persyn, L., Wang, J., Liu, Y., Ulloa-Montoya, F., Cenik, C. & Agarwal, V. Predicting the translation efficiency of messenger RNA in mammalian cells. Nature Biotechnology, 1–14 (2025). doi:10.1038/s41587-025-02712-x
Eichhorn, S. W., Guo, H., McGeary, S. E., Rodriguez-Mias, R. A., Shin, C., Baek, D., Hsu, S.-h., Ghoshal, K., Villén, J. & Bartel, D. P. mRNA destabilization is the dominant effect of mammalian microRNAs by the time substantial repression ensues. Molecular Cell 56(1), 104–115 (2014). doi:10.1016/j.molcel.2014.08.028
Portnoff, A. D., Stephens, E. A., Varner, J. D. & DeLisa, M. P. Ubiquibodies, synthetic E3 ubiquitin ligases endowed with unnatural substrate specificity for targeted protein silencing. J. Biol. Chem. 289(11), 7844–7855 (2014). doi:10.1074/jbc.M113.544825

Why mRNA DesignNeeds a New Approach

mRNAutilus: Masked DiffusionMeets Tree Search

Core Architecture

Property Prediction from Embeddings

Navigating the Pareto FrontThrough Tree Search

How MCTG Works

Biological Constraints During Search

Generated mRNAs MatchNatural Distributions

Zero-Shot mRNAs ThatOutperform the State of the Art

P. pyralis Luciferase (Fluc)

SARS-CoV-2 Spike Glycoprotein

PEMax Prime Editor

Ubiquibody (uAb) Degraders

Competitive with Models45× Its Size

Programmable Methods forGenetic Medicines

Key Contributions

References

Why mRNA Design
Needs a New Approach

mRNAutilus: Masked Diffusion
Meets Tree Search

Navigating the Pareto Front
Through Tree Search

Generated mRNAs Match
Natural Distributions

Zero-Shot mRNAs That
Outperform the State of the Art

Competitive with Models
45× Its Size

Programmable Methods for
Genetic Medicines