Kinetic tRNA charging port #305

thalassemia · 2025-05-03T09:24:38Z

This PR ports the following commit to vEcoli: CovertLab/WholeCellEcoliRelease@bdfd19a

Currently, the kinetic tRNA charging model can only be run with growth rate control off:

"kinetic_trna_charging": true,
"mechanistic_aa_transport": false,
"aa_supply_in_charging": false,
"ppgpp_regulation": false,
"mechanistic_translation_supply": false

The average doubling time with the new model is close to the expected 44 minutes with operons off.

With operons on, the spread of doubling times increases dramatically, with many outliers above 1 hour.

*16 seeds, 32 generations (omit doubling times for first 8 generations)

Other tweaks

Converting between TUs and cistrons where necessary

Operons were added after the reference commit. As a result, some parts of the reference commit (example) require cistron data instead of TU data (example ported).

Discovering coding opal codons at runtime

The reference commit hard-coded the protein IDs and positions of opal codons that code for selenocysteine. I added some logic to determine this at runtime using the EcoCyc sequence data. At the time of this PR, my logic produced the exact same proteins and positions as the hard-coded values in the reference commit.

Adjusting TU boundaries to include TSS of first gene

Some of the adjusted genes are already noted on EcoCyc as unusual (example). My manual adjustments are just a temporary fix. We'll need to come up with a more robust long-term solution.

Making new ParCa and sim modules deterministic with PRNG seeding

There were some places in the code that used unseeded np.random or Cython rand(). This meant running the sim twice with the same inputs could yield different outputs. This lack of reproducibility made it near impossible to debug issues (especially rare ones like the reconciliation buffer issue discussed below).

Accounting for reconciliation buffer when building codon sequences

In the ParCa, codon sequences are stored in an array such that the longest polypeptide sequence has 30 codons worth of padding at the end. In the model, the reconciliation program is allowed to read ahead of a ribosome's current position by up to 32 codons. In the extremely rare case that a ribosome is just on the cusp of fully translating the longest polypeptide, reconciliation may try to read beyond the dimensions of the codon sequence array, raising an error.

Enabling multi-core tRNA charging parameter optimization

The reference commit adds a new ParCa option to optimize the kinetic tRNA charging parameters (optimize_trna_charging_kinetics). I modified this to use multiprocessing, with each process handling optimization for a single amino acid. Unfortunately, a handful of amino acids take an order of magnitude longer than the others. In my testing, most amino acids finish optimizing in a few minutes while a handful drag the process out for 30+ minutes.

Re-optimizing ParCa tRNA charging parameters

Because a lot has changed since the reference commit, I decided to re-run the optimization described above and commit the new parameters. We'll need to develop a standard protocol for re-optimizing parameters going forwards.

Next Steps

Investigate increased doubling time variance with operons turned on
Integrate with growth rate control

thalassemia added 6 commits May 3, 2025 02:31

Port kinetic tRNA charging model

82f6c41

Adjust TU boundaries to contain full genes

6b1b659

Use multiprocessing for tRNA charging optimization

74bcadc

Re-optimize tRNA charging params

6838aea

Make kinetic tRNA charging deterministic

69493e0

Add reconciliation buffer to codon sequences

9a93103

thalassemia force-pushed the trna_charging_final branch from 7c84cfe to 9a93103 Compare May 3, 2025 09:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kinetic tRNA charging port #305

Kinetic tRNA charging port #305

thalassemia commented May 3, 2025 •

edited

Loading

Kinetic tRNA charging port #305

Are you sure you want to change the base?

Kinetic tRNA charging port #305

Conversation

thalassemia commented May 3, 2025 • edited Loading

Other tweaks

Converting between TUs and cistrons where necessary

Discovering coding opal codons at runtime

Adjusting TU boundaries to include TSS of first gene

Making new ParCa and sim modules deterministic with PRNG seeding

Accounting for reconciliation buffer when building codon sequences

Enabling multi-core tRNA charging parameter optimization

Re-optimizing ParCa tRNA charging parameters

Next Steps

thalassemia commented May 3, 2025 •

edited

Loading