Skip to main content

BackTranslate: Functions for reverse translation.

Project description

# Back translation This library provides functions for back translation from amino acids to nucleotides.

from __future__ import unicode_literals

from backtranslate.backtranslate import BackTranslate

# Create a class instance, optionally giving the translation table id. bt = BackTranslate() # Find all substitutions that transform the codon ‘TTG’ into a stop codon. substitutions = bt.with_dna(‘TGG’, ‘*’)

Sometimes we do not have access to the DNA sequence so we have to find possible substitutions from the amino acids directly.

# Find all substitutions that transform a Tryptophan into a stop codon. substitutions = bt.without_dna(‘W’, ‘*’)

To find out which substitution predictions can be improved by adding codon information, use the following function.

bt.improvable()

To get substitutions in a readable format, we can use the following:

from backtranslate.util import subst_to_cds

# Transform the substitutions to CDS coordinates. variants = subst_to_cds(substitutions, 12)

## Command line interface Use the command backtranslate to find substitutions that explain an amino acid change:

$ backtranslate with_dna -o 210 data/mhv.fa - 1 Leu 1 A C 1 A T

If no reference is available, use the without_dna subcommand:

$ backtranslate without_dna - Asp 92 Tyr 274 G T

The command find_stops finds a list of positions and substitutions that lead to stop codons. This list of destructive substitutions are useful when analysing a pool of viral transcripts. Counting the appropriate nucleotides at the given positions gives insight into how many transcripts are active.

$ backtranslate find_stops -o 210 data/mhv.fa - 216 A T 225 A T 230 C A 230 C G 243 A T …

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page