Non-proteinogenic amino acids

From Infogalactic: the planetary knowledge core
Proteinogenic amino acids are a small fraction of all amino acids

In biochemistry, non-coded, non-proteinogenic, or "unnatural" amino acids are those not naturally encoded or found in the genetic code of any organism. Although the translational machinery uses only 23 amino acids (21 in eukaryotes[note 1]) to assemble proteins (the proteinogenic amino acids), over 140 natural amino acids are known and thousands more combinations are possible.[1] Several non-proteinogenic amino acids are noteworthy, for example as metabolic intermediates, as post-translationally formed residues in proteins, as components of bacterial cell walls, as neurotransmitters or toxins, or as constituents of meteorites and prebiotic experiments.

Definition by negation

Technically, any organic compound with an amine (-NH2) and a carboxylic acid (-COOH) functional group is an amino acid. The proteinogenic amino acids are a small subset of this group: each possesses a central carbon atom (α- or 2-) bearing an amino group, a carboxyl group, a side chain and an α-hydrogen, in the L configuration. The exceptions are glycine, which is achiral, and proline, whose amine group is a secondary amine; proline is consequently often called an imino acid for traditional reasons, although it is not an imine.

The genetic code encodes 20 standard amino acids. However, there are three extra proteinogenic amino acids: selenocysteine, pyrrolysine and N-formylmethionine. The first two do not have a dedicated codon, but are inserted in place of a stop codon when a specific sequence is present: the UGA codon with a SECIS element for selenocysteine, and the UAG codon with a PYLIS downstream sequence for pyrrolysine.[2][3] N-Formylmethionine is encoded by the start codon AUG in bacteria, mitochondria and chloroplasts, but is often removed post-translationally.[4]
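The stop-codon recoding described above can be illustrated with a minimal sketch. The codon table is deliberately tiny, and the `has_secis` and `has_pylis` flags are hypothetical stand-ins for the presence of the SECIS or PYLIS element in the transcript; real recoding depends on the position and structure of those elements, which this sketch does not model.

```python
# Tiny excerpt of the standard genetic code, enough for the demonstration.
STANDARD = {"AUG": "Met", "GGU": "Gly", "UGU": "Cys", "AAA": "Lys"}
STOP_CODONS = {"UAA", "UAG", "UGA"}


def translate(mrna, has_secis=False, has_pylis=False):
    """Translate an mRNA string codon by codon.

    has_secis / has_pylis are hypothetical flags (an assumption of this
    sketch) meaning that the recoding element is present in the transcript.
    """
    peptide = []
    usable = len(mrna) - len(mrna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        codon = mrna[i:i + 3]
        if codon == "UGA" and has_secis:
            peptide.append("Sec")  # selenocysteine replaces the stop
        elif codon == "UAG" and has_pylis:
            peptide.append("Pyl")  # pyrrolysine replaces the stop
        elif codon in STOP_CODONS:
            break  # ordinary termination
        else:
            peptide.append(STANDARD.get(codon, "Xaa"))  # Xaa = not in excerpt
    return peptide


plain = translate("AUGGGUUGAAAA")
recoded = translate("AUGGGUUGAAAA", has_secis=True)
```

Without the flag, translation stops at UGA; with it, the same codon is read as selenocysteine and translation continues.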

There are various groups of amino acids:[5]

  • 20 standard amino acids
  • 23 proteinogenic amino acids
  • over 80 amino acids created abiotically in high concentrations
  • about 900 are produced by natural pathways
  • over 118 engineered amino acids have been placed into protein

These groups overlap, but are not identical. All 23 proteinogenic amino acids are biosynthesised by organisms, but some of them, such as histidine, are not abiotic (that is, not found in prebiotic experiments and meteorites). Many amino acids, such as ornithine, are metabolic intermediates produced biotically, but not coded. Others, such as citrulline, occur only as metabolic intermediates. Others still, such as α-methylnorvaline, are found solely in abiotic mixes. Over 30 unnatural amino acids have been translationally inserted into protein in engineered systems, yet are not biosynthetic.[5]
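The overlap between these groups can be pictured with set operations. The memberships below are illustrative only: they contain just the examples named in the text plus glycine and alanine, and are not a complete classification.

```python
# Illustrative memberships only, drawn from the examples in the text.
proteinogenic = {"glycine", "alanine", "histidine"}
biosynthesised = {"glycine", "alanine", "histidine", "ornithine", "citrulline"}
abiotic = {"glycine", "alanine", "α-methylnorvaline"}

# Proteinogenic but not found abiotically.
coded_not_abiotic = proteinogenic - abiotic

# Made by organisms but not coded.
biotic_not_coded = biosynthesised - proteinogenic

# Found only in abiotic mixes.
abiotic_only = abiotic - biosynthesised

# Every proteinogenic amino acid is biosynthesised.
all_coded_are_biotic = proteinogenic <= biosynthesised
```

The subset test at the end expresses the one strict containment the text states; the other groups merely overlap.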

Nomenclature

The IUPAC numbering system differentiates the carbons in an organic molecule by sequentially assigning a number to each, including the carbon of a carboxylic group. In addition, the carbons along the side chain of amino acids can be labelled with Greek letters. Here the carbon of the carboxylic group is not counted: the α-carbon is the central chiral carbon bearing the carboxyl group, a side chain and, in α-amino acids, an amino group.[6] (Consequently, the IUPAC names of many non-proteinogenic α-amino acids start with 2-amino- and end in -ic acid.)

Natural, but non L-α-amino acids

Most natural amino acids are α-amino acids in the L configuration, but some exceptions exist.

Non-alpha

Comparison of the structures of alanine and beta alanine.

Some non-α amino acids exist in organisms. In these structures, the amine group is displaced further from the carboxylic acid end of the molecule. Thus a β-amino acid has the amine group bonded to the second carbon away from the carboxyl group, and a γ-amino acid has it on the third. Examples include β-alanine, GABA, and δ-aminolevulinic acid.
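The relationship between the Greek-letter labels and IUPAC locants (where the carboxyl carbon is C1) can be sketched as a simple offset. The function name `greek_position` is hypothetical, and the locants in the example are taken from the systematic names of the compounds.

```python
GREEK = "αβγδε"


def greek_position(amino_locant):
    """Map the IUPAC locant of the amine-bearing carbon to a Greek letter.

    The carboxyl carbon is C1 and is not counted in the Greek scheme,
    so locant 2 is α, locant 3 is β, and so on.
    """
    index = amino_locant - 2
    if index < 0 or index >= len(GREEK):
        raise ValueError(f"locant {amino_locant} is outside the range covered here")
    return GREEK[index]


# Locant of the amino group in each systematic name.
examples = {
    "alanine (2-aminopropanoic acid)": 2,
    "β-alanine (3-aminopropanoic acid)": 3,
    "GABA (4-aminobutanoic acid)": 4,
    "δ-aminolevulinic acid (5-amino-4-oxopentanoic acid)": 5,
}
labels = {name: greek_position(locant) for name, locant in examples.items()}
```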

The use of α-amino acids in proteins has been linked to their frequency in meteorites and prebiotic experiments.[7] An initial speculation that β-amino acids have deleterious effects on secondary structure[7] turned out to be incorrect.[8]

D-amino acids

Some amino acids have the opposite absolute chirality and are not available from the normal ribosomal translation machinery. Most bacterial cell walls are formed by peptidoglycan, a polymer composed of amino sugars crosslinked with short oligopeptides bridged between each other. The oligopeptide is non-ribosomally synthesised and contains several peculiarities, including D-amino acids, generally D-alanine and D-glutamate. A further peculiarity is that the former is racemised by a PLP-binding enzyme (encoded by alr or the homologue dadX), whereas the latter is racemised by a cofactor-independent enzyme (murI). Some variants exist: in Thermotoga spp. D-lysine is present, and in certain vancomycin-resistant bacteria D-serine is present (vanT gene).[9][10]

In animals, some D-amino acids are neurotransmitters.

Without a hydrogen on the α-carbon

All proteinogenic amino acids have at least one hydrogen on the α-carbon. Glycine has two hydrogens, and all others have one hydrogen and one side-chain. Replacement of the remaining hydrogen with a larger substituent, such as a methyl group, distorts the protein backbone.[7]

In some fungi α-aminoisobutyric acid is produced as a precursor to peptides, some of which exhibit antibiotic properties.[11] This compound is similar to alanine, but possesses an additional methyl group on the α-carbon instead of a hydrogen. It is therefore achiral. Another compound similar to alanine without an α-hydrogen is dehydroalanine, which possesses a methylene side chain. It is one of several naturally occurring dehydroamino acids.

Twin amino acid stereocentres

A subset of L-α-amino acids are ambiguous as to which of two ends is the α-carbon. In proteins a cysteine residue can form a disulfide bond with another cysteine residue, thus crosslinking the protein. Two crosslinked cysteines form a cystine molecule. Cysteine and methionine are generally produced by direct sulfurylation, but in some species they can be produced by transsulfuration, where the activated homoserine or serine is fused to a cysteine or homocysteine, forming cystathionine. A similar compound is lanthionine, which can be seen as two alanine molecules joined via a thioether bond and is found in various organisms. Similarly, djenkolic acid, a plant toxin from jengkol beans, is composed of two cysteines connected by a methylene group. Diaminopimelic acid is used both as a bridge in peptidoglycan and as a precursor to lysine (via its decarboxylation).

Prebiotic amino acids and alternative biochemistries

In meteorites and in prebiotic experiments (e.g. the Miller–Urey experiment) many more amino acids than the twenty standard amino acids are found, several of them at higher concentrations than the standard ones. It has been conjectured that if amino acid based life were to arise in parallel elsewhere in the universe, no more than 75% of the amino acids would be in common.[7] The most notable anomaly is the lack of aminobutyric acid.

Proportion of amino acids relative to glycine (%)

Molecule | Electric discharge | Murchison meteorite
Glycine | 100 | 100
Alanine | 180 | 36
α-Amino-n-butyric acid | 61 | 19
Norvaline | 14 | 14
Valine | 4.4 | -
Norleucine | 1.4 | -
Leucine | 2.6 | -
Isoleucine | 1.1 | -
Alloisoleucine | 1.2 | -
t-Leucine | < 0.005 | -
α-Amino-n-heptanoic acid | 0.3 | -
Proline | 0.3 | 22
Pipecolic acid | 0.01 | 11
α,β-Diaminopropionic acid | 1.5 | -
α,γ-Diaminobutyric acid | 7.6 | -
Ornithine | < 0.01 | -
Lysine | < 0.01 | -
Aspartic acid | 7.7 | 13
Glutamic acid | 1.7 | 20
Serine | 1.1 | -
Threonine | 0.2 | -
Allothreonine | 0.2 | -
Methionine | 0.1 | -
Homocysteine | 0.5 | -
Homoserine | 0.5 | -
β-Alanine | 4.3 | 10
β-Amino-n-butyric acid | 0.1 | 5
β-Aminoisobutyric acid | 0.5 | 7
γ-Aminobutyric acid | 0.5 | 7
α-Aminoisobutyric acid | 7 | 33
Isovaline | 1 | 11
Sarcosine | 12.5 | 7
N-Ethyl glycine | 6.8 | 6
N-Propyl glycine | 0.5 | -
N-Isopropyl glycine | 0.5 | -
N-Methyl alanine | 3.4 | 3
N-Ethyl alanine | < 0.05 | -
N-Methyl β-alanine | 1.0 | -
N-Ethyl β-alanine | < 0.05 | -
Isoserine | 1.2 | -
α-Hydroxy-γ-aminobutyric acid | 17 | -
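The claim that some non-standard amino acids occur at higher concentrations than standard ones can be checked against the table with a short sketch. Only a subset of rows that report both values is included, and the helper name `standards_exceeded` is hypothetical.

```python
# (electric discharge, Murchison meteorite), % relative to glycine.
# Subset of the table: only rows that report both values.
STANDARD = {
    "Glycine": (100, 100),
    "Alanine": (180, 36),
    "Proline": (0.3, 22),
    "Aspartic acid": (7.7, 13),
    "Glutamic acid": (1.7, 20),
}
NON_STANDARD = {
    "α-Aminoisobutyric acid": (7, 33),
    "Sarcosine": (12.5, 7),
    "Isovaline": (1, 11),
    "β-Alanine": (4.3, 10),
}
ELECTRIC, MURCHISON = 0, 1


def standards_exceeded(name, column):
    """Return the standard amino acids that `name` outnumbers in `column`."""
    value = NON_STANDARD[name][column]
    return sorted(s for s, row in STANDARD.items() if row[column] < value)


aib_in_meteorite = standards_exceeded("α-Aminoisobutyric acid", MURCHISON)
sarcosine_in_discharge = standards_exceeded("Sarcosine", ELECTRIC)
```

In this subset, α-aminoisobutyric acid is more abundant in the Murchison meteorite than proline, aspartic acid and glutamic acid, and sarcosine exceeds the same three in the electric discharge column.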

Straight side chain

The genetic code has been described as a frozen accident, and the reason why there is only one standard amino acid with a straight chain (alanine) could simply be redundancy with valine, leucine and isoleucine.[7] However, straight-chain amino acids are reported to form much more stable alpha helices.[12]

Chalcogen

Serine, homoserine, O-methyl-homoserine and O-ethyl-homoserine possess a hydroxymethyl, hydroxyethyl, O-methyl-hydroxyethyl and O-ethyl-hydroxyethyl side chain, respectively, whereas cysteine, homocysteine, methionine and ethionine possess the thiol equivalents. The selenol equivalents are selenocysteine, selenohomocysteine, selenomethionine and selenoethionine. Amino acids with the next chalcogen down are also found in nature: several species, such as Aspergillus fumigatus, Aspergillus terreus, and Penicillium chrysogenum, are able in the absence of sulfur to produce tellurocysteine and telluromethionine and incorporate them into protein.[13]

Hydroxyglycine, an amino acid with a hydroxyl side-chain, is highly unstable.

Expanded genetic code

Roles

In cells, especially autotrophs, several non-proteinogenic amino acids are found as metabolic intermediates. However, despite the catalytic flexibility of PLP-binding enzymes, many amino acids are synthesised as keto-acids (e.g. 4-methyl-2-oxopentanoate to leucine) and aminated in the last step, thus keeping the number of non-proteinogenic amino acid intermediates fairly low.

Ornithine and citrulline occur in the urea cycle, part of amino acid catabolism (see below).[14]

In addition to primary metabolism, several non-proteinogenic amino acids are precursors or final products in secondary metabolism, which makes small compounds or non-ribosomal peptides (such as some toxins).

Post-translationally incorporated into protein

Despite not being encoded by the genetic code as proteinogenic amino acids, some non-standard amino acids are nevertheless found in proteins. These are formed by post-translational modification of the side chains of standard amino acids present in the target protein. These modifications are often essential for the function or regulation of a protein; for example, in Gamma-carboxyglutamate the carboxylation of glutamate allows for better binding of calcium cations,[15] and in hydroxyproline the hydroxylation of proline is critical for maintaining connective tissues.[16] Another example is the formation of hypusine in the translation initiation factor EIF5A, through modification of a lysine residue.[17] Such modifications can also determine the localization of the protein, e.g., the addition of long hydrophobic groups can cause a protein to bind to a phospholipid membrane.[18]

There is some preliminary evidence that aminomalonic acid may be present, possibly by misincorporation, in protein.[19][20]

Toxic analogues

Several non-proteinogenic amino acids, such as thialysine, are toxic because they mimic certain properties of proteinogenic amino acids. Some non-proteinogenic amino acids are neurotoxic because they mimic amino acids used as neurotransmitters (i.e. not for protein biosynthesis), e.g. quisqualic acid, canavanine or azetidine-2-carboxylic acid.[21] Cephalosporin C has an α-aminoadipic acid (homoglutamate) backbone that is amidated with a cephalosporin moiety.[22] Penicillamine is a therapeutic amino acid whose mode of action is unknown.

Naturally-occurring cyanotoxins can also include non-proteinogenic amino acids. Microcystin and nodularin, for example, are both derived from ADDA, a β-amino acid.

Not amino acids

Taurine is an amino sulfonic acid and not an amino acid. It is nevertheless occasionally treated as one, because the amounts required to suppress the auxotrophy in certain organisms (e.g. cats) are closer to those of "essential amino acids" (amino acid auxotrophy) than to those of vitamins (cofactor auxotrophy).

The osmolytes sarcosine and glycine betaine are derived from amino acids, but have a secondary and a quaternary amine, respectively.

Notes

  1. Plus formylmethionine in eukaryotes with organelles of prokaryotic origin, such as mitochondria.