Figure 2. Computed phylogenetic relationships among known1a,c
plant oxidosqualene cyclases closely related to CAMS1. The
nucleotide sequences of these enzymes were aligned with the
MegAlign program (DNASTAR, Inc., Madison, WI) by the Clustal
W method. The alignment was used to create a phylogenetic tree
rooted with the A. thaliana CAS1 using PAUP 4.05b. The tree was
generated using the bootstrap method with 100 replicates with equal
weight given to all the characters and maximum likelihood as the
optimality criterion. Bootstrap values are listed at nodes. Cyclases
are grouped by color according to phylogeny and product structure.
For the full species names, see Figure 3.
Figure 3. Partial amino acid alignment of oxidosqualene
cyclases.1a,c,15 An asterisk (*) designates position 484 in CAMS1,
where steric bulk impacts B-ring formation.
elevated amounts of At1g78955 mRNA in Arabidopsis
inflorescence tissue.16
pentacycles â-amyrin (72-75% identical) and lupeol (59-
61% identical).1a,13 One might imagine that cyclases that
polycyclize oxidosqualene evolved from enzymes that form
smaller ring systems by iterative addition of motifs that favor
additional rings. However, phylogenetic analysis indicates
the reverse evolutionary order, i.e., CAMS1 is a descendent
of enzymes that form pentacyclic triterpenoids.
Monocyclic triterpenoids appear to be distributed sparsely
across the vast diversity of higher plants. Compounds 2 and
3 have been found in a handful of asterids (Camellia
sasanqua,8,17a Camellia japonica,8,17a Achillea odorata,9
Bupleurum spinosum,17b and Santolina elegans17c), two
eurosids (Euphorbia antiquorum17d and Garcinia speciosa17e),
the monocots wheat and rice,17a and the fern Polypodiodes
formosana.17f Because monocyclic and polycyclic triterpenes
have rather different spectral and chromatographic sig-
natures, most monocycles were described as components of
plant oils (usually lacking polycyclic triterpenes) rather than
in surveys of triterpene distribution. We suspect that the
monocyclic triterpenes are more widespread than the litera-
ture suggests.
How often have cyclases that generate A-ring monocycles
evolved? This question can be addressed by considering gene
divergences (Figure 2) in the context of organismal relation-
ships. CAMS1 diverged relatively recently from LUP4 within
the LUP clade. LUP enzymes appear to be unique to malvids
(eurosids II). We conclude that CAMS1 arose within the
malvids and is evolutionarily distinct from the unknown
enzymes that generate monocyclic triterpenes in asterids,
monocots, and ferns.18 The phylogenetic analyses suggest
Sequence alignments show that nearly all plant cyclases
contain valine or isoleucine at the position corresponding
to Ala484 in CAMS1 (Figure 3). Previous mutational
studies11a,14 suggested that decreased steric bulk at this
position (notably mutation to alanine or glycine) promotes
the formation of monocycles. Our CAMS1 results strongly
support this proposal. An uncharacterized cyclase from
Betula platyphylla (OSCBPD)15 also encodes alanine at this
position and may likewise be compromised in B-ring
formation.
The overwhelming dominance of 2 in the product profile
of CAMS1 suggests that camelliol C or a metabolite
thereof provides a competitive advantage that achilleol A
does not replicate. The biological role of enzymes and their
products can often be illuminated by microarray data.
Unfortunately, the initial annotation of the Arabidopsis
genome included At1g78955 (CAMS1) as a fusion with
At1g78950 (LUP4), and these genes were not distinguished
in early microarrays. The limited available nonarray expres-
sion data (Arabidopsis MPSS Plus: Gene Analysis) show
(17) (a) Akihisa, T.; Koike, K.; Kimura, Y.; Sashida, N.; Matsumoto,
T.; Ukiya, M.; Nikaido, T. Lipids 1999, 34, 1151-1157. (b) Barrero, A.
F.; Haidour, A.; Munoz-Dorado, M.; Akssira, M.; Sedqui, A.; Mansour, I.
Phytochemistry 1998, 48, 1237-1240. (c) Barrero, A. F.; Alvarez-
Manzaneda, E. J.; Herrador, M. M.; Alvarez-Manzaneda, R.; Quilez, J.;
Chahboun, R.; Linares, P.; Rivas, A. Tetrahedron Lett. 1999, 40, 8273-
8276. (d) Akihisa, T.; Wijeratne, E. M. K.; Tokuda, H.; Enjo, F.; Toriumi,
M.; Kimura, Y.; Koike, K.; Nikaido, T.; Tezuka, Y.; Nishino, H. J. Nat.
Prod. 2002, 65, 158-162. (e) Rukachaisirikul, V.; Pailee, P.; Hiranrat, A.;
Tuchinda, P.; Yoosook, C.; Kasisit, J.; Taylor, W. C.; Reutrakul, V. Planta
Med. 2003, 69, 1141-1146. (f) Arai, Y.; Hirohara, M.; Ageta, H.; Hsu, H.
Y. Tetrahedron Lett. 1992, 33, 1325-1328.
(13) (a) Still more distantly related are cyclases of the Arabidopsis PEN
clade,1a three of which make fewer than four rings. (b) Protein sequence
identities were calculated in MegAlign from pairwise alignments.
(14) Matsuda, S. P. T.; Darr, L. B.; Hart, E. A.; Herrera, J. B. R.;
McCann, K. E.; Meyer, M. M.; Pang, J.; Schepmann, H. G. Org. Lett. 2000,
2, 2261-2263.
(15) Zhang, H.; Shibuya, M.; Yokota, S.; Ebizuka, Y. Biol. Pharm. Bull.
2003, 26, 642-650.
Org. Lett., Vol. 9, No. 25, 2007
5225