12904 J. Am. Chem. Soc., Vol. 122, No. 51, 2000
Communications to the Editor
cluster, are present in CSL12.1. A BLAST7 search did not identify
any deposited sequences with sequence similarity to these ORFs.
A BLAST search against deposited sequences found one se-
quence, a hypothetical protein (MJ1207) from Methanococcus
jannaschii,8 similar to ORF1. MJ1207 is a member of a small
superfamily of hypothetical proteins that are predicted N-acyl
transferases. Although ORF1 is only distantly related to this group
of hypothetical N-acyl transferases by sequence homology, its
functional similarity, the production of N-acyl products, argues
that the sequence homology could be significant.
Long-chain N-acyl aminoacylases have been characterized from
bacteria and appear to be widespread among microorganisms.9
However, no enzymes that catalyze the formation of these
compounds have been identified and the role of this growing
family of natural products remains unclear.10 With the observation
that these long-chain N-acyl amino acids possess antibacterial
activities, it is likely that long-chain N-acyl amino acids represent
a general class of microbial antibiotics. The identification of ORF1
as an N-acyl amino acid biosynthesis gene should aid in
determining the true role of this family of natural products as
well as help in the assignment of function to other hypothetical
N-acyl transferases from bacterial and archaeal sequencing
projects.
The heterologous expression of eDNA accesses the chemical
diversity of uncultured microorganisms. CSL12-A through
CSL12-M (1-13) are the first new biologically active natural
products that have been characterized from the heterologous
expression of environmental DNA.11 This heterologous expression
approach automatically couples biologically active natural prod-
ucts with their biosynthetic genes. In this case, the characterization
of the eDNA cloned in CLS12 identified the first biosynthesis
gene for long-chain N-acyl amino acids. The straightforward
strategy for constructing and screening large eDNA libraries that
we have described should provide ready access to many of the
natural products produced by previously inaccessible microorgan-
isms. This method can also be used for heterologous expression
in alternative host organisms using cosmid shuttle vectors and
such studies are currently underway.
Figure 1. Genome priming systems (GPS) transposon mutagenesis of
CSL12.1. Transposon insertions that knock out, reduce, or do not affect
the antibacterial activity produced by CSL12.1 are marked with red,
yellow, or green flags, respectively: (a) 3.5 kb insert from CSL12.1, (b)
expanded region containing the predicted promoter for ORF1, and (c)
proposed eDNA ribosome binding site (RBS), -35 and -10 sequences
aligned with the corresponding consensus sequences from E. coli.
on tyrosine. The N-acyl-tyrosine structure and the configuration
of the tyrosine in this series of compounds was ultimately
confirmed by the total synthesis of two representative examples:
CSL12-C (3), the most abundant compound isolated from
CSL12.1, and CSL12-G (7), one of the most active compounds
isolated from CSL12.1.5 Both 3 and 7 are spectroscopically
identical to their synthetic counterparts N-decanoyl-L-tyrosine and
N-myristoyl-L-tyrosine, respectively.
CSL12-A to CSL12-M (1-13) show varying degrees of
antibacterial activity, with the length of the aliphatic substitution
on tyrosine markedly affecting the antibacterial activity of these
natural products. In assays against B. subtilis, C13 to C16 saturated
and unsaturated N-acyl substituted L-tyrosines were the most
active. Tyrosine derivatives with side chains one and two carbons
longer or shorter were markedly less active and derivatives with
even shorter side chains were essentially inactive.
The genome priming system transposon mutagenesis (New
England BioLabs, MA) of the CSL12.1 insert indicated that a
single open reading frame (ORF1) produced the observed
antibacterial activity and permitted the complete sequencing of
the CSL12.1 insert.6 With two exceptions, all of the transposon
insertions that knock out the production of antibacterial activity
were found in ORF1 (Figure 1a). The remaining two knock-out
insertions were found in what is predicted to be the endogenous
promoter in the eDNA for ORF1 (Figure 1b,c). In addition to
transposons which completely knocked out the antibacterial
activity, we also found a group of insertions both upstream and
downstream of the proposed promoter but not in ORF1 that
reduced the antibacterial activity observed in agar plate assays
(Figure 1b). The transposon insertion studies suggest that ORF1
is responsible for the antibacterial activity and that the eDNA
contains its own promoter that is successfully used by E. coli. In
vitro biochemical studies are currently underway to confirm that
ORF1 is an N-acyl transferase and not functioning indirectly to
cause the production of N-acyl-tyrosine compounds in the E. coli
host.
Acknowledgment. We thank Jo Handelsman and Robert Goodman
for helpful discussions. Mass spectra were obtained in the Mass
Spectrometry Facility, Department of Chemistry and Chemical Biology,
Cornell University. This work was supported by the NIH (CA24487)
and the David and Lucile Packard Foundation.
Supporting Information Available: NMR spectral data for 3 and
characterization of the monounsaturated sides in 11, 12, and 13 (PDF).
JA002990U
(7) WU-BLAST (Washington University) 2.0.
(8) Bult, C. J.; et al. Science 1996, 272, 1058.
(9) Fukada, H.; Iwade, S.; Kimura, A. J. Biochem. 1982, 91, 1734. Shintani,
Y.; Fukuda, H.; Okamoto, N.; Murata, K.; Kumura, A. J. Biochem. 1984, 96,
6343. Matsumoto, J.; Nagai, S. J. Biochem. 1972, 72, 269.
Three additional hypothetical ORFs (>150 amino acids), which
do not appear to be organized as a secondary metabolite gene
(10) Pohnert, G.; Jung, V.; Haukioja, E.; Lempa, K.; Boland, W.
Tetrahedron 1999, 55, 11275. Alborn, H. T.; Turlings, T. C. J.; Jones, T. H.;
Stenhagen, G.; Loughrin, J. H.; Tumlinson, J. H. Science 1997, 276, 945.
Yagi, H.; Corzo, G.; Nakahara, T. Biochim. Biophys. Acta 1997, 1336, 28
and references therein. Yagi, H.; Corzo, G.; Yochochi, T.; Kamisaka, Y.;
Yamaoka, M.; Nakahara, J. Jpn. Oil Chem. Soc. 1995, 44, 635. Asselineau,
J. Prog. Chem. Org. Nat. Prod. 1991, 56, 1.
(5) A slight excess of the appropriate acid chloride was stirred with 20 mg
of L-tyrosine in 1 mL of DMF at room temperature. After 3 h the reaction
was diluted with 10 mL of 1 N HCl, extracted 3× with ethyl acetate, and the
synthetic N-acyl-tyrosines were purified by reversed-phase HPLC.
(6) The nucleotide sequence of CSL12.1 has been deposited in GenBank
under Accession Number AF324335.
(11) Wang, G.; Graziani, E.; Waters, B.; Pan, W.; Li, X.; McDermott, J.;
Meurer, G.; Saxena, G.; Anderson, R.; Davies, J. Org. Lett. 2000, 2, 2401.