130
K. KINO et al.
Sequence analysis and gene annotation. DNA manipulations were
(Methyl-Leu), L-arginine-hydroxamate (Arg-NHOH), and L-phenyl-
alanine methyl ester (Phe-OMe) were used as N- and C-protected
amino acid substrates. To test reactivity with dipeptides, a reaction was
carried out with L-valyl-L-valine (Val-Val) as the substrate. In addition,
a reaction with a combination of Val and L-arginyl-L-serine (Arg-Ser)
was conducted.
performed by the methods of Sambrook et al.,18) with minor
modifications. To analyze the DNA sequences of unknown regions
around rizA, a cassette polymerase chain reaction (PCR) method was
used.13,19) Sequence analysis was performed by the primer-walking
method with primers designed on the basis of sequencing results. To
predict the function of the protein encoded by each gene, a Pfam
search20) and a BLAST search21) were performed. To predict the
transcriptional start site, reverse transcription-PCR was performed
using the ReverTra-Plus kit (Toyobo, Osaka, Japan). The following
primers were used: R1, (50-CTCTTTTATTTGAGAGAACACGTTG-
30); F1, (50-TAATGCTGACTTCAACTGAACG-30); F2, (50-GCGAT-
TAGCATTTTAATACTGAACA-30); and F3, (50-GGACAGGAAA-
TGTCTTCTGATG-30). Total RNA was extracted from B. subtilis
NBRC3134 cultivated in the production medium for 2 d.13,22)
To examine the time course of peptide synthesis, a reaction mixture
containing 0.1 mg/ml of RizB, 20 mM Val, 50 mM ATP, and 25 mM
.
MgSO4 7H2O in 100 mM Tris–HCl buffer (pH 9.0) was prepared, and
this was incubated at 37 ꢀC. The reaction mixture was analyzed by
HPLC.
To characterize RizB, ATP was replaced with GTP, CTP, or TTP,
.
.
.
and MgSO4 7H2O was replaced with MnSO4 5H2O, CoSO4 7H2O,
.
.
.
ZnSO4 7H2O, FeSO4 7H2O, or CaSO4 2H2O. Val was used as the
substrate in these reactions, and the reaction mixtures were analyzed by
LC-ESI-MS. To determine optimum pH, 100 mM Tris–HCl buffer
(pH 7.0–10.0) and 100 mM borate buffer (pH 9.0–11.0) were used, and
the reaction with Val as a substrate was performed at 30 ꢀC for 1 h. To
determine the optimum temperature, the reaction temperature was set
between 20 ꢀC and 60 ꢀC, and the reaction with Val as substrate was
performed in 100 mM Tris–HCl buffer (pH 8.0) for 1 h. In each case,
optimal conditions were determined by measuring the amounts of
L-valyl-L-valyl-L-valine (Val-Val-Val) synthesized in each reaction
mixture.
Overexpression of rizB and preparation of RizB. The gene encoding
a putative L-amino acid ligase was designated rizB (DDBJ accession
no. AB467270). rizB-P1 and rizB-P2 were amplified from genomic
DNA of B. subtilis NBRC3134 by PCR, and the following primers
were used (restriction sites are underlined): P1 sense, (NdeI, 50-
GGAATTCCATATGCTGACTTCAACTGAACG-30); P1 anti, (Bam-
HI, 50-CGCGGATCCGATTTGAGAGAACACGTTGA-30); P2 sense,
(NdeI, 50-GGAATTCCATATGAGCATTTTAATACTGAACAAAA-
CCT-30); and P2 anti, (BamHI, 50-CGCGGATCCGATTTGAGAGAA-
CACGTTGA-30). The PCR fragments were digested with appropriate
restriction enzymes and then ligated into the pET-21a(+) expression
vector. The resulting plasmid was designed to express the gene with a
C-terminal His-tag sequence under the control of the T7 promoter. The
plasmids were introduced into E. coli BL21(DE3).
Results
Search for a gene encoding L-amino acid ligase
We formulated the hypothesis that the genes involved
in rhizocticin biosynthesis form a cluster, and that a gene
in this cluster encodes a novel L-amino acid ligase
that catalyzes tripeptide synthesis. Hence the DNA
sequences of unknown regions around rizA were
analyzed. We found a novel gene, located about
9,000 bp upstream of rizA, that encodes a protein
possessing an ATP-grasp motif. The gene starts from
an ATG codon and codes for 380 amino acid residues.
This open reading frame (ORF) was subsequently
designated rizB-P1 (Fig. 1, position 1). A BLAST
search additionally showed that the amino acid sequence
of RizB-P1 had 65% identity to that of a hypothetical
protein, BL02410 (DDBJ accession no. CP000002) in
B. licheniformis ATCC14580. The BL02410 protein has
405 amino acid residues and also possesses an ATP-
grasp motif. Alignment of the two amino acid sequences
showed the high degree of sequence conservation in the
upstream region of RizB-P1 and the N-terminal region
of the BL02410 protein (Fig. 1). Thus the ORF starting
from the ATT codon, which corresponds to the initial
Met residue of the BL02410 protein, was designated
rizB-P2 (Fig. 1, position 2). ATT codons generally code
for Ile, but some strains of bacteria and phages use ATT
as a start codon.25–29)
Furthermore, transcriptional analysis was performed
to predict the transcriptional start site of this gene.
Primers were designed as follows: R1 was designed on
the basis of the common C-terminal region of rizB-P1
and rizB-P2; F1 was designed on the basis of the region
containing the ATG codon of position 1; F2 was
designed on the basis of the region containing the
ATT codon of position 2; and F3 was designed on the
basis of an approximately 1,500-bp region upstream of
rizB-P2. RT-PCR analysis, however, showed that proper
DNA fragments were amplified in all cases (data not
shown), which suggests that this gene was transcribed
with a forward gene and might be transcribed as an
operon. In addition, sequence analysis was performed to
Recombinant E. coli cells were cultured in 3 ml of Luria-Bertani
medium (1% Bacto-tryptone, 0.5% yeast extract, 1% NaCl) contain-
ing 100 mg/ml of ampicillin (final concentration) at 37 ꢀC for 5 h with
shaking at 160 rpm. Cultivated cells were transferred to 100 ml of
fresh Luria-Bertani medium that contained 100 mg/ml of ampicillin
(final concentration) and were incubated at 37 ꢀC for 1 h with shaking
at 120 rpm. Isopropyl-ꢀ-D-thiogalactopyranoside (final concentration,
0.1 mM) was then added, and cultivation was continued at 25 ꢀC
for 19 h with shaking at 120 rpm. The cells were harvested by
centrifugation (4;160 ꢁ g, 10 min, 4 ꢀC), resuspended in 100 mM Tris–
HCl buffer (pH 8.0), and then disrupted by sonication at 4 ꢀC.
Cellular debris were removed by centrifugation (20;000 ꢁ g, 30 min,
4 ꢀC), and the supernatant was collected and purified with a HisTrap
HP Ni-affinity column (GE Healthcare, Buckinghamshire, UK). The
active fraction was then desalted with
a PD-10 column (GE
Healthcare) equilibrated with 100 mM Tris–HCl buffer (pH 8.0). To
confirm the presence of protein in the solution, sodium dodecyl
sulfate-polyacrylamide gel electrophoresis (SDS–PAGE) was per-
formed by the method of Laemmli.23) The protein concentration was
determined by the Bradford method, with bovine serum albumin as
the standard.
Characterization of RizB. The L-amino acid ligase activity of RizB
was assayed as follows, unless otherwise specified: The standard
reaction mixture (total volume, 0.3 ml) contained 0.1 mg/ml of RizB,
.
12.5 mM ATP, 12.5 mM MgSO4 7H2O, and 12.5 mM amino acid
substrates in 100 mM Tris–HCl buffer (pH 8.0). The reaction was
performed at 30 ꢀC for 20 h. To detect peptide synthesis activity, the
amount of phosphate released during the reaction was determined with
the Determiner L IP kit following the manufacturer’s protocol (Kyowa
Medex, Tokyo). To confirm peptide synthesis, reaction mixtures were
analyzed by liquid chromatography-electrospray ionization-mass spec-
trometry (LC-ESI-MS) (HPLC; Agilent 1100 series, Agilent Tech-
nologies, Santa Clara, CA; ESI-MS; LCQ Deca, Thermo Scientific,
Waltham, MA) and HPLC (L-2000 series; Hitachi High Technologies,
Tokyo). The details of analytical procedures are described in our
previous publications.13,24)
To determine substrate specificity, every combination of one or two
amino acids selected from 20 proteogenic amino acids was examined.
During analysis for homopeptide synthesis, the substrate concentration
used was 25 mM. To test reactivity with D-amino acids, reactions were
carried out with D-form amino acids. To test the reactivity with amino
acid derivatives, N-formyl-L-valine (Formyl-Val), N-methyl-L-leucine