Journal of Natural Products
Article
referenced to residual CDCl solvent signals with resonances at δH/C
similarity. Hits to library inputs were marked as squares. The size of
the node is representative of the number of MS spectra present for
3
2
7
.26/77.1. Semipreparative HPLC was performed on a Waters 515
pump system equipped with a Waters 996 PDA detector.
each parent mass. To visualize which compounds are shared between
Cyanobacteria. Moorea bouillonii PNG05-198 was collected in
May 2005 by scuba in 3−10 m water depth, where it was found
growing tangled around the coral Stylophora pistillata off Pigeon
Island, Papua New Guinea. M. producens 3L was originally collected
the three Moorea strains, the pie-chart-creating tool (nodeCharts
6
8
plugin for Cytoscape) was used. Pie slices are proportional to the
number of MS2 spectra for each parent mass and are therefore a proxy
for its relative quantity. To simplify the network, nodes created by
solvent background were removed from the network.
Isolation of Genomic DNA. Genomic DNA was isolated from live
cultures of M. producens JHB and M. bouillonii PNG using a standard
phenol/chloroform/isoamyl alcohol (PCI) extraction protocol. In
brief, two grams (wet weight) of cultured biomass was rinsed with
fresh SW BG-11 media, flash frozen in liquid nitrogen, and ground to a
fine powder using a prechilled mortar and pestle. The powder was
resuspended in 10 volumes of lysis buffer (10 mM Tris pH 8, 0.1 M
EDTA pH 8, 0.5% w/v SDS, and 20 μg/mL RNase) and incubated at
near CARMABI Research Station in Curac a̧ o, Netherlands Antilles.
Moorea producens JHB 22Aug96-1 was collected in Hector’s Bay,
Jamaica. Live cultures have been maintained in SW BG-11 media
under laboratory conditions.
Metabolic Profiling. Each strain was grown in SW BG-11 media.
After harvest, the cells were freeze-dried and extracted with (2:1, v/v)
CH Cl /MeOH. The extraction procedure was repeated until the
2
2
supernatant was colorless. After removal of the solvent, the extracts
were dissolved in CH CN. To remove compounds that would not
3
elute from reversed-phase column material, each extract was purified
using a disposable 100 mg reversed-phase C18 catridge (Bond Elut-C18
3
7 °C for 30 min. Proteinase K was added (100 μg/mL final
concentration), and samples were incubated at 50 °C for 1 h. After
cooling the samples to room temperature (rt), one volume of
equilibrated phenol (Life Technologies) was added and samples were
mixed for 10 min. Phases were separated by centrifugation at 3300 rcf
for 10 min, and the aqueous phase was removed to a new tube. The
phenol extraction was repeated twice followed by a chloroform/
isoamyl alcohol (24:1) (Life Technologies) extraction. DNA was
precipitated and purified by ethanol precipitation. Genomic DNA was
quantified using a NanoDrop (Thermo Scientific) and qualified by gel
electrophoresis. Genomic DNA was sequenced at the genomics core
facility at the University of Michigan using Illumina HiSeq and at the
University of California San Diego genomics facility using PACBIO
OH, 100 mg, 1 mL, Agilent Technologies). The column was
conditioned with CH CN, and then an aliquot of extract was added
3
to the column and eluted with 4 mL of CH CN. The solvent was
3
removed, and the residue dissolved in CH CN to give a 10 mg/mL
3
solution. The samples were analyzed via HPLC coupled to a Thermo
Finnigan LCQ Advantage Max mass spectrometer or an Agilent 6530
Accurate Mass Q-TOF MS system. The Thermo Finnigan mass
spectrometer was attached to a Thermo Finnigan Surveyor
Autosampler-Plus, a LC-Pump-Plus, and a PDA-Plus system. ESI
conditions were set to 325 °C capillary temperature, 5 kV source
voltage, and 69 psi sheath gas flow rate. Four scan events were set up:
positive total ion count of m/z 100−2000, followed by three data-
(
M. bouillonii PNG). Assembly was done using the SPAdes Genome
2
dependent MS scans of the first, second, and third most intense ions
32
Assember 3.0 followed by Opera to produce cyanobacterial scaffolds.
Further binning was performed to obtain cyanobacterial-specific
scaffolds.
from the first scan event. The collision energy was 35%, the minimum
5
intensity 10 counts, the isolation width m/z 2, the dynamic exclusion
count 5 with a repeat duration of 1 min, an exclusion list of 25, and an
exclusion duration of 1.5 min.
The Agilent 6530 Accurate Mass Q-TOF MS system was attached
to a 1290 Infinity Binary LC system (Agilent Technologies). ESI
conditions were set to 300 °C capillary temperature, 3.5 kV VCap, and
Bioinformatics Analysis. Biosynthetic gene clusters were
identified using the genome mining software programs AntiSMASH,
27,37,54
NaPDoS, and met2db.
NRPS adenylation domain substrate
71
predictions were made using NRPSpredictor2. Annotations were
refined using Delta BlastP to identify conserved domains.
55
1
0 L/min gas flow. The scan events were set up as follows: positive
2
Phylogenetic analysis was performed by comparing the N- and O-
total ion count of m/z 100−1700, followed by data-dependent MS
scans. The collision energy was ramped with a 0.5 slope and a 30
offset; three precursors per cycle were fragmented with a MS abs
threshold of 200, isolation width of m/z 1.3, and an active exclusion
count of 5 with a repeat duration of 0.5 min.
7
2
SBSPKS. The nucleotide sequence of the columbamide pathway
genes has been submitted to GenBank under the accession number
KP715425.
The same gradient was used on both machines. The separation was
performed using a Phenomenex Kinetex C-18 100 Å 100 × 4.6 mm
column with a 700 μL/min flow and a linear gradient from 5% A (A:
CH CN, B: 0.1% HCOOH in H O) for 3 min to 100% A in 45 min,
Structure Elucidation. About 5.5 g (wet weight) of M. bouillonii
PNG was repetitively extracted with CH Cl /MeOH (2:1, v/v),
2
2
yielding 389 mg of extract. Subsequently the extract was fractionated
by silica gel vacuum liquid chromatography (VLC) using a stepwise
gradient solvent system of increasing polarity beginning from 100%
hexanes to 100% EtOAc to 100% MeOH, yielding nine subfractions.
The fraction eluting with 40% EtOAc and 60% hexanes (fraction D,
3
2
which was maintained for 10 min. Afterward the column was
equilibrated back to starting conditions.
The data were converted to mzXML format, a text-based format for
mass spectrometry data by using MSConvert, part of the
69
3
.8 mg) was separated further using a 100 mg reversed-phase C18
ProteoWizard package, and uploaded to the Global Natural Products
Social Molecular Networking Web site. The mass spectrometry data
are available through MassIVE Public GNPS Data sets
solid-phase extraction cartridge (Bond Elut-C18 OH, 100 mg, 1 mL,
Agilent Technologies) and a stepwise gradient solvent system of 25%,
5
0%, 75%, and 100% CH CN/H O, respectively. The 75% eluting
(
MSV000079016, gnps.ucsd.edu). These data were analyzed using
3 2
41
fraction was further purified using reversed-phase HPLC: Phenomenex
Kinetex C-18 100 Å 100 × 10 mm column with a 4 mL/min flow and
the molecular networking workflow. The input data were searched
against annotated reference spectra of the MS library within GNPS.
The GNPS MS library contains ITMS and QTof reference spectra of
2
2
a linear gradient from 75% A (A: CH CN, B: H O) for 3 min to 95%
3 2
4
1
previously isolated compounds of the Gerwick laboratory.
A in 24 min. Afterward, the column was equilibrated back to starting
conditions. The yield of columbamide A was 2 mg, columbamide B 1
Computationally, the algorithms compare MS2 spectra by their
similarity and assign similarity scores. The basic underlying algorithm
mg, and columbamide C 0.5 mg.
3
0,40,41,68
23
was published in detail elsewhere.
For the network presented
Columbamide A (1): [α]
D
−11.3 (c 0.27, CHCl ); IR (neat) νmax
3
−1
1
13
in this paper, the parent mass peak tolerance was set to 2 Da and the
ion tolerance for mass fragments was set to 0.5 Da. The minimum
cluster size was set to 1, and the cosine score was set to 0.7. For
visualization, the created molecular networks were imported into the
2927, 2856, 1621, 1452, 1124, 1089, 1046 cm ; H and C NMR,
Table 4 and Supporting Information; HRESIMS m/z 466.2478 [M +
+
H] (calcd for C23
H
42Cl
NO
, 466.2485); MS/MS (CID 32.3%; [M +
2
4
+
H] ) m/z (%) 370 (100), 398 (30), 334 (27), 102 (14), 362 (6).
7
0
23
program Cytoscape 2.8.3. Each node was labeled with their
respective parent mass. The edges between nodes indicated the level
of similarity between nodes, with thicker lines indicating higher
Columbamide B (2): [α] 8.7 (c 0.77, CHCl ); IR (neat) νmax
2928, 2853, 1627, 1467 cm ; H and C NMR, Table 4 and
Supporting Information; HRESIMS m/z 500.2087 [M + H] (calcd
D 3
−1
1
13
+
I
J. Nat. Prod. XXXX, XXX, XXX−XXX