8566 J. Am. Chem. Soc., Vol. 120, No. 33, 1998
Communications to the Editor
Figure 4. Building blocks used in library synthesis.
Fourth, during the encoded library synthesis below,14 the ef-
fectiveness of carbene tagging was verified by analyzing beads
from every pool at each step of the synthesis.
Figure 3. Synthesis of 24 demonstration compounds and a 2.18 million
compound library.
A large encoded library was constructed by split-pool synthesis
beginning with attachment of two spacers, ꢀ-aminocaproic acid
and glycine, to the photolinker on resin. A third portion of the
resin was left without a spacer. The resin was pooled, divided
into two portions, and one enantiomer of epoxycyclohexenol
carboxylic acid 2 was coupled to each pool. After the resin was
pooled and divided into three portions, iodobenzyl nitrone acids
7b-d were coupled, resulting in a total of 18 tetracyclic scaffolds.
The synthesis was completed by reaction with 30 terminal alkynes,
then with 62 primary amines, and finally with 62 carboxylic acids
(Figures 3 and 4) with an additional skip codon at each step.15
This six-step reaction sequence resulted in a collection of
compounds calculated to contain 2.18 million distinct, spatially
separated, and encoded chemical entities. These synthetic
compounds are rigid, stereochemically defined, and structurally
diverse, characteristics common to many natural products.
Furthermore, the synthesis allows the controlled release of
compounds from the individual 90 µ supports into nanodroplets
containing engineered cells, features critical to the miniaturized
cell-based assays now being used to screen this library for cell
permeable, protein-binding ligands.3 Encouragingly, we have
already found that several members of this library activate a
reporter gene in mink lung cells.16 We note, however, that this
synthesis relies in part on simple acylation chemistry. Syntheses
benefiting from a greater range of mild and selective synthetic
methods may facilitate the routine discovery of compounds with
protein-binding properties rivaling those of natural products.
their palladium-mediated cross-coupling reactions were more
efficient than those involving iodophenyl tetracycles 5b-d. A
carefully controlled alkyne coupling reaction was used to convert
the aryl iodides (10) into aryl alkynes (11).10 Cycloreversion was
a persistent problem in uncatalyzed lactone aminolysis; however,
2-hydroxypyridine mediated the efficient reaction of primary
amines with the γ-butyrolactone to afford γ-hydroxyamides (12).11
After testing numerous acylating conditions, we found that the
unmasked secondary alcohol could be capped reliably with DIPC/
DMAP-activated carboxylic acids to give γ-acyloxy amides (13).12
Split-pool synthesis provides the theoretical means to synthesize
the large numbers of molecules likely required for chemical
genetic screens, where molecules replace mutations. However,
such syntheses present enormous analytical challenges. We have
developed a four-step protocol in order to provide maximum
confidence that a complex split-pool synthesis of encoded
molecules yields the anticipated products in high purity and
efficiency. First, to demonstrate the suitability of the entire
reaction sequence for library synthesis, we synthesized and fully
characterized 24 compounds,13 10a-f through 13a-f, with the
latter providing photocleavage products in acceptable 80-90%
purity. Second, potential building blocks at each step were tested
by reaction with a single selected substrate. Thus, 50 alkynes
were tested by reaction with 10a, 87 amines with 11a, and 98
acids with 12a. Of these, 23 alkynes, 54 amines, and 44 acids
reacted with g90% conversion and purity (LC-MS). These
building blocks, along with a limited number of less optimal
candidates (generally g70% conversion), were selected for
inclusion in library synthesis. Third, to verify effective synthesis
in a split-pool format, we generated a small test library with
building blocks carefully selected such that the products within
each final pool would have unique masses, allowing analysis by
LC-MS. Indeed, all of the expected 456 masses were detected.
Acknowledgment. We thank Dr. A. Tyler, J. Athanasopoulos, and
J. Lynch for expert mass spectral support.
Supporting Information Available: Acknowledgments, additional
discussion, complete experimental procedures, and analytical data for 2,
3, 6b-d, 7b-d, 10-13a-f, building block testing, test library synthesis,
large library synthesis, binary encoding (77 pages, print/PDF). See any
current masthead page for ordering information and Web access
instructions.
(10) (a) Sonogashira, K.; Tohda, Y.; Hagihara, N. Tetrahedron Lett. 1975,
4467-4470. (b) Young, J. K.; Nelson, J. C.; More, J. S. J. Am. Chem. Soc.
1994, 116, 10841-10842. (c) Collini, M. D.; Ellingboe, J. W. Tetrahedron
Lett. 1997, 38, 7963-7966. (d) Odingo, J.; Sharpe, B. A.; Oare, D., Presented
at the 213th National Meeting of the American Chemical Society, San
Francisco, CA, April, 1997; Paper ORGN 574.
JA981746O
(14) Binary encoding was performed by a modified version of the literature
procedure (see Supporting Information): Nestler, H. P.; Bartlett, P. A.; Still,
W. C. J. Org. Chem. 1994, 59, 4723-4724.
(15) Combs, A. P.; Kapoor, T. M.; Feng, S.; Chen, J. K.; Daude´-Snow, L.
F.; Schreiber, S. L. J. Am. Chem. Soc. 1996, 118, 287-288.
(16) Compounds with the general structure 13 in Figure 3, with no spacer,
R1 ) carbon-based substituents attached to an alkyne substituted at the meta
position (see S46 in Supporting Information), R2 ) 3,4-dimethoxybenzyl, and
R3 ) methoxymethyl, were found to activate the reporter plasmid p3TPLux
in a stably transfected mink lung cell line, both independently and synergisti-
cally with TGF-â1 (B. R. Stockwell and S. L. S., unpublished results and
Curr. Biol. 1998, 8, 761-770.).
(11) (a) Rony, P. R. J. Am. Chem. Soc. 1969, 91, 6090-6096. (b)
Openshaw, H. T.; Whittaker, N. J. Chem. Soc. C 1969, 89-91.
(12) (a) Atuegbu, A.; Maclean, D.; Nguyen, C.; Gordon, E. M.; Jacobs, J.
W. Bioorg. Med. Chem. 1996, 4, 1097-1106. (b) Hydroxyl groups introduced
during the alkyne coupling step were also acylated efficiently under these
conditions.
(13) 1H NMR with complete peak assignments by extensive homonuclear
decoupling and/or DQF-COSY, TOCSY, and NOESY experiments, FAB-
MS, HRMS, HPLC, TLC.