Journal of Medicinal Chemistry
Article
materials used were available from commercial sources. 1H NMR
spectra were recorded in the indicated deuterated solvent at 500
MHz. Purity of all compounds tested in biological assays was
determined to be ≥95% by LC-MS. Detailed synthesis procedures
In Vitro Cellular Assays for TAFIa Inhibition. The prepared
substance was tested for TAFIa inhibition using the Actichrome
plasma TAFI Activity Kit from American Diagnostica (Pr. No. 874).
This entailed adding 28 μL of assay buffer (20 mM HEPES, 150 mM
NaCl, pH 7.4) and 10 μL of TAFIa (American Diagnostica Pr. No.
874TAFIA; 2.5 μg/mL) to 2 μL of 2.5 mM DMSO solution of the
substance and incubating in a 96 half-well microtiter plate at room
temperature for 15 min. The enzyme reaction was started by adding
10 μL of TAFIa developer (prediluted 1:2 with assay buffer). The
time course of the reaction was followed at 420 nm in a microtiter
plate reader (SpectraMax plus 384; Molecular Devices) for 15 min.
The IC50 value was calculated from the averaged values (duplicate
determination) of serial dilutions of the substance with the aid of the
Softmax Pro software (version 4.8; Molecular Devices). Reported
values are mean ( SEM) IC50 from n = 2 values within a single
experiment.
Algorithmic Details of the Cyclizer. The cyclization method
proceeds from a complexed ligand to a series of cyclized ligands,
which have been geometrically screened. There are two “modes” for
the workflow: “small molecule” and “peptide”. Though there are many
similarities, some key differences made splitting of the two modes
more efficient. Either mode will attempt to construct geometrically
stable and chemically reasonable linkers to span the cyclization length
in a manner that does not negatively affect the complex structure (e.g.,
by introducing strain, clashes, or breaking good interactions). For
small molecules, there are typically fewer potential attachment points,
and a series of “spacers” to span the cyclization length, which can be
pieced together to fill the space. For peptides, the attachment points
are fixed but numerous (e.g., all side chains), and the linkers tend to
be of fixed length (e.g., known side-chain bridges). Figure 7 shows a
breakdown of the two modes.
For specific cases, it is possible to “mix” the peptide and small-
molecule approach. In a small-molecule scenario, for example, the
user can specify an explicit list of linkers (e.g., based on a list of
available chemical reagents) that will not be enumerated and pieced
together, but will be directly mapped onto the user-specified
attachment points of the ligand. For a peptide, on the other hand,
it is possible to evaluate other attachment points than the Cα−Cβ or
Cα−Hα vector, for example, for the introduction of head-to-side
chain, side chain-to-tail, and head-to-tail cyclizations or for a
rescaffolding of the backbone of a peptide within a macrocycle.
Detailed Description of the Workflow. Input. The user must
specify a complexed structure for cyclization, and it may be required
to identify the ligand (this is sometimes nontrivial for peptide
complexes as peptide ligands are chemically similar to proteins). The
user must specify a mode; for the small molecule, mode must specify
chemical linkers using SMILES patterns, and for the peptide, mode
must specify side-chain bridges either by name or by SMILES. All of
these inputs are sanity-checked to confirm no obvious problems.
Limb Recognition. For peptide cyclizations, typically all side-chain
Cα−Cβ and Cα−Hα vectors are considered as potential limbs. This
can be further controlled by specifying a subregion to target using
Schrodinger’s atom selection language (ASL). This may be especially
useful if there is a larger peptide system and there is some knowledge
of the system that would preclude some regions from cyclization. For
small molecules, users specify criteria for identifying potential branch
points (such as ASL, SMARTS, or manually selecting limbs for
cyclization).
Limb Pair Filtering. All combinations of limb pairs are enumerated
and filtered hierarchically to eliminate unphysical constructions. The
pairs are filtered if they either have too few bonds between them or
too few bonds per distance143 spanned to eliminate inefficient
cyclization. Then, the linker is conformationally sampled. Sub-
sequently, using the conformations, the remaining limbs are filtered
whether they match the limb distance and orientations seen in the
conformational ensemble.
Attach Linker Precursors. For small molecules, the entire
constructed linker is attached to one of the limbs. For peptides,
each limb side chain is mutated to either an L- or D-alanine (according
to the specified chirality), then portions of the linker are attached to
either side chain, leaving “dangling” side chains.
Close Linker. The dangling linkers are then closed by slowly
bringing the closing bond atoms closer to each other and finally
forming the bond. This process can be time-consuming as the bond
closing often runs into steric or strain issues.
Post Minimize and Filter. Finally, the system is minimized and
strain filters are applied to assure no highly strained linkers have been
built. The cyclized molecules are then output for further analysis.
Implementation. The macrocyclization script has been imple-
mented via command line in Schrodinger software as of release 2020-
1. To run, use $SCHRODINGER/run -FROM psp macrocyclize -h.
Molecular Dynamics Simulations. MD simulations were carried
out using an explicit solvent MD package, Desmond program (version
4.7, Desmond Molecular Dynamics System; D. E. Shaw Research,
New York, NY and version 3.1, Maestro-Desmond Interoperability
̈
Tools; Schrodinger) with inbuilt optimized potentials for liquid
simulation (OPLS 2.1) force field.144−146 The proteins were prepared
for simulation by first checking their correctness using the protein
preparation wizard tool and Epik module was used for deriving the
protonation states of the proteins at neutral pH. The system was
prepared by placing the proteins in a cubic box with periodic
boundary conditions specifying the shape and size of box as 10 Å × 10
Å × 10 Å distance. Predefined TIP3P water model was used as a
solvent, and the systems were neutralized by adding an appropriate
number of ions. The solvated systems were relaxed by implementing
steepest descent and the limited-memory Broyden−Fletcher−Gold-
farb−Shanno algorithms in a hybrid manner. The simulation was
performed under the NPT ensemble for 100 ns implementing the
Berendsen thermostat and barostat methods. The Nose−Hoover
thermostat algorithm147,148 was used to maintain a constant
temperature, and the Martyna−Tobias−Klein barostat algorithm149
was employed for maintaining 1 atm of pressure throughout the
simulation, respectively. The short-range coulombic interactions were
analyzed using the short-range method with a cutoff value of 9.0 Å.
The particle mesh Ewald (PME) method150 was used for treating the
long-range electrostatic interactions. All bonds involving hydrogen
atoms were constrained using the SHAKE algorithm.151
Enthalpic Scoring. For small molecules, binding poses of cyclized
molecules were predicted with Glide using integrating macrocycle ring
sampling.111 Enthalpic scoring of the protein−ligand complex was
done using the empirical scoring function GlideScore.118,119 For larger
systems, the cyclized molecules were minimized in the protein
environment, and the molecular mechanics-generalized Born surface
area (MM/GBSA) method120,121 was used as enthalpic score.
Conformational Scoring. Conformational scoring was per-
̈
formed using Schrodinger’s macrocycle conformational stability
script.107 Briefly, the script scores compounds on their propensity
to maintain the specified bioactive substructure given a sampled
ensemble. Here, macrocyclizations of small molecules were scored as
described in the paper, using Prime-MCS to generate a conforma-
tional ensemble, then using the stability script to estimate the
expected RMSD in the bioactive substructure using Boltzmann
weighting at 298.13 K. For peptide and protein systems, simple NPT
MD was run for 100 ns. The stability script here does not need to
Boltzmann weight since the conformers already obey the NPT
distribution. The bioactive substructure was defined uniquely for each
system as is recommended and the conformational score is provided
as average RMSD over the Cα atoms of the bioactive region.
Small-Molecule Linker Construction. To limit the combinatorics
of permuting linkers, a minimum and maximum linker length is
calculated using the recognized limbs. Then, linkers are enumerated
using combinations of the specified spacers that could potentially span
the distance within the range. These constructed linkers are then
filtered using simple chemical rules to avoid unchemical linkers (e.g.,
avoid −OO− or −NN− linkages).
K
J. Med. Chem. XXXX, XXX, XXX−XXX