Significance of the human genome sequence to drug discovery
O Grenet
11
Evolution and Applications. Oxford Univer-
sity Press, 1999.
Mullikin JC, Hunt SE, Cole CG, Mortimore
BJ, Rice CM, Burton J et al. An SNP map of
human chromosome 22. Nature 2000; 407:
516–520.
map of human genome sequence variation
containing 1.42 million single nucleotide
polymorphisms. Nature 2001; 409: 928–
934.
Collins A, Lonjou C, Morton NE. Genetic
epidemiology of single-nucleotide polymor-
phisms. Proc Natl Acad Sci U S A 1999; 96:
15173–15177.
Templeton AR, Weiss KM, Nickerson DA,
Boerwinkle E, Sing CF. Cladistic structure
within the human lipoprotein lipase gene
and its implications for phenotypic associ-
ation studies. Genetics 2000; 156: 1259–
1275.
bility and candidate gene analysis in com-
plex disease: association of mu opioid
receptor gene variation with substance
dependence. Hum Mol Genet 2000; 9:
2895–2908.
4
5
7
8
10 Li T, Ball D, Zhao J, Murray RM, Liu X, Sham
PC et al. Family-based linkage disequilib-
rium mapping using SNP marker haplo-
types: application to a potential locus for
schizophrenia at chromosome 22q11. Mol
Psychiatry 2000; 5: 77–84.
Buetow KH, Edmonson M, MacDonald R,
Clifford R, Yip P, Kelley
J et al. High-
throughput development and characteriz-
ation of a genome-wide collection of gene-
based single nucleotide polymorphism mar-
kers by chip-based matrix-assisted laser
desorption/ionization time-of-flight mass
spectrometry. Proc Natl Acad Sci U S A 2001;
98: 581–584.
11 Kruglyak L, Nickerson DA. Variation is the
spice of life. Nat Genet 2001; 27: 234–236.
9
Hoehe MR, Kopke K, Wendel B, Rohde K,
Flachmeier C, Kidd KK et al. Sequence varia-
6
The International SNP Working Group. A
Sequencing Consortium2 created an
initial gene index which contains
15000 known genes and about 17000
Significance of the human genome
sequence to drug discovery
gene predictions estimating
a cor-
rected gene number of 30000–35000.
The pharmaceutical companies are
certainly among the first to be inter-
ested in evaluating and understanding
the regulation of these newly pre-
dicted genes. Every additional gen-
omic piece of information is
important in the sense that each new
gene or function is a potential drug
target. It is interesting to note that
Celera will seek to develop on its own
100–300 medically important genes
for use by pharmaceutical and biotech-
nology companies.5 Although the
genes with unidentified function are
potentially interesting, selecting them
as targets for new compounds may be
cumbersome and expensive: research
programs would have to be initiated
for extensive assessment of their func-
tion and regulation. The possible
absence of clear homology between
these new genes with already known
sequences may necessitate the identi-
fication and/or development of new
biologically relevant in vitro or in vivo
models that will be used in pharmaco-
logical and toxicological studies. Ulti-
mately, and most interestingly, these
investigations could lead to the devel-
opment of new (classes of) compounds
targeting this gene and its function.
All the functions are not equivalent in
terms of ‘targetability’. Thus, the bio-
chemical classes of drug targets are
O Grenet
NOVARTIS Pharma AG, Preclinical Safety, Basel, Switzerland
Towards the end of their research and
development efforts, pharmaceutical
companies seek to deliver safe and
efficient molecules. The availability of
the human genome sequence shall
contribute to reach this goal.
information. It still remains to be
investigated what regulatory features
within the sequence are essential to
the function and regulation of the
approximately 30000 human genes.
Some parts of the human sequence
(approximately 42% according to
Celera’s work estimate) represent what
would be similar to the primary struc-
ture of an unknown protein to which
a function would still have to be
assigned. Expression studies will be
required to complement genomic
information.
The current DNA-array technology
allows the expression monitoring of
thousands of genes. The complete gen-
ome of Saccharomyces cerevisiae has
been sequenced and is available on oli-
gonucleotide DNA arrays.4 Similarly,
the complete human genome may be
The efforts engaged for the sequen-
cing of the human genome and the
data recently published1,2 will affect a
large population both in public and in
private institutions. Since 1953, when
James Watson and Francis Crick
described for the first time the struc-
ture of the deoxyribose nucleic acid
molecule,3 the completion of the
human genome sequencing projects is
possibly one of the major advances in
biology. Unlike the description of the
nucleic acid structure in 1953, the
massive world-wide genome sequen-
cing efforts were brought to the atten-
tion of
a
large public, creating
assembled on
allowing the analysis of virtually all
the transcripts produced by cell
a high-density chip
important expectations and exciting
perspectives in the scientific com-
munity but also in the less expert
population.
It is important to mention that the
results delivered so far are mostly
descriptive and give the essential basis
for further investigations which will
need to be carried out in order to make
the best use of this linear sequence of
a
population or tissue. Such genome-
wide expression profiling at the mRNA
level may designate new genes
involved in a pathway, potential drug
targets or expression markers that can
be used in a predictive or diagnostic
manner.
biased
towards
receptors
and
enzymes.6 Among the new potential
gene targets some will therefore get a
The International Human Genome