Scholarly article on 2-methoxy-6-(4-methoxy-phenethyl)-naphthalene 77799-50-9 from Journal of the Chemical Society p. 76

DOI: 10.1093/bioinformatics/17.9.847

Source and publish data:

Journal of the Chemical Society p. 76 (1946)

Update date:2022-07-28

Topics:: Authors:

Hudson

Read Full Text PDF DownLoad Join now for total 90,000,000 free articles

Article abstract of DOI:10.1093/bioinformatics/17.9.847

Full text of DOI:10.1093/bioinformatics/17.9.847

Vol. 17 no. 9 2001

Pages 847–848

BIOINFORMATICS APPLICATIONS NOTE

InterProScan – an integration platform for the

signature-recognition methods in InterPro

Evgeni M. Zdobnov ^∗and Rolf Apweiler

EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome

Campus, Hinxton, Cambridge CB10 1SD, UK

Received on August 2, 2000; revised on January 12, 2001; accepted on May 30, 2001

ABSTRACT

INTERPRO MEMBER DATABASES AND

Summary: InterProScan is a tool that scans given protein

sequences against the protein signatures of the InterPro

member databases, currently – PROSITE, PRINTS, Pfam,

ProDom and SMART. The number of signature databases

and their associated scanning tools as well as the further

reﬁnement procedures make the problem complex. Inter-

ProScan is designed to be a scalable and extensible sys-

tem with a robust internal architecture.

Availability: The Perl-based InterProScan implementation

is available from the EBI ftp server (ftp://ftp.ebi.ac.uk/pub/

software/unix/iprscan/) and the SRS-based InterProScan

is available upon request. We provide the public web

interface (http://www.ebi.ac.uk/interpro/scan.html) as well

as email submission server (interproscan@ebi.ac.uk).

Contact: Evgueni.Zdobnov@EBI.ac.uk

SCANNING METHODS

Legend: ❖ denotes a database and ➢ denotes the associ-

ated scanning tools.

❖PROSITE patterns. Some biologically signiﬁcant

amino acid patterns can be summarized in the form of

regular expressions.

➢ ScanRegExp (by Wolfgang.Fleischmann@ebi.ac.uk).

❖PROSITE proﬁles based on weight matrices (also

known as proﬁles) are more sensitive in detection of

divergent protein families.

➢ pfscan from the Pftools package

(by Philipp.Bucher@isrec.unil.ch).

❖PRINTS database houses a collection of protein family

ﬁngerprints. These are groups of motifs that together are

diagnostically more potent than single motifs by making

use of the biological context inherent in a multiple-motif

method.

INTRODUCTION

Databases of protein domains and functional sites have

become vital resources for the prediction of protein

functions. During the last decade, several signature-

recognition methods have evolved to address different

sequence analysis problems, resulting in rather different

and, for the most part, independent databases. Diagnos-

tically, these resources have different areas of optimum

application owing to the different strengths and weak-

nesses of their underlying analysis methods. Thus, for best

results, search strategies should ideally combine all of

them. InterPro (The InterPro Consortium, 2001) is a col-

laborative project aimed at providing an integrated layer

on top of the most commonly used signature databases

by creating a unique, non-redundant characterization of

a given protein family, domain or functional site. The

InterPro database integrates PROSITE (Hofmann et al.,

1999), PRINTS (Attwood et al., 2000), Pfam (Bateman

et al., 2000), ProDom (Corpet et al., 1999) and SMART

(Schultz et al., 2000) databases and the addition of others

is scheduled.

➢ FingerPRINTScan (Scordis et al., 1999).

❖Pfam is a database of protein domain families. Pfam

contains curated multiple sequence alignments for each

family and corresponding proﬁle hidden Markov models

(HMMs).

➢ hmmpfam from the HMMER2.1 package

(by Sean Eddy, eddy@genetics.wustl.edu,

http://hmmer.wustl.edu),

DeCypher^TM(TimeLogic) HMM search.

❖ProDom families are built by an automated process

based on a recursive use of PSI-BLAST homology

searches.

➢BlastProDom.pl (by Florence Servant,

fservant@toulouse.inra.fr) a ﬁlter on top of the Blast

package (Altschul et al., 1997).

❖SMART domains are extensively annotated with respect

to phyletic distributions, functional class, tertiary struc-

tures and functionally important residues. SMART align-

ments are optimised manually and following construction

of corresponding Hidden Markov Models (HMMs).

➢ hmmpfam from the HMMER2.1 package,

DeCypher^TM(TimeLogic) HMM search.

∗

To whom correspondence should be addressed.

ꢀ Oxford University Press 2001

847

E.M.Zdobnov and R.Apweiler

in other stand-alone ad-hoc solutions. The Perl-based

InterProScan is capable of providing post-processed,

integrated results in several formats and it could be

used as a simple retrieval system for the underlying

data.

The tool has become popular in the bioinformatics

community. The EBI public web interface serves more

than 10 000 interactive requests a month. There are

more than 60 installations worldwide of the Perl-based

InterProScan package that has already been used to

analyse the complete genomes on a production scale.

INTERPROSCAN

InterProScan is a tool that combines different protein sig-

nature recognition methods into one resource. The number

of signature databases and their associated scanning tools

as well as the further reﬁnement procedures increase the

complexity of the problem. InterProScan is more than a

simple wrapping of sequence analysis applications since

it requires performing a considerable data look-up from

some databases and program outputs. The need for pro-

duction scale efﬁciency and an easy extensibility require

a robust and efﬁcient (parallel) internal architecture that

can beneﬁt from network distributed computing with the

support of UNIX queueing systems.

ACKNOWLEDGEMENTS

We developed an SRS-based InterProScan suite as well

as the stand-alone Perl-based InterProScan package.

Nowadays SRS (Etzold et al., 1996) has become an inte-

gration system for both data retrieval and applications for

data analysis that is ideally suited to resolve the data ﬂow

complexity in InterProScan. Firstly, InterProScan was im-

plemented using the introduced technique of joining some

of the SRS integrated applications into one virtual appli-

cation that can organize the execution of the underlying

steps in an efﬁcient (parallel) manner. Later, we devel-

oped a client web interface using the SRS Perl API that

is a compromise between the SRS inter-database linking

integrity and the simplicity of the user interface, providing

‘one-click-away’ results.

While the SRS-based InterProScan has several beneﬁts

from the close integration with other databases it requires

some SRS expertise and is bound to the licensed SRS

distribution. To overcome these limitations we decided

to develop a stand-alone InterProScan version based

on the popular scripting language Perl. The Perl-based

InterProScan was intended as an extensible and scalable

system optimised to cope with bulk data processing. In

the package a Perl-based simple data retrieval system

was introduced to provide the required data look-up

efﬁciency and easy extensibility. The system has a mod-

ular structure and is designed in an SRS-like fashion.

Each of the data description modules deﬁnes the data

schema of the source text data and the parsing rules. The

corresponding Perl module provides an object-oriented

interface to the underlying entry attributes. The parsing

of the source data into the memory objects happens only

once and is done upon request, implementing so-called

lazy-parsing. Hierarchical parsing rules are implemented

using the recursive-descent approach (Parse-RecDescent

package). Fast data retrieval is implemented using the Perl

native B-trees indexing (DB File.pm, based on Berkeley

DB). The simple ‘one Perl module per data source’

organisation makes it possible to reuse the modules

We would like to thank Rodrigo Lopez for general support

and the mailserver backend as well as Thure Etzold and

Henning Hermjakob for useful discussions and ideas.

REFERENCES

Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z.,

Miller,W. and Lipman,D.J. (1997) Gapped BLAST and PSI-

BLAST: a new generation of protein database search programs.

Nucleic Acids Res., 25, 3389–3402.

Attwood,T.K., Croning,M.D., Flower,D.R., Lewis,A.P., Mabey,J.E.,

Scordis,P., Selley,J.N. and Wright,W. (2000) PRINTS-S: the

database formerly known as PRINTS. Nucleic Acids Res., 28,

225–227.

Bateman,A., Birney,E., Durbin,R., Eddy,S.R., Howe,K.L. and

Sonnhammer,E.L. (2000) The Pfam protein families database.

Nucleic Acids Res., 28, 263–266.

Corpet,F., Gouzy,J. and Kahn,D. (1999) Recent improvements of

the ProDom database of protein domain families. Nucleic Acids

Res., 27, 263–267.

Etzold,T., Ulyanov,A. and Argos,P. (1996) SRS: information re-

trieval system for molecular biology data banks. Meth. Enzymol.,

266, 114–128.

Hofmann,K., Bucher,P., Falquet,L. and Bairoch,A. (1999) The

PROSITE database, its status in 1999. Nucleic Acids Res., 27,

215–219.

Schultz,J., Copley,R.R., Doerks,T., Ponting,C.P. and Bork,P. (2000)

SMART: a web-based tool for the study of genetically mobile

domains. Nucleic Acids Res., 28, 231–234.

Scordis,P., Flower,D.R. and Attwood,T.K. (1999) Finger-

PRINTScan: intelligent searching of the PRINTS motif database.

Bioinformatics, 15, 799–806.

The InterPro Consortium (Apweiler,R., Attwood,T.K., Bairoch,A.,

Bateman,A., Birney,E., Biswas,M., Bucher,P., Cerutti,L., Cor-

pet,F., Croning,M.D., Durbin,R., Falquet,L., Fleischmann,W.,

Gouzy,J., Hermjakob,H., Hulo,N., Jonassen,I., Kahn,D.,

Kanapin,A., Karavidopoulou,Y., Lopez,R., Marx,B., Mul-

der,N.J., Oinn,T.M., Pagni,M., Servant,F., Sigrist,C.J. and

Zdobnov,E.M.) (2001) The InterPro database, an integrated doc-

umentation resource for protein families, domains and functional

sites. Nucleic Acids Res., 29, 37–40.

848

Products guided by the article

Product name:2-methoxy-6-(4-methoxy-phenethyl)-naphthalene

Cas No:77799-50-9

R&D Labs maybe for 77799-50-9

SuZhou Hua-Emy Chemical Import and Export Co., LTD.

Contact:+86-512-88804994; +86-512-88804550;

Address:710, Building B, International Trade Center, 12 Huanghelu, Changshu, Jiangsu,China
King Brother Chem Co., Ltd

website:http://www.uvchemkeys.com

Contact:0086-021-58785816

Address:RM2607 Building No.1 Guosheng, Lane 388, Zhongjiang Road, Putuo District, Shanghai 200062 China
Joyochem Co., Ltd.

website:http://www.joyochem.com

Contact:0531-82687998

Address:Factory Building 11, Jinan Comprehensive free trade zone, Shandong, China
Changsha Goomoo Chemical Technology Co.Ltd

Contact:+86-731-82197655

Address:No.649,Chezhan Rd.(N),Changsha,Hunan,China
Anhui Sholon New Material Technology Co., Ltd.

website:http://www.sholonchem.com

Contact:+86-550-5261666

Address:4/F Block B, Beijing Chemical Building.No.520 Tianrun Road ,Science & Education Town Wujin District, Changzhou City Jiangsu Province

Relevant to this article

Design and synthesis of ten biphenyl-neolignan derivatives and their in vitro inhibitory potency against cyclooxygenase-1/2 activity and 5-lipoxygenase-mediated LTB₄-formation

Doi:10.1016/j.bmc.2009.05.018
(2009)
Synthesis and biological evaluation of truncated α-galactosylceramide derivatives focusing on cytokine induction profile

Doi:10.1016/j.bmc.2012.03.025
(2012)
THE SYNTHESIS OF PERFLUOROALKYL AND PERFLUOROALKYLETHER SUBSTITUTED BENZILS

Doi:10.1016/S0022-1139(00)82343-X
(1991)
Synthesis and biological activity of some 3,4-dihydro-4-(4-substituted aryl)-6-(naptho[2,1-b]furan-2-yl pyrimidine-2(1H)-one derivatives

Doi:10.1155/2012/258672
(2012)
Palladium(II)-catalyzed cycloisomerization of substituted 1,5-hexadienes: A combined experimental and computational study on an open and an interrupted hydropalladation/carbopalladation/β-hydride elimination (HCHe) catalytic cycle

Doi:10.1021/jo3004088
(2012)
Highly enantioselective synthesis of chiral allenes by sequential creation of stereogenic center and chirality transfer in a single pot operation

Doi:10.1021/ol300717e
(2012)

Article Doi

DOI: 10.1093/bioinformatics/17.9.847

Source and publish data:

Authors:

Article abstract of DOI:10.1093/bioinformatics/17.9.847

Full text of DOI:10.1093/bioinformatics/17.9.847

Products guided by the article

R&D Labs maybe for 77799-50-9

Relevant to this article

Hot Product