A Novel method for similarity analysis and protein sub-cellular localization prediction

Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on prot...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 26; no. 21; pp. 2678 - 2683
Main Authors Liao, Bo, Liao, Benyou, Sun, Xingming, Zeng, Qingguang
Format Journal Article
LanguageEnglish
Published Oxford Oxford University Press 01.11.2010
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective. Results: According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable. Contact: dragonbw@163.com
AbstractList Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective.MOTIVATIONBiological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective.According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable.RESULTSAccording to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable.
Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective. Results: According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable. Contact: dragonbw@163.com
Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective. Results: According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable. Contact:  dragonbw@163.com
Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective.Results: According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable.
Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective. According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable.
Author Sun, Xingming
Liao, Bo
Liao, Benyou
Zeng, Qingguang
Author_xml – sequence: 1
  givenname: Bo
  surname: Liao
  fullname: Liao, Bo
  organization: To whom correspondence should be addressed
– sequence: 2
  givenname: Benyou
  surname: Liao
  fullname: Liao, Benyou
  organization: School of computer and communication, Hunan University, Changsha Hunan, 410082, China
– sequence: 3
  givenname: Xingming
  surname: Sun
  fullname: Sun, Xingming
  organization: School of computer and communication, Hunan University, Changsha Hunan, 410082, China
– sequence: 4
  givenname: Qingguang
  surname: Zeng
  fullname: Zeng, Qingguang
  organization: School of computer and communication, Hunan University, Changsha Hunan, 410082, China
BackLink http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=23340941$$DView record in Pascal Francis
https://www.ncbi.nlm.nih.gov/pubmed/20826879$$D View this record in MEDLINE/PubMed
BookMark eNqFkUtP3DAUhS1ExbM_AZQNYpWOndiJLVYIKFQaDQX1JTaW7djC4MRgOxXTX19PZwDRDSufxXeuz71nG6wPftAA7CH4CUFWT6T1djA-9CJZFScyPZIKrYEthBtYVpCw9azrpi0xhfUm2I7xDkKCMMYbYLOCtGpoy7bAz-Ni5n9rV_Q63fquyBOLaHvrRLBpXohBuHm0MYuueAg-aTsUcZSl0s6NGSqcV8LZPzmFHzKhO6sWchd8MMJF_XH17oDvn8--nVyU08vzLyfH01JhQlIpsERaY4OlZA0TplJYM6w6QxSuKILGSGk6iqq2MaSTEEojoWqZQoRSKWS9Aw6Xc3O4x1HHxHsbF-HEoP0YOSWkoaQl7F0yM7SlrKGZ3F-Ro-x1xx-C7UWY8-erZeBgBYiYtzdBDMrGV66uMWQYZe5oyangYwzacGXTv0ulIKzjCPJFl_xtl3zZZXaT_9zPH7znK5c-G5N-ejGJcM-btm4Jv_h1w6-vvl79oLMZP63_Ao6MvEk
CitedBy_id crossref_primary_10_1016_j_physa_2015_10_067
crossref_primary_10_1038_s41598_018_26005_3
crossref_primary_10_1016_j_compbiomed_2014_11_012
crossref_primary_10_1016_j_jtbi_2016_06_002
crossref_primary_10_1021_ci500577m
crossref_primary_10_1016_j_physa_2013_05_015
crossref_primary_10_1186_1687_4153_2014_1
crossref_primary_10_1039_C8RA05122D
crossref_primary_10_4137_EBO_S14713
crossref_primary_10_1038_s41598_021_93154_3
crossref_primary_10_4236_cmb_2016_62003
crossref_primary_10_1016_j_jtbi_2012_03_023
crossref_primary_10_1016_j_jtbi_2013_06_037
crossref_primary_10_3389_fmicb_2018_02500
crossref_primary_10_1038_srep46237
crossref_primary_10_1142_S1793524515500631
crossref_primary_10_1016_j_compbiolchem_2017_04_001
crossref_primary_10_1002_jcc_21906
crossref_primary_10_1002_prot_26226
crossref_primary_10_1016_j_physa_2011_08_015
crossref_primary_10_1016_j_cplett_2013_10_076
crossref_primary_10_1016_j_biosystems_2019_03_002
crossref_primary_10_1371_journal_pone_0287880
crossref_primary_10_1016_j_jtbi_2011_11_021
crossref_primary_10_4236_ijamsc_2016_41001
crossref_primary_10_1109_TCBB_2013_10
crossref_primary_10_1002_jcc_21833
crossref_primary_10_1016_j_compbiolchem_2018_04_016
crossref_primary_10_1007_s00726_011_1106_9
crossref_primary_10_1016_j_cplett_2012_02_030
crossref_primary_10_1371_journal_pone_0167430
crossref_primary_10_1038_s41598_019_55378_2
crossref_primary_10_1155_2014_959753
Cites_doi 10.2174/157340906778226436
10.1002/jcc.21500
10.2165/00822942-200504030-00004
10.1016/j.cplett.2009.06.017
10.1142/S0129065705000281
10.1016/S0009-2614(03)00244-6
10.1016/j.jtbi.2007.05.019
10.1016/j.jtbi.2006.11.010
10.1016/j.febslet.2005.05.021
10.1016/j.cplett.2007.04.037
10.1002/jcc.21501
10.1016/j.cplett.2004.12.062
10.1016/j.cplett.2004.08.118
10.1186/1471-2105-7-298
10.1016/j.cplett.2005.08.011
10.1021/ci990084z
10.1021/ci000034q
10.1016/S0009-2614(02)01784-0
10.1186/1477-5956-7-27
10.1016/j.jmgm.2008.10.004
10.1002/prot.10251
10.1007/s10910-006-9091-z
10.1016/j.jtbi.2009.07.017
10.1016/j.jtbi.2009.08.005
10.1002/qua.21698
10.1016/j.jtbi.2009.03.025
10.1002/qua.21919
ContentType Journal Article
Copyright 2015 INIST-CNRS
Copyright_xml – notice: 2015 INIST-CNRS
DBID BSCLL
AAYXX
CITATION
IQODW
CGR
CUY
CVF
ECM
EIF
NPM
7X8
7QO
8FD
FR3
P64
DOI 10.1093/bioinformatics/btq521
DatabaseName Istex
CrossRef
Pascal-Francis
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
Biotechnology Research Abstracts
Technology Research Database
Engineering Research Database
Biotechnology and BioEngineering Abstracts
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
Engineering Research Database
Biotechnology Research Abstracts
Technology Research Database
Biotechnology and BioEngineering Abstracts
DatabaseTitleList MEDLINE - Academic

CrossRef
Engineering Research Database
MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1460-2059
1367-4811
EndPage 2683
ExternalDocumentID 20826879
23340941
10_1093_bioinformatics_btq521
ark_67375_HXZ_RQPQV8NN_D
Genre Research Support, Non-U.S. Gov't
Journal Article
GroupedDBID -~X
.2P
.I3
482
48X
5GY
AAMVS
ABJNI
ABPTD
ACGFS
ACUFI
ADZXQ
ALMA_UNASSIGNED_HOLDINGS
BSCLL
CZ4
EE~
F5P
F9B
H5~
HAR
HW0
IOX
KSI
KSN
NGC
Q5Y
RD5
ROZ
RXO
TLC
TN5
TOX
WH7
~91
---
-E4
.DC
0R~
1TH
23N
2WC
4.4
53G
5WA
70D
AAIJN
AAIMJ
AAJKP
AAJQQ
AAKPC
AAMDB
AAOGV
AAPQZ
AAPXW
AAUQX
AAVAP
AAVLN
AAYXX
ABEJV
ABEUO
ABGNP
ABIXL
ABNGD
ABNKS
ABPQP
ABQLI
ABWST
ABXVV
ABZBJ
ACIWK
ACPRK
ACUKT
ACUXJ
ACYTK
ADBBV
ADEYI
ADEZT
ADFTL
ADGKP
ADGZP
ADHKW
ADHZD
ADMLS
ADOCK
ADPDF
ADRDM
ADRTK
ADVEK
ADYVW
ADZTZ
AECKG
AEGPL
AEJOX
AEKKA
AEKSI
AELWJ
AEMDU
AENEX
AENZO
AEPUE
AETBJ
AEWNT
AFFZL
AFGWE
AFIYH
AFOFC
AFRAH
AGINJ
AGKEF
AGQPQ
AGQXC
AGSYK
AHMBA
AHXPO
AIJHB
AJEEA
AJEUX
AKHUL
AKWXX
ALTZX
ALUQC
AMNDL
APIBT
APWMN
ARIXL
ASPBG
ATTQO
AVWKF
AXUDD
AYOIW
AZFZN
AZVOD
BAWUL
BAYMD
BHONS
BQDIO
BQUQU
BSWAC
BTQHN
C1A
C45
CAG
CDBKE
CITATION
COF
CS3
DAKXR
DIK
DILTD
DU5
D~K
EBD
EBS
EJD
EMOBN
FEDTE
FHSFR
FLIZI
FLUFQ
FOEOM
FQBLK
GAUVT
GJXCC
GROUPED_DOAJ
GX1
H13
HVGLF
HZ~
J21
JXSIZ
KAQDR
KOP
KQ8
M-Z
MK~
ML0
N9A
NLBLG
NMDNZ
NOMLY
NU-
NVLIB
O0~
O9-
OAWHX
ODMLO
OJQWA
OK1
OVD
OVEED
P2P
PAFKI
PB-
PEELM
PQQKQ
Q1.
R44
RNS
ROL
RPM
RUSNO
RW1
SV3
TEORI
TJP
TR2
W8F
WOQ
X7H
YAYTL
YKOAZ
YXANX
ZKX
~KM
.-4
.GJ
ABEFU
AFFNX
AI.
AQDSO
ELUNK
IQODW
NTWIH
O~Y
RIG
RNI
RZF
RZO
VH1
ZGI
ABQTQ
ADRIX
AFXEN
BCRHZ
CGR
CUY
CVF
ECM
EIF
M49
NPM
ROX
7X8
7QO
8FD
FR3
P64
ID FETCH-LOGICAL-c455t-a4b1ee4f4bb969af2c4e94cdf5c42810ffbbfd81276f5db00bfb0c79c1588bab3
ISSN 1367-4803
1367-4811
IngestDate Fri Jul 11 00:44:58 EDT 2025
Thu Jul 10 17:30:22 EDT 2025
Wed Feb 19 02:06:15 EST 2025
Mon Jul 21 09:14:18 EDT 2025
Tue Jul 01 03:27:02 EDT 2025
Thu Apr 24 23:04:56 EDT 2025
Tue Aug 05 16:46:56 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 21
Keywords Analysis method
Similarity
Localization
Analysis
Protein
Prediction
Language English
License CC BY 4.0
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c455t-a4b1ee4f4bb969af2c4e94cdf5c42810ffbbfd81276f5db00bfb0c79c1588bab3
Notes To whom correspondence should be addressed.
istex:940FFCBF772EF865A4C662DA978AED66A2BD5890
ArticleID:btq521
ark:/67375/HXZ-RQPQV8NN-D
Associate Editor: John Quackenbush
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
OpenAccessLink https://academic.oup.com/bioinformatics/article-pdf/26/21/2678/567512/btq521.pdf
PMID 20826879
PQID 759878968
PQPubID 23479
PageCount 6
ParticipantIDs proquest_miscellaneous_855685759
proquest_miscellaneous_759878968
pubmed_primary_20826879
pascalfrancis_primary_23340941
crossref_citationtrail_10_1093_bioinformatics_btq521
crossref_primary_10_1093_bioinformatics_btq521
istex_primary_ark_67375_HXZ_RQPQV8NN_D
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2010-11-01
PublicationDateYYYYMMDD 2010-11-01
PublicationDate_xml – month: 11
  year: 2010
  text: 2010-11-01
  day: 01
PublicationDecade 2010
PublicationPlace Oxford
PublicationPlace_xml – name: Oxford
– name: England
PublicationTitle Bioinformatics
PublicationTitleAlternate Bioinformatics
PublicationYear 2010
Publisher Oxford University Press
Publisher_xml – name: Oxford University Press
References Chen (2023012507543699200_B5) 2007; 248
Randić (2023012507543699200_B23) 2009; 27
Liao (2023012507543699200_B12) 2007; 42
Randić (2023012507543699200_B22) 2007; 440
Cao (2023012507543699200_B7) 2008; 108
Bai (2023012507543699200_B3) 2005; 413
Randić (2023012507543699200_B18) 2000; 40
Lee (2023012507543699200_B13) 2009; 7
Gao (2023012507543699200_B8) 2005; 579
Randić (2023012507543699200_B20) 2003; 371
Zhang (2023012507543699200_B28) 2009; 259
Liao (2023012507543699200_B10) 2005; 402
Liao (2023012507543699200_B11) 2006; 2
Liu (2023012507543699200_B15) 2009; 109
He (2023012507543699200_B9) 2010; 31
Chen (2023012507543699200_B6) 2007; 245
Wen (2023012507543699200_B24) 2009; 476
Yu (2023012507543699200_B26) 2010; 31
Al-Shahib (2023012507543699200_B2) 2005; 15
Al-Shahib (2023012507543699200_B1) 2005; 4
Randić (2023012507543699200_B19) 2003; 368
Nandy (2023012507543699200_B16) 1996; 12
Li (2023012507543699200_B14) 2009; 261
Zhou (2023012507543699200_B27) 2003; 50
Randić (2023012507543699200_B17) 2000; 40
Randić (2023012507543699200_B21) 2004; 397
Bulashevska (2023012507543699200_B4) 2006; 7
Yu (2023012507543699200_B25) 2009; 261
References_xml – volume: 2
  start-page: 275
  year: 2006
  ident: 2023012507543699200_B11
  article-title: Analysis of similarity/dissimilarity of DNA primary sequences based on condensed matrices and information entropies
  publication-title: Curr. Comput. Aid. Drug Des.
  doi: 10.2174/157340906778226436
– volume: 31
  start-page: 2126
  year: 2010
  ident: 2023012507543699200_B26
  article-title: Reannotation of protein-coding genes based on an improved graphical representation of DNA sequence
  publication-title: J. Comput. Chem.
  doi: 10.1002/jcc.21500
– volume: 4
  start-page: 195
  year: 2005
  ident: 2023012507543699200_B1
  article-title: Feature selection and the class imbalance problem in predicting protein function from sequence
  publication-title: Appl. Bioinform.
  doi: 10.2165/00822942-200504030-00004
– volume: 476
  start-page: 281
  year: 2009
  ident: 2023012507543699200_B24
  article-title: A 2D graphical representation of protein sequence and its numerical characterization
  publication-title: Chem. Phys. Lett.
  doi: 10.1016/j.cplett.2009.06.017
– volume: 15
  start-page: 250
  year: 2005
  ident: 2023012507543699200_B2
  article-title: FRANKSUM: new feature selection method for protein function prediction
  publication-title: Int. J. Neural Syst.
  doi: 10.1142/S0129065705000281
– volume: 371
  start-page: 202
  year: 2003
  ident: 2023012507543699200_B20
  article-title: Analysis of similarity/dissimilarity Of DNA sequences based on novel 2-D graphical representation
  publication-title: Chem. Phys. Lett.
  doi: 10.1016/S0009-2614(03)00244-6
– volume: 248
  start-page: 377
  year: 2007
  ident: 2023012507543699200_B5
  article-title: Prediction of apoptosis protein sub cellular location using improved hybrid approach and pseudo-amino acid composition
  publication-title: J. Theor. Biol.
  doi: 10.1016/j.jtbi.2007.05.019
– volume: 245
  start-page: 775
  year: 2007
  ident: 2023012507543699200_B6
  article-title: Prediction of the sub cellular location of apoptosis proteins
  publication-title: J. Theor. Biol.
  doi: 10.1016/j.jtbi.2006.11.010
– volume: 579
  start-page: 3444
  year: 2005
  ident: 2023012507543699200_B8
  article-title: Prediction of protein sub cellular location using a combined feature of sequence
  publication-title: Fed. Eur. Biochem. Soc.
  doi: 10.1016/j.febslet.2005.05.021
– volume: 440
  start-page: 291
  year: 2007
  ident: 2023012507543699200_B22
  article-title: 2-D Graphical representation of proteins based on physico-chemical properties of amino acids
  publication-title: Chem. Phys. Lett.
  doi: 10.1016/j.cplett.2007.04.037
– volume: 31
  start-page: 2136
  year: 2010
  ident: 2023012507543699200_B9
  article-title: The graphical representation of protein sequences based on the physicochemical properties and its applications
  publication-title: J. Comput. Chem.
  doi: 10.1002/jcc.21501
– volume: 402
  start-page: 380
  year: 2005
  ident: 2023012507543699200_B10
  article-title: A 4D representation of DNA sequences and its application
  publication-title: Chem. Phys. Lett.
  doi: 10.1016/j.cplett.2004.12.062
– volume: 397
  start-page: 247
  year: 2004
  ident: 2023012507543699200_B21
  article-title: Unique graphical representation of protein sequences based on nucleotide triplet codons
  publication-title: Chem. Phys. Lett.
  doi: 10.1016/j.cplett.2004.08.118
– volume: 7
  start-page: 298
  year: 2006
  ident: 2023012507543699200_B4
  article-title: Predicting protein sub cellular locations using hierarchical ensemble of Bayesian classifiers based on Markov chains
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-7-298
– volume: 413
  start-page: 458
  year: 2005
  ident: 2023012507543699200_B3
  article-title: A 2-D graphical representation of protein sequences based on nucleotide triplet codons
  publication-title: Chem. Phys. Lett.
  doi: 10.1016/j.cplett.2005.08.011
– volume: 40
  start-page: 50
  year: 2000
  ident: 2023012507543699200_B17
  article-title: Condensed Representation of DNA Primary Sequences
  publication-title: J. Chem. Inform. Comput. Sci.
  doi: 10.1021/ci990084z
– volume: 40
  start-page: 1235
  year: 2000
  ident: 2023012507543699200_B18
  article-title: On 3-D graphical representation of DNA primary sequences and their numerical characterization
  publication-title: J. Chem. Inform. Comput. Sci.
  doi: 10.1021/ci000034q
– volume: 368
  start-page: 1
  year: 2003
  ident: 2023012507543699200_B19
  article-title: Novel 2-D graphical representation of DNA sequences and their numerical characterization
  publication-title: Chem. Phys. Lett.
  doi: 10.1016/S0009-2614(02)01784-0
– volume: 7
  start-page: 27
  year: 2009
  ident: 2023012507543699200_B13
  article-title: Identification of protein functions using a machine-learning approach based on sequence-derived properties
  publication-title: Proteome Sci.
  doi: 10.1186/1477-5956-7-27
– volume: 27
  start-page: 637
  year: 2009
  ident: 2023012507543699200_B23
  article-title: Graphical representation of proteins as four-color maps and their numerical characterization
  publication-title: J. Mol. Graph. Model.
  doi: 10.1016/j.jmgm.2008.10.004
– volume: 50
  start-page: 44
  year: 2003
  ident: 2023012507543699200_B27
  article-title: Sub cellular location prediction of apoptosis proteins
  publication-title: Proteins
  doi: 10.1002/prot.10251
– volume: 42
  start-page: 47
  year: 2007
  ident: 2023012507543699200_B12
  article-title: On the similarity of DNA primary sequences based on 5D representation
  publication-title: J. Math. Chem.
  doi: 10.1007/s10910-006-9091-z
– volume: 12
  start-page: 55
  year: 1996
  ident: 2023012507543699200_B16
  article-title: Two-dimensional graphical representation of DNA sequences and intron-exon discrimination in intron-rich sequences
  publication-title: Comput. Appl. Biosci.
– volume: 261
  start-page: 290
  year: 2009
  ident: 2023012507543699200_B14
  article-title: Protein functional class prediction using global encoding of amino acid sequence
  publication-title: J. Theo. Biol.
  doi: 10.1016/j.jtbi.2009.07.017
– volume: 261
  start-page: 459
  year: 2009
  ident: 2023012507543699200_B25
  article-title: TN curve: A novel 3D graphical representation of DNA sequence based on trinucleotides and its applications
  publication-title: J. Theor. Biol.
  doi: 10.1016/j.jtbi.2009.08.005
– volume: 108
  start-page: 1485
  year: 2008
  ident: 2023012507543699200_B7
  article-title: A group of 3D graphical representation of DNA sequences based on dual nucleotides
  publication-title: Int. J. Quantum Chem.
  doi: 10.1002/qua.21698
– volume: 259
  start-page: 361
  year: 2009
  ident: 2023012507543699200_B28
  article-title: A novel representation for apoptosis protein subcellular localization prediction using support vector machine
  publication-title: J. Theor. Biol.
  doi: 10.1016/j.jtbi.2009.03.025
– volume: 109
  start-page: 948
  year: 2009
  ident: 2023012507543699200_B15
  article-title: A 2-D graphical representation of DNA sequence based on dual nucleotides and its application
  publication-title: Int. J. Quant. Chem.
  doi: 10.1002/qua.21919
SSID ssj0051444
ssj0005056
Score 2.231303
Snippet Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information,...
Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is...
SourceID proquest
pubmed
pascalfrancis
crossref
istex
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 2678
SubjectTerms Algorithms
Biological and medical sciences
Databases, Protein
Fundamental and applied biological sciences. Psychology
General aspects
Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)
Proteins - analysis
Proteins - chemistry
Sequence Alignment - methods
Sequence Analysis, Protein - methods
Title A Novel method for similarity analysis and protein sub-cellular localization prediction
URI https://api.istex.fr/ark:/67375/HXZ-RQPQV8NN-D/fulltext.pdf
https://www.ncbi.nlm.nih.gov/pubmed/20826879
https://www.proquest.com/docview/759878968
https://www.proquest.com/docview/855685759
Volume 26
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnR1db9Mw0IJNSLwgvlc-Jj8gXqZsaewkzmOBTRUqHUUtRLxEtuOgiC2BtUUbv57zR5JG2zTgJYou8SXxnc93l_tA6BWJJStoHng8ioVHE0U9JsBqVZwHJKY-kUQnCn-YRuMFfZ-GaRerarJLVmJf_r4yr-R_qAowoKvOkv0HyrZIAQDnQF84AoXh-Fc0Hu1V9S914tpAm4jBZXlagrGqdWve1BuxtQBq3ddyb7kWnnbWm-hTs5G5RExdLSAvZUun5kdvWbvaqqaesy5Oet7Ew7sGIBvOhEnJjev1TX0JoqqLet39gjLCLoV987TZO7X3WlnJMwPYtzV3F5xPQsd3tD4JK0aJrqbOnBhVV8Cc7LXZ8o7HbKp0I0kj29rnkoi35a9E7-s1YPUzdAh6RbWnx9nRYjLJ5ofp_DbaDsCa0I0u5sdpFwnkmya_7Qs2iV4JOeg_5sA-pKfCbOvVeK5DavkSaFbYdijX2ytGb5nfR_ecwYFHlnseoFuqeoju2BakF4_QlxE2PIQtD2F4C9zxEG54CE5y7HgIb_IQ3uQh3PHQY7Q4Opy_HXuu2YYnaRiuPE7FUClaUCGSKOFFIKlKqMyLUIKFOvSLQogiB3UwjoowB2EtCuHLOJHDkDHBBXmCtqq6UjsIw6oPZDhkNKE-VSRmicj9OBYqDwngkANEm_nLpKtErxuinGQ2IoJk_WnP7LQP0H477IctxXLTgNeGOO3d_Oy7jmOMw2ycfs0-zT7OPrPpNHs3QLs96rUDAkK0_wMw4YacGYhfPcW8UvV6mcVhwuADI3b9LUwX-dN9cAfoqeWEDj8o4BGLk2c343-O7nYr7QXaWp2t1UvQh1di13DzH5mLwSk
linkProvider Oxford University Press
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+novel+method+for+similarity+analysis+and+protein+sub-cellular+localization+prediction&rft.jtitle=Bioinformatics+%28Oxford%2C+England%29&rft.au=Liao%2C+Bo&rft.au=Liao%2C+Benyou&rft.au=Sun%2C+Xingming&rft.au=Zeng%2C+Qingguang&rft.date=2010-11-01&rft.issn=1367-4811&rft.eissn=1367-4811&rft.volume=26&rft.issue=21&rft.spage=2678&rft_id=info:doi/10.1093%2Fbioinformatics%2Fbtq521&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1367-4803&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1367-4803&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1367-4803&client=summon