A Novel method for similarity analysis and protein sub-cellular localization prediction

Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on prot...

Full description

Saved in:

Bibliographic Details
Published in	Bioinformatics Vol. 26; no. 21; pp. 2678 - 2683
Main Authors	Liao, Bo, Liao, Benyou, Sun, Xingming, Zeng, Qingguang
Format	Journal Article
Language	English
Published	Oxford Oxford University Press 01.11.2010
Subjects	Algorithms Biological and medical sciences Databases, Protein Fundamental and applied biological sciences. Psychology General aspects Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Proteins - analysis Proteins - chemistry Sequence Alignment - methods Sequence Analysis, Protein - methods Analysis method Similarity Localization Analysis Protein Prediction
Online Access	Get full text

Cover

Loading…

Abstract	Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective. Results: According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable. Contact: dragonbw@163.com
AbstractList	Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective.MOTIVATIONBiological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective.According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable.RESULTSAccording to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable. Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective. Results: According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable. Contact: dragonbw@163.com Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective. Results: According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable. Contact: dragonbw@163.com Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective.Results: According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable. Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is helpful for scientists' studies on biological cells, DNA and proteins. Currently, many researchers used the method based on protein sequences in function classification, sub-cellular location, structure and functional site prediction, including some machine-learning methods. The purpose of this article, is to find a new way of sequence analysis, but more simple and effective. According to the nature of 64 genetic codes, we propose a simple and intuitive 2D graphical expression of protein sequences. And based on this expression we give a new Euclidean-distance method to compute the distance of different sequences for the analysis of sequence similarity. This approach contains more sequence information. A typical phylogenetic tree constructed based on this method proved the effectiveness of our approach. Finally, we use this sequence-similarity-analysis method to predict protein sub-cellular localization, in the two datasets commonly used. The results show that the method is reasonable.
Author	Sun, Xingming Liao, Bo Liao, Benyou Zeng, Qingguang
Author_xml	– sequence: 1 givenname: Bo surname: Liao fullname: Liao, Bo organization: To whom correspondence should be addressed – sequence: 2 givenname: Benyou surname: Liao fullname: Liao, Benyou organization: School of computer and communication, Hunan University, Changsha Hunan, 410082, China – sequence: 3 givenname: Xingming surname: Sun fullname: Sun, Xingming organization: School of computer and communication, Hunan University, Changsha Hunan, 410082, China – sequence: 4 givenname: Qingguang surname: Zeng fullname: Zeng, Qingguang organization: School of computer and communication, Hunan University, Changsha Hunan, 410082, China
BackLink	http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=23340941$$DView record in Pascal Francis https://www.ncbi.nlm.nih.gov/pubmed/20826879$$D View this record in MEDLINE/PubMed
BookMark	eNqFkUtP3DAUhS1ExbM_AZQNYpWOndiJLVYIKFQaDQX1JTaW7djC4MRgOxXTX19PZwDRDSufxXeuz71nG6wPftAA7CH4CUFWT6T1djA-9CJZFScyPZIKrYEthBtYVpCw9azrpi0xhfUm2I7xDkKCMMYbYLOCtGpoy7bAz-Ni5n9rV_Q63fquyBOLaHvrRLBpXohBuHm0MYuueAg-aTsUcZSl0s6NGSqcV8LZPzmFHzKhO6sWchd8MMJF_XH17oDvn8--nVyU08vzLyfH01JhQlIpsERaY4OlZA0TplJYM6w6QxSuKILGSGk6iqq2MaSTEEojoWqZQoRSKWS9Aw6Xc3O4x1HHxHsbF-HEoP0YOSWkoaQl7F0yM7SlrKGZ3F-Ro-x1xx-C7UWY8-erZeBgBYiYtzdBDMrGV66uMWQYZe5oyangYwzacGXTv0ulIKzjCPJFl_xtl3zZZXaT_9zPH7znK5c-G5N-ejGJcM-btm4Jv_h1w6-vvl79oLMZP63_Ao6MvEk
CitedBy_id	crossref_primary_10_1016_j_physa_2015_10_067 crossref_primary_10_1038_s41598_018_26005_3 crossref_primary_10_1016_j_compbiomed_2014_11_012 crossref_primary_10_1016_j_jtbi_2016_06_002 crossref_primary_10_1021_ci500577m crossref_primary_10_1016_j_physa_2013_05_015 crossref_primary_10_1186_1687_4153_2014_1 crossref_primary_10_1039_C8RA05122D crossref_primary_10_4137_EBO_S14713 crossref_primary_10_1038_s41598_021_93154_3 crossref_primary_10_4236_cmb_2016_62003 crossref_primary_10_1016_j_jtbi_2012_03_023 crossref_primary_10_1016_j_jtbi_2013_06_037 crossref_primary_10_3389_fmicb_2018_02500 crossref_primary_10_1038_srep46237 crossref_primary_10_1142_S1793524515500631 crossref_primary_10_1016_j_compbiolchem_2017_04_001 crossref_primary_10_1002_jcc_21906 crossref_primary_10_1002_prot_26226 crossref_primary_10_1016_j_physa_2011_08_015 crossref_primary_10_1016_j_cplett_2013_10_076 crossref_primary_10_1016_j_biosystems_2019_03_002 crossref_primary_10_1371_journal_pone_0287880 crossref_primary_10_1016_j_jtbi_2011_11_021 crossref_primary_10_4236_ijamsc_2016_41001 crossref_primary_10_1109_TCBB_2013_10 crossref_primary_10_1002_jcc_21833 crossref_primary_10_1016_j_compbiolchem_2018_04_016 crossref_primary_10_1007_s00726_011_1106_9 crossref_primary_10_1016_j_cplett_2012_02_030 crossref_primary_10_1371_journal_pone_0167430 crossref_primary_10_1038_s41598_019_55378_2 crossref_primary_10_1155_2014_959753
Cites_doi	10.2174/157340906778226436 10.1002/jcc.21500 10.2165/00822942-200504030-00004 10.1016/j.cplett.2009.06.017 10.1142/S0129065705000281 10.1016/S0009-2614(03)00244-6 10.1016/j.jtbi.2007.05.019 10.1016/j.jtbi.2006.11.010 10.1016/j.febslet.2005.05.021 10.1016/j.cplett.2007.04.037 10.1002/jcc.21501 10.1016/j.cplett.2004.12.062 10.1016/j.cplett.2004.08.118 10.1186/1471-2105-7-298 10.1016/j.cplett.2005.08.011 10.1021/ci990084z 10.1021/ci000034q 10.1016/S0009-2614(02)01784-0 10.1186/1477-5956-7-27 10.1016/j.jmgm.2008.10.004 10.1002/prot.10251 10.1007/s10910-006-9091-z 10.1016/j.jtbi.2009.07.017 10.1016/j.jtbi.2009.08.005 10.1002/qua.21698 10.1016/j.jtbi.2009.03.025 10.1002/qua.21919
ContentType	Journal Article
Copyright	2015 INIST-CNRS
Copyright_xml	– notice: 2015 INIST-CNRS
DBID	BSCLL AAYXX CITATION IQODW CGR CUY CVF ECM EIF NPM 7X8 7QO 8FD FR3 P64
DOI	10.1093/bioinformatics/btq521
DatabaseName	Istex CrossRef Pascal-Francis Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic Biotechnology Research Abstracts Technology Research Database Engineering Research Database Biotechnology and BioEngineering Abstracts
DatabaseTitle	CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic Engineering Research Database Biotechnology Research Abstracts Technology Research Database Biotechnology and BioEngineering Abstracts
DatabaseTitleList	MEDLINE - Academic CrossRef Engineering Research Database MEDLINE
Database_xml	– sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Biology
EISSN	1460-2059 1367-4811
EndPage	2683
ExternalDocumentID	20826879 23340941 10_1093_bioinformatics_btq521 ark_67375_HXZ_RQPQV8NN_D
Genre	Research Support, Non-U.S. Gov't Journal Article
GroupedDBID	-~X .2P .I3 482 48X 5GY AAMVS ABJNI ABPTD ACGFS ACUFI ADZXQ ALMA_UNASSIGNED_HOLDINGS BSCLL CZ4 EE~ F5P F9B H5~ HAR HW0 IOX KSI KSN NGC Q5Y RD5 ROZ RXO TLC TN5 TOX WH7 ~91 --- -E4 .DC 0R~ 1TH 23N 2WC 4.4 53G 5WA 70D AAIJN AAIMJ AAJKP AAJQQ AAKPC AAMDB AAOGV AAPQZ AAPXW AAUQX AAVAP AAVLN AAYXX ABEJV ABEUO ABGNP ABIXL ABNGD ABNKS ABPQP ABQLI ABWST ABXVV ABZBJ ACIWK ACPRK ACUKT ACUXJ ACYTK ADBBV ADEYI ADEZT ADFTL ADGKP ADGZP ADHKW ADHZD ADMLS ADOCK ADPDF ADRDM ADRTK ADVEK ADYVW ADZTZ AECKG AEGPL AEJOX AEKKA AEKSI AELWJ AEMDU AENEX AENZO AEPUE AETBJ AEWNT AFFZL AFGWE AFIYH AFOFC AFRAH AGINJ AGKEF AGQPQ AGQXC AGSYK AHMBA AHXPO AIJHB AJEEA AJEUX AKHUL AKWXX ALTZX ALUQC AMNDL APIBT APWMN ARIXL ASPBG ATTQO AVWKF AXUDD AYOIW AZFZN AZVOD BAWUL BAYMD BHONS BQDIO BQUQU BSWAC BTQHN C1A C45 CAG CDBKE CITATION COF CS3 DAKXR DIK DILTD DU5 D~K EBD EBS EJD EMOBN FEDTE FHSFR FLIZI FLUFQ FOEOM FQBLK GAUVT GJXCC GROUPED_DOAJ GX1 H13 HVGLF HZ~ J21 JXSIZ KAQDR KOP KQ8 M-Z MK~ ML0 N9A NLBLG NMDNZ NOMLY NU- NVLIB O0~ O9- OAWHX ODMLO OJQWA OK1 OVD OVEED P2P PAFKI PB- PEELM PQQKQ Q1. R44 RNS ROL RPM RUSNO RW1 SV3 TEORI TJP TR2 W8F WOQ X7H YAYTL YKOAZ YXANX ZKX ~KM .-4 .GJ ABEFU AFFNX AI. AQDSO ELUNK IQODW NTWIH O~Y RIG RNI RZF RZO VH1 ZGI ABQTQ ADRIX AFXEN BCRHZ CGR CUY CVF ECM EIF M49 NPM ROX 7X8 7QO 8FD FR3 P64
ID	FETCH-LOGICAL-c455t-a4b1ee4f4bb969af2c4e94cdf5c42810ffbbfd81276f5db00bfb0c79c1588bab3
ISSN	1367-4803 1367-4811
IngestDate	Fri Jul 11 00:44:58 EDT 2025 Thu Jul 10 17:30:22 EDT 2025 Wed Feb 19 02:06:15 EST 2025 Mon Jul 21 09:14:18 EDT 2025 Tue Jul 01 03:27:02 EDT 2025 Thu Apr 24 23:04:56 EDT 2025 Tue Aug 05 16:46:56 EDT 2025
IsDoiOpenAccess	false
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	21
Keywords	Analysis method Similarity Localization Analysis Protein Prediction
Language	English
License	CC BY 4.0
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-c455t-a4b1ee4f4bb969af2c4e94cdf5c42810ffbbfd81276f5db00bfb0c79c1588bab3
Notes	To whom correspondence should be addressed. istex:940FFCBF772EF865A4C662DA978AED66A2BD5890 ArticleID:btq521 ark:/67375/HXZ-RQPQV8NN-D Associate Editor: John Quackenbush ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
OpenAccessLink	https://academic.oup.com/bioinformatics/article-pdf/26/21/2678/567512/btq521.pdf
PMID	20826879
PQID	759878968
PQPubID	23479
PageCount	6
ParticipantIDs	proquest_miscellaneous_855685759 proquest_miscellaneous_759878968 pubmed_primary_20826879 pascalfrancis_primary_23340941 crossref_citationtrail_10_1093_bioinformatics_btq521 crossref_primary_10_1093_bioinformatics_btq521 istex_primary_ark_67375_HXZ_RQPQV8NN_D
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2010-11-01
PublicationDateYYYYMMDD	2010-11-01
PublicationDate_xml	– month: 11 year: 2010 text: 2010-11-01 day: 01
PublicationDecade	2010
PublicationPlace	Oxford
PublicationPlace_xml	– name: Oxford – name: England
PublicationTitle	Bioinformatics
PublicationTitleAlternate	Bioinformatics
PublicationYear	2010
Publisher	Oxford University Press
Publisher_xml	– name: Oxford University Press
References	Chen (2023012507543699200_B5) 2007; 248 Randić (2023012507543699200_B23) 2009; 27 Liao (2023012507543699200_B12) 2007; 42 Randić (2023012507543699200_B22) 2007; 440 Cao (2023012507543699200_B7) 2008; 108 Bai (2023012507543699200_B3) 2005; 413 Randić (2023012507543699200_B18) 2000; 40 Lee (2023012507543699200_B13) 2009; 7 Gao (2023012507543699200_B8) 2005; 579 Randić (2023012507543699200_B20) 2003; 371 Zhang (2023012507543699200_B28) 2009; 259 Liao (2023012507543699200_B10) 2005; 402 Liao (2023012507543699200_B11) 2006; 2 Liu (2023012507543699200_B15) 2009; 109 He (2023012507543699200_B9) 2010; 31 Chen (2023012507543699200_B6) 2007; 245 Wen (2023012507543699200_B24) 2009; 476 Yu (2023012507543699200_B26) 2010; 31 Al-Shahib (2023012507543699200_B2) 2005; 15 Al-Shahib (2023012507543699200_B1) 2005; 4 Randić (2023012507543699200_B19) 2003; 368 Nandy (2023012507543699200_B16) 1996; 12 Li (2023012507543699200_B14) 2009; 261 Zhou (2023012507543699200_B27) 2003; 50 Randić (2023012507543699200_B17) 2000; 40 Randić (2023012507543699200_B21) 2004; 397 Bulashevska (2023012507543699200_B4) 2006; 7 Yu (2023012507543699200_B25) 2009; 261
References_xml	– volume: 2 start-page: 275 year: 2006 ident: 2023012507543699200_B11 article-title: Analysis of similarity/dissimilarity of DNA primary sequences based on condensed matrices and information entropies publication-title: Curr. Comput. Aid. Drug Des. doi: 10.2174/157340906778226436 – volume: 31 start-page: 2126 year: 2010 ident: 2023012507543699200_B26 article-title: Reannotation of protein-coding genes based on an improved graphical representation of DNA sequence publication-title: J. Comput. Chem. doi: 10.1002/jcc.21500 – volume: 4 start-page: 195 year: 2005 ident: 2023012507543699200_B1 article-title: Feature selection and the class imbalance problem in predicting protein function from sequence publication-title: Appl. Bioinform. doi: 10.2165/00822942-200504030-00004 – volume: 476 start-page: 281 year: 2009 ident: 2023012507543699200_B24 article-title: A 2D graphical representation of protein sequence and its numerical characterization publication-title: Chem. Phys. Lett. doi: 10.1016/j.cplett.2009.06.017 – volume: 15 start-page: 250 year: 2005 ident: 2023012507543699200_B2 article-title: FRANKSUM: new feature selection method for protein function prediction publication-title: Int. J. Neural Syst. doi: 10.1142/S0129065705000281 – volume: 371 start-page: 202 year: 2003 ident: 2023012507543699200_B20 article-title: Analysis of similarity/dissimilarity Of DNA sequences based on novel 2-D graphical representation publication-title: Chem. Phys. Lett. doi: 10.1016/S0009-2614(03)00244-6 – volume: 248 start-page: 377 year: 2007 ident: 2023012507543699200_B5 article-title: Prediction of apoptosis protein sub cellular location using improved hybrid approach and pseudo-amino acid composition publication-title: J. Theor. Biol. doi: 10.1016/j.jtbi.2007.05.019 – volume: 245 start-page: 775 year: 2007 ident: 2023012507543699200_B6 article-title: Prediction of the sub cellular location of apoptosis proteins publication-title: J. Theor. Biol. doi: 10.1016/j.jtbi.2006.11.010 – volume: 579 start-page: 3444 year: 2005 ident: 2023012507543699200_B8 article-title: Prediction of protein sub cellular location using a combined feature of sequence publication-title: Fed. Eur. Biochem. Soc. doi: 10.1016/j.febslet.2005.05.021 – volume: 440 start-page: 291 year: 2007 ident: 2023012507543699200_B22 article-title: 2-D Graphical representation of proteins based on physico-chemical properties of amino acids publication-title: Chem. Phys. Lett. doi: 10.1016/j.cplett.2007.04.037 – volume: 31 start-page: 2136 year: 2010 ident: 2023012507543699200_B9 article-title: The graphical representation of protein sequences based on the physicochemical properties and its applications publication-title: J. Comput. Chem. doi: 10.1002/jcc.21501 – volume: 402 start-page: 380 year: 2005 ident: 2023012507543699200_B10 article-title: A 4D representation of DNA sequences and its application publication-title: Chem. Phys. Lett. doi: 10.1016/j.cplett.2004.12.062 – volume: 397 start-page: 247 year: 2004 ident: 2023012507543699200_B21 article-title: Unique graphical representation of protein sequences based on nucleotide triplet codons publication-title: Chem. Phys. Lett. doi: 10.1016/j.cplett.2004.08.118 – volume: 7 start-page: 298 year: 2006 ident: 2023012507543699200_B4 article-title: Predicting protein sub cellular locations using hierarchical ensemble of Bayesian classifiers based on Markov chains publication-title: BMC Bioinformatics doi: 10.1186/1471-2105-7-298 – volume: 413 start-page: 458 year: 2005 ident: 2023012507543699200_B3 article-title: A 2-D graphical representation of protein sequences based on nucleotide triplet codons publication-title: Chem. Phys. Lett. doi: 10.1016/j.cplett.2005.08.011 – volume: 40 start-page: 50 year: 2000 ident: 2023012507543699200_B17 article-title: Condensed Representation of DNA Primary Sequences publication-title: J. Chem. Inform. Comput. Sci. doi: 10.1021/ci990084z – volume: 40 start-page: 1235 year: 2000 ident: 2023012507543699200_B18 article-title: On 3-D graphical representation of DNA primary sequences and their numerical characterization publication-title: J. Chem. Inform. Comput. Sci. doi: 10.1021/ci000034q – volume: 368 start-page: 1 year: 2003 ident: 2023012507543699200_B19 article-title: Novel 2-D graphical representation of DNA sequences and their numerical characterization publication-title: Chem. Phys. Lett. doi: 10.1016/S0009-2614(02)01784-0 – volume: 7 start-page: 27 year: 2009 ident: 2023012507543699200_B13 article-title: Identification of protein functions using a machine-learning approach based on sequence-derived properties publication-title: Proteome Sci. doi: 10.1186/1477-5956-7-27 – volume: 27 start-page: 637 year: 2009 ident: 2023012507543699200_B23 article-title: Graphical representation of proteins as four-color maps and their numerical characterization publication-title: J. Mol. Graph. Model. doi: 10.1016/j.jmgm.2008.10.004 – volume: 50 start-page: 44 year: 2003 ident: 2023012507543699200_B27 article-title: Sub cellular location prediction of apoptosis proteins publication-title: Proteins doi: 10.1002/prot.10251 – volume: 42 start-page: 47 year: 2007 ident: 2023012507543699200_B12 article-title: On the similarity of DNA primary sequences based on 5D representation publication-title: J. Math. Chem. doi: 10.1007/s10910-006-9091-z – volume: 12 start-page: 55 year: 1996 ident: 2023012507543699200_B16 article-title: Two-dimensional graphical representation of DNA sequences and intron-exon discrimination in intron-rich sequences publication-title: Comput. Appl. Biosci. – volume: 261 start-page: 290 year: 2009 ident: 2023012507543699200_B14 article-title: Protein functional class prediction using global encoding of amino acid sequence publication-title: J. Theo. Biol. doi: 10.1016/j.jtbi.2009.07.017 – volume: 261 start-page: 459 year: 2009 ident: 2023012507543699200_B25 article-title: TN curve: A novel 3D graphical representation of DNA sequence based on trinucleotides and its applications publication-title: J. Theor. Biol. doi: 10.1016/j.jtbi.2009.08.005 – volume: 108 start-page: 1485 year: 2008 ident: 2023012507543699200_B7 article-title: A group of 3D graphical representation of DNA sequences based on dual nucleotides publication-title: Int. J. Quantum Chem. doi: 10.1002/qua.21698 – volume: 259 start-page: 361 year: 2009 ident: 2023012507543699200_B28 article-title: A novel representation for apoptosis protein subcellular localization prediction using support vector machine publication-title: J. Theor. Biol. doi: 10.1016/j.jtbi.2009.03.025 – volume: 109 start-page: 948 year: 2009 ident: 2023012507543699200_B15 article-title: A 2-D graphical representation of DNA sequence based on dual nucleotides and its application publication-title: Int. J. Quant. Chem. doi: 10.1002/qua.21919
SSID	ssj0051444 ssj0005056
Score	2.231303
Snippet	Motivation: Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information,... Biological sequence was regarded as an important study by many biologists, because the sequence contains a large number of biological information, what is...
SourceID	proquest pubmed pascalfrancis crossref istex
SourceType	Aggregation Database Index Database Enrichment Source Publisher
StartPage	2678
SubjectTerms	Algorithms Biological and medical sciences Databases, Protein Fundamental and applied biological sciences. Psychology General aspects Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Proteins - analysis Proteins - chemistry Sequence Alignment - methods Sequence Analysis, Protein - methods
Title	A Novel method for similarity analysis and protein sub-cellular localization prediction
URI	https://api.istex.fr/ark:/67375/HXZ-RQPQV8NN-D/fulltext.pdf https://www.ncbi.nlm.nih.gov/pubmed/20826879 https://www.proquest.com/docview/759878968 https://www.proquest.com/docview/855685759
Volume	26
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnR1db9Mw0IJNSLwgvlc-Jj8gXqZsaewkzmOBTRUqHUUtRLxEtuOgiC2BtUUbv57zR5JG2zTgJYou8SXxnc93l_tA6BWJJStoHng8ioVHE0U9JsBqVZwHJKY-kUQnCn-YRuMFfZ-GaRerarJLVmJf_r4yr-R_qAowoKvOkv0HyrZIAQDnQF84AoXh-Fc0Hu1V9S914tpAm4jBZXlagrGqdWve1BuxtQBq3ddyb7kWnnbWm-hTs5G5RExdLSAvZUun5kdvWbvaqqaesy5Oet7Ew7sGIBvOhEnJjev1TX0JoqqLet39gjLCLoV987TZO7X3WlnJMwPYtzV3F5xPQsd3tD4JK0aJrqbOnBhVV8Cc7LXZ8o7HbKp0I0kj29rnkoi35a9E7-s1YPUzdAh6RbWnx9nRYjLJ5ofp_DbaDsCa0I0u5sdpFwnkmya_7Qs2iV4JOeg_5sA-pKfCbOvVeK5DavkSaFbYdijX2ytGb5nfR_ecwYFHlnseoFuqeoju2BakF4_QlxE2PIQtD2F4C9zxEG54CE5y7HgIb_IQ3uQh3PHQY7Q4Opy_HXuu2YYnaRiuPE7FUClaUCGSKOFFIKlKqMyLUIKFOvSLQogiB3UwjoowB2EtCuHLOJHDkDHBBXmCtqq6UjsIw6oPZDhkNKE-VSRmicj9OBYqDwngkANEm_nLpKtErxuinGQ2IoJk_WnP7LQP0H477IctxXLTgNeGOO3d_Oy7jmOMw2ycfs0-zT7OPrPpNHs3QLs96rUDAkK0_wMw4YacGYhfPcW8UvV6mcVhwuADI3b9LUwX-dN9cAfoqeWEDj8o4BGLk2c343-O7nYr7QXaWp2t1UvQh1di13DzH5mLwSk
linkProvider	Oxford University Press
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+novel+method+for+similarity+analysis+and+protein+sub-cellular+localization+prediction&rft.jtitle=Bioinformatics+%28Oxford%2C+England%29&rft.au=Liao%2C+Bo&rft.au=Liao%2C+Benyou&rft.au=Sun%2C+Xingming&rft.au=Zeng%2C+Qingguang&rft.date=2010-11-01&rft.issn=1367-4811&rft.eissn=1367-4811&rft.volume=26&rft.issue=21&rft.spage=2678&rft_id=info:doi/10.1093%2Fbioinformatics%2Fbtq521&rft.externalDBID=NO_FULL_TEXT
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1367-4803&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1367-4803&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1367-4803&client=summon