An enhanced approach on handling missing values using bagging k-NN imputation
Researchers in the database community have aroused great interest in handling high dimensional data sets for the past decades. Today's business captures inundate sets of data which includes digital documents, web pages-customer databases, hyper-spectral imagery, social networks, gene arrays, pr...
Saved in:
Published in | 2013 International Conference on Computer Communication and Informatics pp. 1 - 8 |
---|---|
Main Authors | , |
Format | Conference Proceeding Journal Article |
Language | English |
Published |
IEEE
01.01.2013
|
Subjects | |
Online Access | Get full text |
ISBN | 1467329061 9781467329064 |
DOI | 10.1109/ICCCI.2013.6466301 |
Cover
Abstract | Researchers in the database community have aroused great interest in handling high dimensional data sets for the past decades. Today's business captures inundate sets of data which includes digital documents, web pages-customer databases, hyper-spectral imagery, social networks, gene arrays, proteomics data, neurobiological signals, high dimensional dynamical systems, sensor networks, financial transactions and traffic statistics thereby generating massive high dimensional datasets. DNA microarray paves methods in identifying different expression levels of thousands of genes during biological process. The problem with microarrays is to measure gene expression from thousands of genes (features) from only tens of hundreds of samples. Microarray data often contain several missing values that may affect subsequent analysis. In this paper, a novel approach on imputation using k-NN with bagging method is proposed to handle missing value. The experimental result shows that the proposed method outperforms other methods in terms of distance and density of clusters. The proposed approach has enhanced the performance of traditional k-NN impute using bagging method. |
---|---|
AbstractList | Researchers in the database community have aroused great interest in handling high dimensional data sets for the past decades. Today's business captures inundate sets of data which includes digital documents, web pages-customer databases, hyper-spectral imagery, social networks, gene arrays, proteomics data, neurobiological signals, high dimensional dynamical systems, sensor networks, financial transactions and traffic statistics thereby generating massive high dimensional datasets. DNA microarray paves methods in identifying different expression levels of thousands of genes during biological process. The problem with microarrays is to measure gene expression from thousands of genes (features) from only tens of hundreds of samples. Microarray data often contain several missing values that may affect subsequent analysis. In this paper, a novel approach on imputation using k-NN with bagging method is proposed to handle missing value. The experimental result shows that the proposed method outperforms other methods in terms of distance and density of clusters. The proposed approach has enhanced the performance of traditional k-NN impute using bagging method. |
Author | Palaniammal, S. Kumutha, V. |
Author_xml | – sequence: 1 givenname: V. surname: Kumutha fullname: Kumutha, V. email: kumuthav@gmail.com organization: Dept. of Comput. Sci., D.J. Acad. for Manage. Excellence, Coimbatore, India – sequence: 2 givenname: S. surname: Palaniammal fullname: Palaniammal, S. email: splvlb@yahoo.com organization: Dept. of Sci. &Humanities, Sri Krishna Coll. of Technol., Coimbatore, India |
BookMark | eNo1kMtOwzAQRY0ACVr6A7Dxkk3K-BE7XlYRj0qlbGAdTRK7NaROqBMk_p5Ay-roju6MjmZCzkIbLCHXDOaMgblb5nm-nHNgYq6kUgLYCZkZnTGptOAGNDslk_-g2AWZxfgOAOOy4hwuyfMiUBu2GCpbU-y6fYvVlraBjqO68WFDdz7GX35hM9hIh79Q4mbzy49kvaZ-1w099r4NV-TcYRPt7MgpeXu4f82fktXL4zJfrBLPIesTqYThlUMtapdmUDNtoJYGldal0CatU8c5y5xDhkZXpuRSlA6F5imX0mkxJbeHu6Pv52jVF6NlZZsGg22HWLAMQApjmBqrN4eqt9YW3d7vcP9dHJ8lfgD2kF39 |
ContentType | Conference Proceeding Journal Article |
DBID | 6IE 6IL CBEJK RIE RIL 7SC 7SP 8FD JQ2 L7M L~C L~D |
DOI | 10.1109/ICCCI.2013.6466301 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE/IET Electronic Library IEEE Proceedings Order Plans (POP All) 1998-Present Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional |
DatabaseTitleList | Technology Research Database |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9781467329071 1467329053 9781467329057 146732907X |
EndPage | 8 |
ExternalDocumentID | 6466301 |
Genre | orig-research |
GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR AAWTH ADFMO ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK IEGSK IERZE OCL RIE RIL 7SC 7SP 8FD JQ2 L7M L~C L~D |
ID | FETCH-LOGICAL-i208t-46392cfa73df580d1790d49a677b3795d5f2218ffa1a97c9b243bfa3725244f73 |
IEDL.DBID | RIE |
ISBN | 1467329061 9781467329064 |
IngestDate | Fri Jul 11 16:27:27 EDT 2025 Wed Aug 27 04:54:27 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i208t-46392cfa73df580d1790d49a677b3795d5f2218ffa1a97c9b243bfa3725244f73 |
Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Conference-1 ObjectType-Feature-3 content type line 23 SourceType-Conference Papers & Proceedings-2 |
PQID | 1800439916 |
PQPubID | 23500 |
PageCount | 8 |
ParticipantIDs | proquest_miscellaneous_1800439916 ieee_primary_6466301 |
PublicationCentury | 2000 |
PublicationDate | 2013-Jan. 20130101 |
PublicationDateYYYYMMDD | 2013-01-01 |
PublicationDate_xml | – month: 01 year: 2013 text: 2013-Jan. |
PublicationDecade | 2010 |
PublicationTitle | 2013 International Conference on Computer Communication and Informatics |
PublicationTitleAbbrev | ICCCI |
PublicationYear | 2013 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0001106220 |
Score | 1.5805638 |
Snippet | Researchers in the database community have aroused great interest in handling high dimensional data sets for the past decades. Today's business captures... |
SourceID | proquest ieee |
SourceType | Aggregation Database Publisher |
StartPage | 1 |
SubjectTerms | Bagging Classification algorithms clustering Clustering algorithms Computers Correlation Density Dynamical systems Gene expression Genes Handling microarray missing value Statistical methods |
Title | An enhanced approach on handling missing values using bagging k-NN imputation |
URI | https://ieeexplore.ieee.org/document/6466301 https://www.proquest.com/docview/1800439916 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA5zJ08qmzh_EcGj6dokbZajFMcmbHhwsFtp0hcdg05ce_GvN8naDdSDtwba0ry-9n15ed_3ELoXNopo0IZIBZJwQTmRIhbEFMx-TdouKBLHRp7Nk8mCPy_jZQc97LkwAOCLzyBwh34vv9jo2qXKhgm38dGRtY6sm-24Wod8il3bUBp67lYimFMxj1pJp2bMW9JMKIfTNE2nrrKLBc1dm_Yqv_7JPtCMT9CsfcRdfck6qCsV6K8f6o3_ncMp6h8offhlH6zOUAfKHpo9lhjKd18EgFt1cbwpsddesOdh6wUumYCdJjhsce0HKndp6je8JvM5XrmuEP719tFi_PSaTkjTX4GsaDiqCLfohGqTC1aYeBQWTqyr4DJPhFBMyLiIDbUIwJg8yqXQUlHOlMmZoLEFBUawc9QtNyVcIEyVhY2JMJGCmBuQuYUtwJhOJIBUkg9Qz5ki-9hJaGSNFQborjV2Zifk9iryEjb1NotGbo_SgdfLvy-9QsfUd6Zw2ZBr1K0-a7ix-KBSt94xvgFOEbXJ |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT4MwFG6WedCTms04f9bEozBoC12PhrhsOoiHLdmNUHjVZQkzblz862072BL14I0mQOij8L6-977vIXTPtRfJIVeOkCAcxglzBA-4owqqv6ZcbyhCw0aOk3A0Y8_zYN5CDzsuDADY4jNwzaHN5RervDKhsn7ItH80ZK0D7fdZsGVr7SMqendDiGfZWyGnRsfcb0Sd6jFraDOe6I-jKBqb2i7q1vetG6z8-itbVzM8RnHzkNsKk6VbbaSbf_3Qb_zvLE5Qd0_qw687d3WKWlB2UPxYYijfbRkAbvTF8arEVn1Bn4f1OjDhBGxUwWGNKzuQmQlUv-GlkyR4YfpC2BfcRbPh0zQaOXWHBWdBvMHGYRqfkFxlnBYqGHiFkesqmMhCziXlIigCRTQGUCrzM8FzIQmjUmWUk0DDAsXpGWqXqxLOESZSA8eQK19CwBSITAMXoDQPBYCQgvVQx5gi_diKaKS1FXrorjF2qidkshVZCatqnfoDk6U08PXi70tv0eFoGk_SyTh5uURHxPapMLGRK9TefFZwrdHCRt7YRfINBiO5Fg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2013+International+Conference+on+Computer+Communication+and+Informatics&rft.atitle=An+enhanced+approach+on+handling+missing+values+using+bagging+k-NN+imputation&rft.au=Kumutha%2C+V.&rft.au=Palaniammal%2C+S.&rft.date=2013-01-01&rft.pub=IEEE&rft.isbn=9781467329064&rft.spage=1&rft.epage=8&rft_id=info:doi/10.1109%2FICCCI.2013.6466301&rft.externalDocID=6466301 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467329064/lc.gif&client=summon&freeimage=true |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467329064/mc.gif&client=summon&freeimage=true |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467329064/sc.gif&client=summon&freeimage=true |