An enhanced approach on handling missing values using bagging k-NN imputation

Researchers in the database community have aroused great interest in handling high dimensional data sets for the past decades. Today's business captures inundate sets of data which includes digital documents, web pages-customer databases, hyper-spectral imagery, social networks, gene arrays, pr...

Full description

Saved in:
Bibliographic Details
Published in2013 International Conference on Computer Communication and Informatics pp. 1 - 8
Main Authors Kumutha, V., Palaniammal, S.
Format Conference Proceeding Journal Article
LanguageEnglish
Published IEEE 01.01.2013
Subjects
Online AccessGet full text
ISBN1467329061
9781467329064
DOI10.1109/ICCCI.2013.6466301

Cover

Abstract Researchers in the database community have aroused great interest in handling high dimensional data sets for the past decades. Today's business captures inundate sets of data which includes digital documents, web pages-customer databases, hyper-spectral imagery, social networks, gene arrays, proteomics data, neurobiological signals, high dimensional dynamical systems, sensor networks, financial transactions and traffic statistics thereby generating massive high dimensional datasets. DNA microarray paves methods in identifying different expression levels of thousands of genes during biological process. The problem with microarrays is to measure gene expression from thousands of genes (features) from only tens of hundreds of samples. Microarray data often contain several missing values that may affect subsequent analysis. In this paper, a novel approach on imputation using k-NN with bagging method is proposed to handle missing value. The experimental result shows that the proposed method outperforms other methods in terms of distance and density of clusters. The proposed approach has enhanced the performance of traditional k-NN impute using bagging method.
AbstractList Researchers in the database community have aroused great interest in handling high dimensional data sets for the past decades. Today's business captures inundate sets of data which includes digital documents, web pages-customer databases, hyper-spectral imagery, social networks, gene arrays, proteomics data, neurobiological signals, high dimensional dynamical systems, sensor networks, financial transactions and traffic statistics thereby generating massive high dimensional datasets. DNA microarray paves methods in identifying different expression levels of thousands of genes during biological process. The problem with microarrays is to measure gene expression from thousands of genes (features) from only tens of hundreds of samples. Microarray data often contain several missing values that may affect subsequent analysis. In this paper, a novel approach on imputation using k-NN with bagging method is proposed to handle missing value. The experimental result shows that the proposed method outperforms other methods in terms of distance and density of clusters. The proposed approach has enhanced the performance of traditional k-NN impute using bagging method.
Author Palaniammal, S.
Kumutha, V.
Author_xml – sequence: 1
  givenname: V.
  surname: Kumutha
  fullname: Kumutha, V.
  email: kumuthav@gmail.com
  organization: Dept. of Comput. Sci., D.J. Acad. for Manage. Excellence, Coimbatore, India
– sequence: 2
  givenname: S.
  surname: Palaniammal
  fullname: Palaniammal, S.
  email: splvlb@yahoo.com
  organization: Dept. of Sci. &Humanities, Sri Krishna Coll. of Technol., Coimbatore, India
BookMark eNo1kMtOwzAQRY0ACVr6A7Dxkk3K-BE7XlYRj0qlbGAdTRK7NaROqBMk_p5Ay-roju6MjmZCzkIbLCHXDOaMgblb5nm-nHNgYq6kUgLYCZkZnTGptOAGNDslk_-g2AWZxfgOAOOy4hwuyfMiUBu2GCpbU-y6fYvVlraBjqO68WFDdz7GX35hM9hIh79Q4mbzy49kvaZ-1w099r4NV-TcYRPt7MgpeXu4f82fktXL4zJfrBLPIesTqYThlUMtapdmUDNtoJYGldal0CatU8c5y5xDhkZXpuRSlA6F5imX0mkxJbeHu6Pv52jVF6NlZZsGg22HWLAMQApjmBqrN4eqt9YW3d7vcP9dHJ8lfgD2kF39
ContentType Conference Proceeding
Journal Article
DBID 6IE
6IL
CBEJK
RIE
RIL
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
DOI 10.1109/ICCCI.2013.6466301
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library
IEEE Proceedings Order Plans (POP All) 1998-Present
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
DatabaseTitleList Technology Research Database

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781467329071
1467329053
9781467329057
146732907X
EndPage 8
ExternalDocumentID 6466301
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ADFMO
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
IERZE
OCL
RIE
RIL
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-i208t-46392cfa73df580d1790d49a677b3795d5f2218ffa1a97c9b243bfa3725244f73
IEDL.DBID RIE
ISBN 1467329061
9781467329064
IngestDate Fri Jul 11 16:27:27 EDT 2025
Wed Aug 27 04:54:27 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i208t-46392cfa73df580d1790d49a677b3795d5f2218ffa1a97c9b243bfa3725244f73
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Conference-1
ObjectType-Feature-3
content type line 23
SourceType-Conference Papers & Proceedings-2
PQID 1800439916
PQPubID 23500
PageCount 8
ParticipantIDs proquest_miscellaneous_1800439916
ieee_primary_6466301
PublicationCentury 2000
PublicationDate 2013-Jan.
20130101
PublicationDateYYYYMMDD 2013-01-01
PublicationDate_xml – month: 01
  year: 2013
  text: 2013-Jan.
PublicationDecade 2010
PublicationTitle 2013 International Conference on Computer Communication and Informatics
PublicationTitleAbbrev ICCCI
PublicationYear 2013
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0001106220
Score 1.5805638
Snippet Researchers in the database community have aroused great interest in handling high dimensional data sets for the past decades. Today's business captures...
SourceID proquest
ieee
SourceType Aggregation Database
Publisher
StartPage 1
SubjectTerms Bagging
Classification algorithms
clustering
Clustering algorithms
Computers
Correlation
Density
Dynamical systems
Gene expression
Genes
Handling
microarray
missing value
Statistical methods
Title An enhanced approach on handling missing values using bagging k-NN imputation
URI https://ieeexplore.ieee.org/document/6466301
https://www.proquest.com/docview/1800439916
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA5zJ08qmzh_EcGj6dokbZajFMcmbHhwsFtp0hcdg05ce_GvN8naDdSDtwba0ry-9n15ed_3ELoXNopo0IZIBZJwQTmRIhbEFMx-TdouKBLHRp7Nk8mCPy_jZQc97LkwAOCLzyBwh34vv9jo2qXKhgm38dGRtY6sm-24Wod8il3bUBp67lYimFMxj1pJp2bMW9JMKIfTNE2nrrKLBc1dm_Yqv_7JPtCMT9CsfcRdfck6qCsV6K8f6o3_ncMp6h8offhlH6zOUAfKHpo9lhjKd18EgFt1cbwpsddesOdh6wUumYCdJjhsce0HKndp6je8JvM5XrmuEP719tFi_PSaTkjTX4GsaDiqCLfohGqTC1aYeBQWTqyr4DJPhFBMyLiIDbUIwJg8yqXQUlHOlMmZoLEFBUawc9QtNyVcIEyVhY2JMJGCmBuQuYUtwJhOJIBUkg9Qz5ki-9hJaGSNFQborjV2Zifk9iryEjb1NotGbo_SgdfLvy-9QsfUd6Zw2ZBr1K0-a7ix-KBSt94xvgFOEbXJ
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT4MwFG6WedCTms04f9bEozBoC12PhrhsOoiHLdmNUHjVZQkzblz862072BL14I0mQOij8L6-977vIXTPtRfJIVeOkCAcxglzBA-4owqqv6ZcbyhCw0aOk3A0Y8_zYN5CDzsuDADY4jNwzaHN5RervDKhsn7ItH80ZK0D7fdZsGVr7SMqendDiGfZWyGnRsfcb0Sd6jFraDOe6I-jKBqb2i7q1vetG6z8-itbVzM8RnHzkNsKk6VbbaSbf_3Qb_zvLE5Qd0_qw687d3WKWlB2UPxYYijfbRkAbvTF8arEVn1Bn4f1OjDhBGxUwWGNKzuQmQlUv-GlkyR4YfpC2BfcRbPh0zQaOXWHBWdBvMHGYRqfkFxlnBYqGHiFkesqmMhCziXlIigCRTQGUCrzM8FzIQmjUmWUk0DDAsXpGWqXqxLOESZSA8eQK19CwBSITAMXoDQPBYCQgvVQx5gi_diKaKS1FXrorjF2qidkshVZCatqnfoDk6U08PXi70tv0eFoGk_SyTh5uURHxPapMLGRK9TefFZwrdHCRt7YRfINBiO5Fg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2013+International+Conference+on+Computer+Communication+and+Informatics&rft.atitle=An+enhanced+approach+on+handling+missing+values+using+bagging+k-NN+imputation&rft.au=Kumutha%2C+V.&rft.au=Palaniammal%2C+S.&rft.date=2013-01-01&rft.pub=IEEE&rft.isbn=9781467329064&rft.spage=1&rft.epage=8&rft_id=info:doi/10.1109%2FICCCI.2013.6466301&rft.externalDocID=6466301
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467329064/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467329064/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467329064/sc.gif&client=summon&freeimage=true