Merlin: Exploratory Analysis with Imprecise Queries

Bibliographic Details
Published in IEEE Transactions on Knowledge and Data Engineering, Vol. 28, no. 2, pp. 342-355
Main Authors Qarabaqi, Bahar; Riedewald, Mirek
Format Journal Article
Language English
Published New York: The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 2016-02-01
Abstract Merlin supports exploratory search in large databases. The user interacts with it by specifying probability distributions over attributes, which express imprecise conditions about the entities of interest. Merlin helps the user home in on the right query conditions by addressing three key challenges: (1) efficiently computing results for an imprecise query, (2) providing feedback about the sensitivity of the result to changes of individual conditions, and (3) suggesting new conditions. We formally introduce the notion of sensitivity and prove structural properties that enable efficient algorithms for quantifying the effect of uncertainty in user-specified conditions. To support interactive responses, we also develop techniques that can deliver probability estimates within a given realtime limit and are able to adapt automatically as interactive query refinement proceeds.
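The abstract's first two challenges can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy bird data, the function names `match_scores` and `sensitivity`, the independence of attributes, and the use of total variation distance as a stand-in for the paper's formal notion of sensitivity.

```python
# Illustrative sketch only: not the algorithm or the sensitivity definition from the paper.

def match_scores(entities, query):
    """Score each entity under an imprecise query.

    `query` maps an attribute name to a probability distribution over its values.
    Assumption: attributes are independent, so an entity's score is the product
    of the probabilities the query assigns to the entity's attribute values.
    Scores are normalized to sum to 1 across all entities.
    """
    scores = {}
    for name, attrs in entities.items():
        p = 1.0
        for attr, dist in query.items():
            p *= dist.get(attrs[attr], 0.0)
        scores[name] = p
    total = sum(scores.values())
    return {n: (s / total if total > 0 else 0.0) for n, s in scores.items()}


def sensitivity(entities, query, attr, new_dist):
    """Toy sensitivity: total variation distance between the result
    before and after replacing the condition on `attr` with `new_dist`."""
    base = match_scores(entities, query)
    changed = match_scores(entities, {**query, attr: new_dist})
    return 0.5 * sum(abs(base[n] - changed[n]) for n in entities)


# Hypothetical data: three entities with two categorical attributes.
entities = {
    "cardinal": {"color": "red", "size": "small"},
    "bluejay": {"color": "blue", "size": "small"},
    "heron": {"color": "blue", "size": "large"},
}

# Imprecise query: "probably red, no idea about the size".
query = {
    "color": {"red": 0.8, "blue": 0.2},
    "size": {"small": 0.5, "large": 0.5},
}

result = match_scores(entities, query)
color_sensitivity = sensitivity(entities, query, "color", {"red": 0.5, "blue": 0.5})
```

In this toy setting, a large value of `color_sensitivity` would signal that the ranking depends strongly on the color condition, so the user might want to refine that condition first.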
Author Qarabaqi, Bahar (bahar@ccs.neu.edu), College of Computer and Information Science, Northeastern University, Boston, MA
Riedewald, Mirek (mirek@ccs.neu.edu), College of Computer and Information Science, Northeastern University, Boston, MA
CODEN ITKEEH
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016
DOI 10.1109/TKDE.2015.2496270
Discipline Engineering
Computer Science
EISSN 1558-2191
Genre orig-research
Funding US National Science Foundation, grants IIS-1017793 and DRL-1010818
ISSN 1041-4347
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords Interactive data exploration and discovery
PageCount 14
SubjectTerms 8.II.VIII.VIII
Algorithms
Data models
Estimates
Image color analysis
Interactive
Interactive data exploration and discovery
Probability distribution
Queries
Real time
Searching
Uncertainty
URI https://ieeexplore.ieee.org/document/7312990