Using wavelet analysis for text categorization in digital libraries: a first experiment with Strathprints

Digital libraries increasingly benefit from research on automated text categorization for improved access. Such research is typically carried out by means of standard test collections. In this article, we present a pilot experiment of replacing such test collections by a set of 6,000 objects from a...

Bibliographic Details
Published in International journal on digital libraries Vol. 12; no. 1; pp. 3-12
Main Authors Darányi, Sándor; Wittek, Peter; Dobreva, Milena
Format Journal Article
Language English
Published Berlin/Heidelberg Springer-Verlag 01.07.2012
Springer Nature B.V

Abstract Digital libraries increasingly benefit from research on automated text categorization for improved access. Such research is typically carried out on standard test collections. In this article, we present a pilot experiment that replaces such test collections with a set of 6,000 objects from a real-world digital repository, indexed by Library of Congress Subject Headings, and test support vector machines in a supervised learning setting for their ability to reproduce the existing classification. To augment the standard approach, we introduce a combination of two novel elements: using functions to represent document content in Hilbert space, and adding extra semantics from lexical resources to the representation. Results suggest that wavelet-based kernels slightly outperformed traditional kernels when reconstructing the classification from abstracts, whereas traditional kernels slightly outperformed wavelet-based ones on full-text documents, the latter outcome being due to word sense ambiguity. The practical implementation of our methodological framework enhances the analysis and representation of specific knowledge relevant to large-scale digital collections, in this case their thematic coverage. Representation of specific knowledge about digital collections is one of the basic elements of persistent archives, and a less studied one than the representation of digital objects and collections. Our research is an initial step in this direction: it develops the methodological approach further and demonstrates that text categorization can be applied to analyse the thematic coverage of digital repositories.
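Illustrative note: the abstract does not state which wavelet kernel the authors used. As a rough sketch of what a wavelet-based kernel for a support vector machine can look like, the following assumes a commonly used translation-invariant form built from a Morlet-type mother wavelet, h(u) = cos(1.75u)·exp(-u²/2). The function names, the dilation parameter `a`, and the toy vectors are hypothetical and are not taken from the article.

```python
import math


def morlet(u):
    """Morlet-type mother wavelet h(u) = cos(1.75 u) * exp(-u^2 / 2) (assumed form)."""
    return math.cos(1.75 * u) * math.exp(-u * u / 2.0)


def wavelet_kernel(x, y, a=1.0):
    """Translation-invariant wavelet kernel: product over dimensions of h((x_i - y_i) / a).

    `a` is a dilation parameter (hypothetical default); x and y are equal-length
    numeric feature vectors, e.g. term weights of two documents.
    """
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    k = 1.0
    for xi, yi in zip(x, y):
        k *= morlet((xi - yi) / a)
    return k


def gram_matrix(docs, a=1.0):
    """Pairwise kernel values for a list of document vectors."""
    return [[wavelet_kernel(p, q, a) for q in docs] for p in docs]


# Toy document vectors, invented for illustration only.
toy_docs = [[0.0, 1.0, 0.5], [0.2, 0.8, 0.5], [1.0, 0.0, 0.0]]
toy_gram = gram_matrix(toy_docs, a=2.0)
```

A Gram matrix of this kind could be handed to any SVM implementation that accepts precomputed kernels, for instance scikit-learn's `SVC(kernel="precomputed")`. Whether this matches the kernel construction in the article is an assumption.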
Issue Title Focused Issue on Persistent Archives
Author Darányi, Sándor
Dobreva, Milena
Wittek, Peter
Author_xml – sequence: 1
  givenname: Sándor
  surname: Darányi
  fullname: Darányi, Sándor
  email: sandor.daranyi@hb.se
  organization: Swedish School of Library and Information Science, University of Borås
– sequence: 2
  givenname: Peter
  surname: Wittek
  fullname: Wittek, Peter
  organization: Swedish School of Library and Information Science, University of Borås
– sequence: 3
  givenname: Milena
  surname: Dobreva
  fullname: Dobreva, Milena
  organization: Centre for Digital Library Research, University of Strathclyde
ContentType Journal Article
Copyright Springer-Verlag 2012
Copyright_xml – notice: Springer-Verlag 2012
DOI 10.1007/s00799-012-0079-y
Discipline Library & Information Science
Computer Science
EISSN 1432-1300
EndPage 12
Genre Feature
ISSN 1432-5012
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Support vector machines
Wavelet analysis
Text categorization
Digital libraries
Analogical information representation
Machine learning
Language English
License http://www.springer.com/tdm
OpenAccessLink http://urn.kb.se/resolve?urn=urn:nbn:se:hb:diva-3241
PageCount 10
PublicationDate 20120700
2012-7-00
20120701
PublicationDateYYYYMMDD 2012-07-01
PublicationDate_xml – month: 7
  year: 2012
  text: 20120700
PublicationDecade 2010
PublicationPlace Berlin/Heidelberg
PublicationPlace_xml – name: Berlin/Heidelberg
– name: Heidelberg
PublicationTitle International journal on digital libraries
PublicationTitleAbbrev Int J Digit Libr
PublicationYear 2012
Publisher Springer-Verlag
Springer Nature B.V
Publisher_xml – name: Springer-Verlag
– name: Springer Nature B.V
– reference: DattaR.JoshiD.LiJ.WangJ.Image retrieval: ideas, influences, and trends of the new ageACM Comput. Surv.200840216010.1145/1348246.1348248
– reference: FellbaumC.WordNet: An Electronic Lexical Database1998CambridgeMIT Press0913.68054
– reference: Xia, Z., Dong, Y., Xing, G.: Support vector machines for collaborative filtering. In: Proceedings of ACMSE-06, 44th annual southeast regional conference, pp. 169–174. Melbourne, FL, USA (2006)
– reference: Mavroeidis, D., Tsatsaronis, G., Vazirgiannis, M., Theobald, M., Weikum, G.: Word sense disambiguation for exploiting hierarchical thesauri in text classification. In: Proceedings of PKDD-05, 9th European conference on the principles of data mining and knowledge discovery, pp. 181–192. Porto, Portugal (2005)
– reference: Wetzler, P., Bethard, S., Butcher, K., Martin, J., Sumner, T.: Automatically assessing resource quality for educational digital libraries. In: Proceedings of WICOW-09, 3rd workshop on information credibility on the web, pp. 3–10. Madrid, Spain (2009)
– reference: de Carvalho, M., Gonçalves, M., Laender, A., da Silva, A.: Learning to deduplicate. In: Proceedings of JCDL-06, 6th ACM/IEEE-CS joint conference on digital libraries, pp. 41–50. Chapel Hill, NC, USA (2006)
– reference: Mohammad, S., Hirst, G.: Distributional measures as proxies for semantic relatedness (2005, submitted)
– reference: Pant, G., Tsioutsiouliklis, K., Johnson, J., Giles, C.: Panorama: extending digital libraries with topical crawlers. In: Proceedings of JCDL-04, 4th ACM/IEEE-CS joint conference on digital libraries, pp. 142–150. Tucson, AZ, USA (2004)
– reference: Paynter, G.: Developing practical automatic metadata assignment and evaluation tools for internet resources. In: Proceedings of JCDL-05, 5th ACM/IEEE-CS joint conference on digital libraries, pp. 291–300. Denver, CO, USA (2005)
– reference: WangJ.An extensive study on automated Dewey decimal classificationJ. Am. Soc. Inf. Sci. Technol.2009601122692286
– reference: Agirre, E., De Lacalle, O.: Clustering WordNet word senses. In: Proceedings of RANLP-03, 4th international conference on recent advances in natural language processing, pp. 121–130. Borovets, Bulgaria (2003)
– reference: RamseyM.ChenH.ZhuB.SchatzB.A collection of visual thesauri for browsing large collections of geographic imagesJ. Am. Soc. Inf. Sci.199950982683410.1002/(SICI)1097-4571(1999)50:9<826::AID-ASI11>3.0.CO;2-H
– reference: Moore, R., Rajasekar, A., Baru, C., Ludaescher, B., Gupta, A., Marciano, R.: Persistent archives. US Patent 6,963,875 (2005)
– reference: Shawe-TaylorJ.CristianiniN.Kernel Methods for Pattern Analysis2004New YorkCambridge University Press10.1017/CBO9780511809682
– reference: Cormen, T., Leiserson, C., Rivest, R.: Introduction to algorithms. MIT Press, Cambridge (2001)
– reference: DeerwesterS.DumaisS.FurnasG.LandauerT.HarshmanR.Indexing by latent semantic analysisJ. Am. Soc. Inf. Sci.199041639140710.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9
– reference: Efron, M., Elsas, J., Marchionini, G., Zhang, J.: Machine learning for information architecture in a large governmental web site. In: Proceedings of JCDL-04, 4th ACM/IEEE-CS joint conference on digital libraries, pp. 151–159. Tucson, AZ, USA (2004)
– reference: Gabrilovich, E., Markovitch, S.: Feature generation for text categorization using world knowledge. In: Proceedings of IJCAI-05, 19th international joint conference on artificial intelligence, vol. 19. Edinburgh, UK (2005)
– reference: Li, T., Ogihara, M., Li, Q.: A comparative study on content-based music genre classification. In: Proceedings of SIGIR-03, 26th international conference on research and development in information retrieval, pp. 282–289. Toronto, ON, Canada (2003)
– reference: BudanitskyA.HirstG.Evaluating WordNet-based measures of lexical semantic relatednessComput. Linguist.200632113471234.6839910.1162/coli.2006.32.1.13
– reference: WangJ.WiederholdG.FirscheinO.Xin WeiS.Content-based image indexing and searching using Daubechies’ waveletsInt. J. Digit. Libr.19981431132810.1007/s007990050026
– reference: Wilson, B.: A special issue on digital library evolution. D-Lib Mag. 12(3), 56 (2006)
– reference: Chang, C.C., Lin, C.J.: LIBSVM: A library for support vector machines. http://www.csie.ntu.edu.tw/~cjlin/libsvm (2001)
– reference: PurcellG.RennelsG.ShortliffeE.Development and evaluation of a context-based document representation for searching the medical literatureInt. J. Digit. Libr.19971328829610.1007/s007990050023
– reference: HagedornK.ChapmanS.NewmanD.Enhancing search and browse using automated clustering of subject metadataD-Lib Mag.2007137/810829873
– reference: EspositoF.MalerbaD.SemeraroG.FanizziN.FerilliS.Adding machine learning and knowledge intensive techniques to a digital library serviceInt. J. Digit. Libr.199821319
– reference: FuhrN.TsakonasG.AalbergT.AgostiM.HansenP.KapidakisS.KlasC.KovácsL.LandoniM.MicsikA.Evaluation of digital librariesInt. J. Digit. Libr.200781213810.1007/s00799-007-0011-z
– reference: SebastianiF.ZanasiA.Text categorizationText Mining and its Applications2005SouthamptonWIT Press109129
– reference: ManningC.SchützeH.Foundations of Statistical Natural Language Processing1999CambridgeMIT Press0951.68158
– reference: Joachims, T.: Text categorization with support vector machines: Learning with many relevant features. In: Proceedings of ECML-98, 10th European conference on machine learning, pp. 137–142. Chemnitz, Germany (1998)
Issue Title Focused Issue on Persistent Archives
StartPage 3
SubjectTerms Artificial intelligence
Classification
Computer Science
Database Management
Digital libraries
Information Systems and Communication Service
Library collections
Wavelet transforms
Title Using wavelet analysis for text categorization in digital libraries: a first experiment with Strathprints
URI https://link.springer.com/article/10.1007/s00799-012-0079-y
https://www.proquest.com/docview/1026719589
https://www.proquest.com/docview/1494329680
Volume 12