A first estimation of the proportion of cybercriminal entities in the bitcoin ecosystem using supervised machine learning

Bitcoin, a peer-to-peer payment system and digital currency, is often involved in illicit activities such as scamming, ransomware attacks, illegal goods trading, and thievery. At the time of writing, the Bitcoin ecosystem has not yet been mapped and as such there is no estimate of the share of illic...

Full description

Saved in:
Bibliographic Details
Published in2017 IEEE International Conference on Big Data (Big Data) pp. 3690 - 3699
Main Authors Haohua Sun Yin, Vatrapu, Ravi
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2017
Subjects
Online AccessGet full text
DOI10.1109/BigData.2017.8258365

Cover

Loading…
Abstract Bitcoin, a peer-to-peer payment system and digital currency, is often involved in illicit activities such as scamming, ransomware attacks, illegal goods trading, and thievery. At the time of writing, the Bitcoin ecosystem has not yet been mapped and as such there is no estimate of the share of illicit activities. This paper provides the first estimation of the portion of cyber-criminal entities in the Bitcoin ecosystem. Our dataset consists of 854 observations categorised into 12 classes (out of which 5 are cybercrime-related) and a total of 100,000 uncategorised observations. The dataset was obtained from the data provider who applied three types of clustering of Bitcoin transactions to categorise entities: co-spend, intelligence-based, and behaviour-based. Thirteen supervised learning classifiers were then tested, of which four prevailed with a cross-validation accuracy of 77.38%, 76.47%, 78.46%, 80.76% respectively. From the top four classifiers, Bagging and Gradient Boosting classifiers were selected based on their weighted average and per class precision on the cybercrime-related categories. Both models were used to classify 100,000 uncategorised entities, showing that the share of cybercrime-related is 29.81% according to Bagging, and 10.95% according to Gradient Boosting with number of entities as the metric. With regard to the number of addresses and current coins held by this type of entities, the results are: 5.79% and 10.02% according to Bagging; and 3.16% and 1.45% according to Gradient Boosting.
AbstractList Bitcoin, a peer-to-peer payment system and digital currency, is often involved in illicit activities such as scamming, ransomware attacks, illegal goods trading, and thievery. At the time of writing, the Bitcoin ecosystem has not yet been mapped and as such there is no estimate of the share of illicit activities. This paper provides the first estimation of the portion of cyber-criminal entities in the Bitcoin ecosystem. Our dataset consists of 854 observations categorised into 12 classes (out of which 5 are cybercrime-related) and a total of 100,000 uncategorised observations. The dataset was obtained from the data provider who applied three types of clustering of Bitcoin transactions to categorise entities: co-spend, intelligence-based, and behaviour-based. Thirteen supervised learning classifiers were then tested, of which four prevailed with a cross-validation accuracy of 77.38%, 76.47%, 78.46%, 80.76% respectively. From the top four classifiers, Bagging and Gradient Boosting classifiers were selected based on their weighted average and per class precision on the cybercrime-related categories. Both models were used to classify 100,000 uncategorised entities, showing that the share of cybercrime-related is 29.81% according to Bagging, and 10.95% according to Gradient Boosting with number of entities as the metric. With regard to the number of addresses and current coins held by this type of entities, the results are: 5.79% and 10.02% according to Bagging; and 3.16% and 1.45% according to Gradient Boosting.
Author Haohua Sun Yin
Vatrapu, Ravi
Author_xml – sequence: 1
  surname: Haohua Sun Yin
  fullname: Haohua Sun Yin
  email: awasunyin@gmail.com
  organization: Centre for Bus. Data Analytics, Copenhagen Bus. Sch., Copenhagen, Denmark
– sequence: 2
  givenname: Ravi
  surname: Vatrapu
  fullname: Vatrapu, Ravi
  email: vatrapu@cbs.dk
  organization: Centre for Bus. Data Analytics, Copenhagen Bus. Sch., Copenhagen, Denmark
BookMark eNo1UMlOAzEUCxIcoPQL4JAfaMkymeVYyipV4tJ79ZJ5aZ_USUZJijR_TwXlZMu2LNl37DrEgIw9SrGUUnRPz7R_gQJLJWSzbJVpdW2u2LxrWml0W6tGGnHLphX3lHLhmAsNUCgGHj0vB-RjimNM_4qbLCaXaKAAR46hUCHMnMJv1lJx8czRxTzlggM_ZQp7nk8jpm_K2PMB3IEC8iNCCmfvnt14OGacX3DGtm-v2_XHYvP1_rlebRbUibJwwkgJ3oK1wvWV6n3nlbFKmKptXe2NcyhqbWulAHQLPfSiE67CqtdW2ErP2MNfLSHibjwPgDTtLn_oH21BXyE
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/BigData.2017.8258365
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781538627150
1538627159
EndPage 3699
ExternalDocumentID 8258365
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i90t-c0511afbabb0cd42df9f25b205488c6f5cce063b622aa38adad090c4e4d3b0b43
IEDL.DBID RIE
IngestDate Thu Jun 29 18:36:31 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i90t-c0511afbabb0cd42df9f25b205488c6f5cce063b622aa38adad090c4e4d3b0b43
PageCount 10
ParticipantIDs ieee_primary_8258365
PublicationCentury 2000
PublicationDate 2017-Dec.
PublicationDateYYYYMMDD 2017-12-01
PublicationDate_xml – month: 12
  year: 2017
  text: 2017-Dec.
PublicationDecade 2010
PublicationTitle 2017 IEEE International Conference on Big Data (Big Data)
PublicationTitleAbbrev BigData
PublicationYear 2017
Publisher IEEE
Publisher_xml – name: IEEE
Score 2.0264893
Snippet Bitcoin, a peer-to-peer payment system and digital currency, is often involved in illicit activities such as scamming, ransomware attacks, illegal goods...
SourceID ieee
SourceType Publisher
StartPage 3690
SubjectTerms Bitcoin
Blockchain
Cryptocurrency
Cybercrime
Ecosystem
Ecosystems
Machine Learning
Malware
Peer-to-peer computing
Public key
Ransomware
Supervised Learning
Title A first estimation of the proportion of cybercriminal entities in the bitcoin ecosystem using supervised machine learning
URI https://ieeexplore.ieee.org/document/8258365
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09a8MwEBVJpk5tSUq_0dCxdixbcuyxXyEUUjqkkC1IJymYUjsk9pD--p5sJ6WlQxcjhMBGh_ze2ffeEXJjGQclpUDmhhcOIPDMyZGH6BPZONVISpzeefoST97481zMO-R2r4UxxtTFZ8Z3w_pfvi6gcp_KhpjNJFEsuqSLiVuj1WrVcCxIh_fZ8lGWzkuIjfx26Y-eKTVkjA_JdHezplLk3a9K5cPnLx_G_z7NERl8i_Po6x52jknH5H2yvaM2QyJHnWlGo0akhaXI7ujK9UFY72Zgq8wa2l5edOeoSrO8XquyEgocY07aWDxTVxe_pJtq5d4pG6PpR119aWjbbmI5ILPx0-xh4rVdFbwsDUoP8BQyaZVUKgDNQ21TGwoVInVLEoitADBIW1QchlJGidQS4xUAN1xHKlA8OiG9vMjNKaGhiEAJKZlJY844S7VBTLQjBD0IIYjOSN_t2mLV-GYs2g07_3v6ghy4yDWlIpekV64rc4WAX6rrOtJfSDixxQ
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwELVKGWAC1CK-8cBI0nzYaTLyqQJtxVCkbpV9tqsI0VRtMpRfzzlJi0AMLJFlWYrlk_OenXvvCLkyPgMpBEfmhg8GwHHPia6D6BOaKFFISqzeeTCMem_seczHDXK90cJorcvkM-3aZvkvX2VQ2KuyDp5m4jDiW2QbcZ_7lVqr1sP5XtK5Taf3IrduQn7XrQf_qJpSgsbjHhmsX1fliry7RS5d-PzlxPjf-eyT9rc8j75ugOeANPSsRVY31KRI5ai1zaj0iDQzFPkdndtKCIt1D6ykXkBdzYuuPVVpOivHyjSHDNt4Kq1MnqnNjJ_SZTG3X5WlVvSjzL_UtC44MW2T0ePD6K7n1HUVnDTxcgdwH_rCSCGlB4oFyiQm4DJA8hbHEBkOoJG4yCgIhAhjoQRGzAOmmQqlJ1l4SJqzbKaPCA14CJIL4eskYj7zE6URFU0XYQ8C8MJj0rKrNplXzhmTesFO_u6-JDu90aA_6T8NX07Jro1ilThyRpr5otDnCP-5vCij_gW1S7UO
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2017+IEEE+International+Conference+on+Big+Data+%28Big+Data%29&rft.atitle=A+first+estimation+of+the+proportion+of+cybercriminal+entities+in+the+bitcoin+ecosystem+using+supervised+machine+learning&rft.au=Haohua+Sun+Yin&rft.au=Vatrapu%2C+Ravi&rft.date=2017-12-01&rft.pub=IEEE&rft.spage=3690&rft.epage=3699&rft_id=info:doi/10.1109%2FBigData.2017.8258365&rft.externalDocID=8258365