A first estimation of the proportion of cybercriminal entities in the bitcoin ecosystem using supervised machine learning
Bitcoin, a peer-to-peer payment system and digital currency, is often involved in illicit activities such as scamming, ransomware attacks, illegal goods trading, and thievery. At the time of writing, the Bitcoin ecosystem has not yet been mapped and as such there is no estimate of the share of illic...
Saved in:
Published in | 2017 IEEE International Conference on Big Data (Big Data) pp. 3690 - 3699 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.12.2017
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/BigData.2017.8258365 |
Cover
Loading…
Abstract | Bitcoin, a peer-to-peer payment system and digital currency, is often involved in illicit activities such as scamming, ransomware attacks, illegal goods trading, and thievery. At the time of writing, the Bitcoin ecosystem has not yet been mapped and as such there is no estimate of the share of illicit activities. This paper provides the first estimation of the portion of cyber-criminal entities in the Bitcoin ecosystem. Our dataset consists of 854 observations categorised into 12 classes (out of which 5 are cybercrime-related) and a total of 100,000 uncategorised observations. The dataset was obtained from the data provider who applied three types of clustering of Bitcoin transactions to categorise entities: co-spend, intelligence-based, and behaviour-based. Thirteen supervised learning classifiers were then tested, of which four prevailed with a cross-validation accuracy of 77.38%, 76.47%, 78.46%, 80.76% respectively. From the top four classifiers, Bagging and Gradient Boosting classifiers were selected based on their weighted average and per class precision on the cybercrime-related categories. Both models were used to classify 100,000 uncategorised entities, showing that the share of cybercrime-related is 29.81% according to Bagging, and 10.95% according to Gradient Boosting with number of entities as the metric. With regard to the number of addresses and current coins held by this type of entities, the results are: 5.79% and 10.02% according to Bagging; and 3.16% and 1.45% according to Gradient Boosting. |
---|---|
AbstractList | Bitcoin, a peer-to-peer payment system and digital currency, is often involved in illicit activities such as scamming, ransomware attacks, illegal goods trading, and thievery. At the time of writing, the Bitcoin ecosystem has not yet been mapped and as such there is no estimate of the share of illicit activities. This paper provides the first estimation of the portion of cyber-criminal entities in the Bitcoin ecosystem. Our dataset consists of 854 observations categorised into 12 classes (out of which 5 are cybercrime-related) and a total of 100,000 uncategorised observations. The dataset was obtained from the data provider who applied three types of clustering of Bitcoin transactions to categorise entities: co-spend, intelligence-based, and behaviour-based. Thirteen supervised learning classifiers were then tested, of which four prevailed with a cross-validation accuracy of 77.38%, 76.47%, 78.46%, 80.76% respectively. From the top four classifiers, Bagging and Gradient Boosting classifiers were selected based on their weighted average and per class precision on the cybercrime-related categories. Both models were used to classify 100,000 uncategorised entities, showing that the share of cybercrime-related is 29.81% according to Bagging, and 10.95% according to Gradient Boosting with number of entities as the metric. With regard to the number of addresses and current coins held by this type of entities, the results are: 5.79% and 10.02% according to Bagging; and 3.16% and 1.45% according to Gradient Boosting. |
Author | Haohua Sun Yin Vatrapu, Ravi |
Author_xml | – sequence: 1 surname: Haohua Sun Yin fullname: Haohua Sun Yin email: awasunyin@gmail.com organization: Centre for Bus. Data Analytics, Copenhagen Bus. Sch., Copenhagen, Denmark – sequence: 2 givenname: Ravi surname: Vatrapu fullname: Vatrapu, Ravi email: vatrapu@cbs.dk organization: Centre for Bus. Data Analytics, Copenhagen Bus. Sch., Copenhagen, Denmark |
BookMark | eNo1UMlOAzEUCxIcoPQL4JAfaMkymeVYyipV4tJ79ZJ5aZ_USUZJijR_TwXlZMu2LNl37DrEgIw9SrGUUnRPz7R_gQJLJWSzbJVpdW2u2LxrWml0W6tGGnHLphX3lHLhmAsNUCgGHj0vB-RjimNM_4qbLCaXaKAAR46hUCHMnMJv1lJx8czRxTzlggM_ZQp7nk8jpm_K2PMB3IEC8iNCCmfvnt14OGacX3DGtm-v2_XHYvP1_rlebRbUibJwwkgJ3oK1wvWV6n3nlbFKmKptXe2NcyhqbWulAHQLPfSiE67CqtdW2ErP2MNfLSHibjwPgDTtLn_oH21BXyE |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/BigData.2017.8258365 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9781538627150 1538627159 |
EndPage | 3699 |
ExternalDocumentID | 8258365 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i90t-c0511afbabb0cd42df9f25b205488c6f5cce063b622aa38adad090c4e4d3b0b43 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:36:31 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i90t-c0511afbabb0cd42df9f25b205488c6f5cce063b622aa38adad090c4e4d3b0b43 |
PageCount | 10 |
ParticipantIDs | ieee_primary_8258365 |
PublicationCentury | 2000 |
PublicationDate | 2017-Dec. |
PublicationDateYYYYMMDD | 2017-12-01 |
PublicationDate_xml | – month: 12 year: 2017 text: 2017-Dec. |
PublicationDecade | 2010 |
PublicationTitle | 2017 IEEE International Conference on Big Data (Big Data) |
PublicationTitleAbbrev | BigData |
PublicationYear | 2017 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 2.0264893 |
Snippet | Bitcoin, a peer-to-peer payment system and digital currency, is often involved in illicit activities such as scamming, ransomware attacks, illegal goods... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 3690 |
SubjectTerms | Bitcoin Blockchain Cryptocurrency Cybercrime Ecosystem Ecosystems Machine Learning Malware Peer-to-peer computing Public key Ransomware Supervised Learning |
Title | A first estimation of the proportion of cybercriminal entities in the bitcoin ecosystem using supervised machine learning |
URI | https://ieeexplore.ieee.org/document/8258365 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09a8MwEBVJpk5tSUq_0dCxdixbcuyxXyEUUjqkkC1IJymYUjsk9pD--p5sJ6WlQxcjhMBGh_ze2ffeEXJjGQclpUDmhhcOIPDMyZGH6BPZONVISpzeefoST97481zMO-R2r4UxxtTFZ8Z3w_pfvi6gcp_KhpjNJFEsuqSLiVuj1WrVcCxIh_fZ8lGWzkuIjfx26Y-eKTVkjA_JdHezplLk3a9K5cPnLx_G_z7NERl8i_Po6x52jknH5H2yvaM2QyJHnWlGo0akhaXI7ujK9UFY72Zgq8wa2l5edOeoSrO8XquyEgocY07aWDxTVxe_pJtq5d4pG6PpR119aWjbbmI5ILPx0-xh4rVdFbwsDUoP8BQyaZVUKgDNQ21TGwoVInVLEoitADBIW1QchlJGidQS4xUAN1xHKlA8OiG9vMjNKaGhiEAJKZlJY844S7VBTLQjBD0IIYjOSN_t2mLV-GYs2g07_3v6ghy4yDWlIpekV64rc4WAX6rrOtJfSDixxQ |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwELVKGWAC1CK-8cBI0nzYaTLyqQJtxVCkbpV9tqsI0VRtMpRfzzlJi0AMLJFlWYrlk_OenXvvCLkyPgMpBEfmhg8GwHHPia6D6BOaKFFISqzeeTCMem_seczHDXK90cJorcvkM-3aZvkvX2VQ2KuyDp5m4jDiW2QbcZ_7lVqr1sP5XtK5Taf3IrduQn7XrQf_qJpSgsbjHhmsX1fliry7RS5d-PzlxPjf-eyT9rc8j75ugOeANPSsRVY31KRI5ai1zaj0iDQzFPkdndtKCIt1D6ykXkBdzYuuPVVpOivHyjSHDNt4Kq1MnqnNjJ_SZTG3X5WlVvSjzL_UtC44MW2T0ePD6K7n1HUVnDTxcgdwH_rCSCGlB4oFyiQm4DJA8hbHEBkOoJG4yCgIhAhjoQRGzAOmmQqlJ1l4SJqzbKaPCA14CJIL4eskYj7zE6URFU0XYQ8C8MJj0rKrNplXzhmTesFO_u6-JDu90aA_6T8NX07Jro1ilThyRpr5otDnCP-5vCij_gW1S7UO |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2017+IEEE+International+Conference+on+Big+Data+%28Big+Data%29&rft.atitle=A+first+estimation+of+the+proportion+of+cybercriminal+entities+in+the+bitcoin+ecosystem+using+supervised+machine+learning&rft.au=Haohua+Sun+Yin&rft.au=Vatrapu%2C+Ravi&rft.date=2017-12-01&rft.pub=IEEE&rft.spage=3690&rft.epage=3699&rft_id=info:doi/10.1109%2FBigData.2017.8258365&rft.externalDocID=8258365 |