MFGSCOPE: A Lightweight Framework for Efficient Graph-based Analysis on Blockchain

Bibliographic Details
Published in: IEEE Transactions on Dependable and Secure Computing, pp. 1-16
Main Authors: Hu, Yufeng; Sun, Yingshi; Chen, Yuan; Chen, Zhuo; He, Bowen; Wu, Lei; Zhou, Yajin; Chang, Rui
Format: Journal Article
Language: English
Published: IEEE, 18.07.2024
Summary: With the prosperity of the blockchain and the DeFi ecosystem, money flow activities in the blockchains are becoming increasingly frequent, complex, and diverse. The Money Flow Graph (MFG) serves as the foundation for various behavioral analysis, malicious activity detection, and money flow tracing tasks. However, traditional graph databases face the issue of storage requirement and performance when analyzing large-scale MFGs. In this work, we present MFGSCOPE, a lightweight domain-specific framework designed for graph-based analysis on EVM-compatible blockchains, with extensive optimizations for storage efficiency and query performance. The prototype of MFGSCOPE for the Ethereum network achieves the storage of over 3 billion transfers and 1.7 billion relevant transactions in a single instance with less than 450 GB of disk usage. The evaluation shows that for common tasks, MFGSCOPE is more than 30 times faster and requires 78% less storage space than the commonly used graph database Neo4j. For the applications of MFGSCOPE, we present several use cases based on the MFG which cannot be performed efficiently using traditional graph databases and report interesting findings. To engage the community, the prototype of MFGSCOPE for the Ethereum blockchain with the complete dataset will be open source.
ISSN: 1545-5971, 1941-0018
DOI: 10.1109/TDSC.2024.3431011