AIセキュリティポータル K Program
Blockchain Large Language Models
Share
Abstract
This paper presents a dynamic, real-time approach to detecting anomalous blockchain transactions. The proposed tool, BlockGPT, generates tracing representations of blockchain activity and trains from scratch a large language model to act as a real-time Intrusion Detection System. Unlike traditional methods, BlockGPT is designed to offer an unrestricted search space and does not rely on predefined rules or patterns, enabling it to detect a broader range of anomalies. We demonstrate the effectiveness of BlockGPT through its use as an anomaly detection tool for Ethereum transactions. In our experiments, it effectively identifies abnormal transactions among a dataset of 68M transactions and has a batched throughput of 2284 transactions per second on average. Our results show that, BlockGPT identifies abnormal transactions by ranking 49 out of 124 attacks among the top-3 most abnormal transactions interacting with their victim contracts. This work makes contributions to the field of blockchain transaction analysis by introducing a custom data encoding compatible with the transformer architecture, a domain-specific tokenization technique, and a tree encoding method specifically crafted for the Ethereum Virtual Machine (EVM) trace representation.
Bitcoin: A peer-to-peer electronic cash system
S. Nakamoto
Published: 2008
Do you need a blockchain?
K. Wust, A. Gervais
Published: 2018
Blockchain without waste: Proof-of-stake
F. Saleh
Published: 2021
Sok: Consensus in the age of blockchains
S. Bano, A. Sonnino, M. Al-Bassam, S. Azouvi, P. McCorry, S. Meiklejohn, G. Danezis
Published: 2019
Quantifying blockchain extractable value: How dark is the forest?
K. Qin, L. Zhou, A. Gervais
Published: 2022
Sok: Research perspectives and challenges for bitcoin and cryptocurrencies
J. Bonneau, A. Miller, J. Clark, A. Narayanan, J. A. Kroll, E. W. Felten
Published: 2015
A survey of attacks on ethereum smart contracts (sok)
N. Atzei, M. Bartoletti, T. Cimoli
Published: 2017
Attacking the defi ecosystem with flash loans for fun and profit
K. Qin, L. Zhou, B. Livshits, A. Gervais
Published: 2021
High-frequency trading on decentralized on-chain exchanges
L. Zhou, K. Qin, C. F. Torres, D. V. Le, A. Gervais
Published: 2021
Scalatrace: Scalable compression and replay of communication traces for high-performance computing
M. Noeth, P. Ratn, F. Mueller, M. Schulz, B. R. De Supinski
Published: 2009
Novel positional encodings to enable tree-based transformers
V. Shiv, C. Quirk
Published: 2019
Attention is all you need
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, I. Polosukhin
Published: 2017
Automatic differentiation in pytorch
Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, Adam Lerer
Published: 2017
Decoupled weight decay regularization
Ilya Loshchilov, Frank Hutter
Published: 2018
Communication efficient distributed machine learning with the parameter server
M. Li, D. G. Andersen, A. J. Smola, K. Yu
Published: 2014
The blockchain imitation game
K. Qin, S. Chaliasos, L. Zhou, B. Livshits, D. Song, A. Gervais
Published: 2023
Sereum: Protecting existing smart contracts against re-entrancy attacks
M. Rodler, W. Li, G. O. Karame, L. Davi
Published: 2019
Txspector: Uncovering attacks in ethereum from transactions
M. Zhang, X. Zhang, Y. Zhang, Z. Lin
Published: 2020
Confuzzius: A data dependency-aware hybrid fuzzer for smart contracts
C. Ferreira Torres, A. K. Iannillo, A. Gervais
Published: 2021
Oracle-supported dynamic exploit generation for smart contracts
H. Wang, Y. Liu, Y. Li, S.-W. Lin, C. Artho, L. Ma, Y. Liu
Published: 2020
Echidna: effective, usable, and fast fuzzing for smart contracts
G. Grieco, W. Song, A. Cygan, J. Feist, A. Groce
Published: 2020
Harvey: A greybox fuzzer for smart contracts
V. Wustholz, M. Christakis
Published: 2020
Learning to fuzz from symbolic execution with application to smart contracts
J. He, M. Balunovic, N. Ambroladze, P. Tsankov, M. Vechev
Published: 2019
sfuzz: An efficient adaptive fuzzer for solidity smart contracts
T. D. Nguyen, L. H. Pham, J. Sun, Y. Lin, Q. T. Minh
Published: 2020
Ethir: A framework for high-level analysis of ethereum bytecode
E. Albert, P. Gordillo, B. Livshits, A. Rubio, I. Sergey
Published: 2018
Finding the greedy, prodigal, and suicidal contracts at scale
I. Nikolic, A. Kolluri, I. Sergey, P. Saxena, A. Hobor
Published: 2018
Manticore: A user-friendly symbolic execution framework for binaries and smart contracts
M. Mossberg, F. Manzano, E. Hennenfent, A. Groce, G. Grieco, J. Feist, T. Brunson, A. Dinaburg
Published: 2019
Detecting nondeterministic payment bugs in ethereum smart contracts
S. Wang, C. Zhang, Z. Su
Published: 2019
Osiris: Hunting for integer bugs in ethereum smart contracts
C. F. Torres, J. Schutte, R. State
Published: 2018
Making smart contracts smarter
L. Luu, D.-H. Chu, H. Olickel, P. Saxena, A. Hobor
Published: 2016
Sailfish: Vetting smart contract state-inconsistency bugs in seconds
P. Bose, D. Das, Y. Chen, Y. Feng, C. Kruegel, G. Vigna
Published: 2022
Sguard: Smart contracts made vulnerability-free
T. D. Nguyen, L. H. Pham, J. Sun
Published: 2021
Smartest: Effectively hunting vulnerable transaction sequences in smart contracts through language model-guided symbolic execution
S. So, S. Hong, H. Oh
Published: 2021
etainter: Detecting gas-related vulnerabilities in smart contracts
A. Ghaleb, J. Rubin, K. Pattabiraman
Published: 2022
Ethainter: A smart contract security analyzer for composite vulnerabilities
L. Brent, N. Grech, S. Lagouvardos, B. Scholz, Y. Smaragdakis
Published: 2020
Securify: Practical security analysis of smart contracts
P. Tsankov, A. Dan, D. Drachsler-Cohen, A. Gervais, F. Buenzli, M. Vechev
Published: 2018
Ethersolve: Computing an accurate control-flow graph from ethereum bytecode
F. Contro, M. Crosara, M. Ceccato, M. Dalla Preda
Published: 2021
ethor: Practical and provably sound static analysis of ethereum smart contracts
C. Schneidewind, I. Grishchenko, M. Scherer, M. Maffei
Published: 2020
Madmax: Surviving out-of-gas conditions in ethereum smart contracts
N. Grech, M. Kong, A. Jurisevic, L. Brent, B. Scholz, Y. Smaragdakis
Published: 2018
Slither: a static analysis framework for smart contracts
J. Feist, G. Grieco, A. Groce
Published: 2019
Smartcheck: Static analysis of ethereum smart contracts
S. Tikhomirov, E. Voskresenskaya, I. Ivanitskiy, R. Takhaviev, E. Marchenko, Y. Alexandrov
Published: 2018
Monitoring smart contracts: Contractlarva and open challenges beyond
S. Azzopardi, J. Ellul, G. J. Pace
Published: 2018
{ETHBMC}: A bounded model checker for smart contracts
J. Frank, C. Aschermann, T. Holz
Published: 2020
Ethertrust: Sound static analysis of ethereum bytecode
I. Grishchenko, M. Maffei, C. Schneidewind
Published: 2018
Dynamic vulnerability detection on smart contracts using machine learning
M. Eshghie, C. Artho, D. Gurov
Published: 2021
Deep contextualized word representations
Matthew E. Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, Luke Zettlemoyer
Published: 2018
Bert: Pre-training of deep bidirectional transformers for language understanding
Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova
Published: 2019
Information retrieval 2nd ed buttersworth
C. Rijsbergen
Published: 1979
A framework for the evaluation of intrusion detection systems
A. A. Cardenas, J. S. Baras, K. Seamon
Published: 2006
Measuring intrusion detection capability: An information-theoretic approach
G. Gu, P. Fogla, D. Dagon, W. Lee, B. Skoric
Published: 2006
Distributed representations of words and phrases and their compositionality
T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, J. Dean
Published: 2013
Distributed representations of sentences and documents
Le, Q. V., Mikolov, T.
Published: 2014
The bayesian information criterion: background, derivation, and applications
A. A. Neath, J. E. Cavanaugh
Published: 2012
Share