A curated list of awesome smart contract datasets.
Source codes, vulnerabilities, and more.
-
solidity-dataset - This dataset is collected from public GitHub repositories written in Solidity programming language.
-
smart-contract-fiesta - Zellic is making publicly available a dataset of known Ethereum mainnet smart contract source code.
-
smartbugs-wild - This repository contains 47,398 smart contracts extracted from the Ethereum network.
-
DISL - 514506 unique Solidity files that have been deployed to Ethereum mainnet.
-
HuangGai - HuangGai is an Ethereum smart contract bug injection framework, it can inject 20 types of bugs into Solidity smart contract.
-
JiuZhou - JiuZhou is a data set of Ethereum bug smart contracts.
-
ScrawlD - A Dataset of Real World Ethereum Smart Contracts Labelled with Vulnerabilities.
-
Smart-Contract-Benchmark-Suites - Empirical Evaluation of Smart Contract Testing: What Is the Best Choice?
-
Smart-Contract-Dataset - Datasets for evaluating smart contract security analysis tools.
-
VeriSmart-benchmarks - VeriSmart is a safety analyzer for Ethereum smart contracts written in Solidity. This repository contains dataset that we used for experiments in our paper.
-
DAppSCAN - Building Large-Scale Datasets for Smart Contract Weaknesses in DApp Projects.
-
smartbugs-curated - SB Curated: A Curated Dataset of Vulnerable Solidity Smart Contracts.
-
SmartScan-Dataset - This repositoray presents datasets and empirical analysis results that use code clone detection techniques for identifying vulnerabilities and their variations in smart contracts.
-
ReentrancyStudy-Data - ReentrancyStudy-Data is a large-scale dataset of reentrancy vulnerability in smart contracts, which is built from 230,548 open-source smart contracts from Etherscan.
-
AFT2020/dataset - (UNVERIFIED) This repository contains a smart contract vulnerability dataset that is used in machine learning experiments.
-
Web3Bugs - This project aims to provide a valuable resource for Web3 developers and security analysts by facilitating their understanding of exploitable bugs in smart contracts.
-
smart-contract-vulndb - An open dataset of publicly available smart contract issues aggregated from various audit reports.
-
Reviews - A Review of Smart Contract Vulnerability Datasets.
-
solidity-defi-vulnerabilities - It contains data related to various decentralized finance (DeFi) attack scenarios and vulnerabilities, including information about attack explanations, test cases, dates, token counts, interfaces, lost values, attack titles, and contract paths.
-
slither-audited-smart-contracts - This dataset contains source code and deployed bytecode for Solidity Smart Contracts that have been verified on Etherscan.io, along with a classification of their vulnerabilities according to the Slither static analysis framework.
-
SolidiFI-benchmark - Repository of benchmarks to evaluate Solidity Smart contract analysis tools.
-
A_benchmark_dataset_of_Solidity_smart_contracts - A benchmark dataset contains 4,364 real-world Solidity smart contracts, which are manually labeled with ten types of vulnerabilities.
-
Vulnerable_Verified_Smart_Contracts - Dataset of real vulnerable Ethereum smart contracts.
-
DeFiHackLabs - Reproduce DeFi hack incidents using Foundry. Real world DeFi exploit POCs and vulnerable codes (Inside the POC as a comment).
-
smart-contract-vulnerability-dataset - Tagged dataset for Time Manipulation Vulnerability in Ethereum Smart Contracts.
Who else should we be following!?
Contributions of any kind welcome, just follow the guidelines!