-
Notifications
You must be signed in to change notification settings - Fork 3
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
9 changed files
with
2,670 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,74 @@ | ||
# Monero Nonstandard Fees | ||
|
||
Monero has four standard fee levels, but not every Monero wallet implementation creates transactions that pay one of the standard fee levels. Wallets that pay nonstandard fees can reduce their users' privacy. See [Rucknium (2023) "Discussion Note: Formula for Accuracy of Guessing Monero Real Spends Using Fungibility Defects"](https://github.com/Rucknium/misc-research/tree/main/Monero-Fungibility-Defect-Classifier/pdf) for more explanation. | ||
|
||
The data in this directory can help identify which wallet implementations may be creating transactions with nonstandard fees. The developers of the wallets can be contacted and asked to fix their implementations. | ||
|
||
## Identification of nonstandard fees in Monero | ||
|
||
The tabulation of nonstandard fees will use "nanonero" as the basic unit. A [nanonero](https://web.getmonero.org/resources/moneropedia/denominations.html) is 0.000000001 XMR. In other words, it is 1/1000th of the smallest Monero unit, the piconero. In the tables in the `data ` directory, fee per byte is rounded down to the nearest integer nanonero. The complete operation is `floor( (tx_fee/tx_size_bytes)/1000 )`. | ||
|
||
Except when the dynamic block/fee algorithm is raising block size and fees, a `get_fee_estimate` RPC call to `monerod` will return these four values for nanoneros per byte: 20, 80, 320, 4000. These four levels are supposed to give transactions different priorities: "slow, normal, fast, fastest". However, since Monero blocks are usually not full, paying a higher fee usually does not mean that a transaction will be confirmed in a block any faster than a lower fee unless mining pool operators update their block templates more frequently when they receive transactions with higher fees. Any fees outside of these four values are considered nonstandard. | ||
|
||
The `get_fee_estimate` RPC call also returns a suggestion for a "quantization mask" of 10 nanoneros. Monero transactions are supposed to round up the total fee (not fee per byte) of a transaction to 10 nanoneros of precision. Transactions do not have to follow the suggestion because is not required by blockchain consensus rules. Only 880 of the 7.9 million Monero transactions since the August 2022 hard fork do not follow the quantization mask suggestion. These transactions will not be separately tabulated because they are so rare. | ||
|
||
## Determining which wallets are creating transactions with nonstandard fees | ||
|
||
The timing of transactions can help form hypotheses about which wallets may be using nonstandard fees. Many wallets ceased functioning when the Monero upgraded its privacy features on August 14, 2022 with a hard fork (network upgrade). The developers of these wallets were not prepared for the required changes in transaction format: increase ring size from 11 to 16, Bulletproofs+ for smaller transaction sizes, view tags for faster wallet syncing, etc. | ||
|
||
According to my research, at least eight wallets did not upgrade in time for the hard fork. Monerujo, Feather, Cake, and of course the GUI/CLI wallets did upgrade in time. Below is a table of wallets that were not ready for the hard fork and the wallets' probable date of return to operation. After the hard fork date, but before these wallets were fixed, almost all transactions on the blockchain had standard fee levels. After the wallets were fixed, the percentage of transactions with nonstandard fees increased to about 10%. The timing of nonstandard transactions appearing on the blockchain and the fix date of the wallets could be hints about which wallets are responsible for the nonstandard fees. | ||
|
||
| Wallet | Fix Date | Source | | ||
|----------|------------------------|:-------------------------------------------------------------------------------------------------------------------------:| | ||
| WooKey | 2022-08-19 | [■](https://github.com/WooKeyWallet/monero-wallet-android-app/releases/tag/v2.2.1) | | ||
| Exodus | 2022-08-25 | [■](https://twitter.com/exodus_io/status/1562918301181034496) | | ||
| Edge | 2022-08-27 | [■](https://twitter.com/EdgeWallet/status/1563584457361149952) | | ||
| MyMonero | 2022-08-29 | [■](https://twitter.com/MyMonero/status/1564149853478760448) | | ||
| Zelcore | Before 2022-11-17(?) | [■](https://twitter.com/zelcore_io/status/1593287952041357318) | | ||
| Guarda | 2023-02-07 | [■](https://web.archive.org/web/20230207112954/https://www.reddit.com/r/GuardaWallet/comments/10vgd90/monero_xmr_wallet/) | | ||
| Atomic | 2023-02-21 | [■](https://twitter.com/AtomicWallet/status/1628019129553657856) | | ||
| Coinomi | Still nonfunctional(?) | [■](https://reddit.com/r/COINOMI/comments/14u5gp3/does_coinomi_still_support_monero/) | | ||
|
||
Wallets accessible to ordinary users are not the only wallet implementations creating transactions on the Monero blockchain. Services like centralized exchanges also create transactions. Downtime/uptime of withdrawal capability of these services can provide clues about which set of transactions with nonstandard fees they may be creating. moneroj.net has a record of Binance withdrawal suspensions: https://moneroj.net/withdrawals/ | ||
|
||
A researcher could create transactions with nonstandard wallets to provide evidence that a specific wallet is responsible for certain types of nonstandard fees. The fees of any transaction can be viewed by inputting its transaction ID into a block explorer like https://xmrchain.net/ . Note that` xmrchain.net`'s definition of kB is 1024 bytes, not 1000 bytes. | ||
|
||
## Tabulated data | ||
|
||
In the `data` directory are tables of the number of transactions that use each fee level since the August 2022 hard fork. There are two versions of each table. One version tabulates by day. The other version tabulates by ISO week, which is a way to give numbers to weeks in a calendar that span every Monday to Sunday. The https://www.epochconverter.com/weeks web page has a table that converts ISO week numbers to dates. | ||
|
||
Only transactions that have exactly two outputs are included in the table. Most transactions have two outputs. Blockchain consensus rules require that transactions have at least two outputs. If the entire balance of inputs in a transaction are spent to one output, the other output has a XMR value of zero. Only wallets that enable a "pay-to-many" transaction type can create transactions with more than two outputs. The definition of "standard" fee is more complicated when the number of outputs in a transaction is greater than two (see section 7.3.2 Dynamic block weight of _Zero to Monero 2.0_), so they have been excluded from this version of the tables. | ||
|
||
I have identified five "clusters" of nonstandard fees. These are: | ||
1. 500-520 nanoneros per byte | ||
2. 98-109 nanoneros per byte | ||
3. 29-32 nanoneros per byte | ||
4. 240600, 342450, and 444300 nanoneros fee total (about 160 nanoneros/byte for txs with 1, 2, and 3 inputs) | ||
5. 31720000 and 45300000 nanoneros fee total | ||
|
||
In `fee-clusters-by-week.csv` and `fee-clusters-by-day.csv` these clusters are labeled in columns as `500_520_fee_per_byte`, `98_109_fee_per_byte`, `29_32_fee_per_byte`, `24_34_44_fee`, and `317_453_fee`, respectively. | ||
|
||
Using new draft research, [Rucknium (2023) "Discussion Note: Formula for Accuracy of Guessing Monero Real Spends Using Fungibility Defects"](https://github.com/Rucknium/misc-research/tree/main/Monero-Fungibility-Defect-Classifier/pdf), the risk to the privacy of users who are using wallets with these fee levels can be estimated. Rucknium (2023) develops a formula for the probability that a simple classifier can correctly guess the real spend in a ring when a ring is in a transaction with a specific nonstandard fee level. This value is the Positive Predictive Value (PPV). Higher PPV means higher privacy risk. Completely random guessing between a ring's 16 ring members would produce a PPV of 1/16 = 6.25%. Rucknium (2023) also provides formulas for the proportion of transaction outputs on the blockchain with a specific nonstandard fee (`beta`) and the probability that a ring's real spend contains a wallet's change output (`mu_C`). | ||
|
||
The PPV and `beta` can set priorities for which fee clusters to investigate first. PPV is a metric of the level of privacy risk to users who are using the wallets that create the nonstandard fees. `beta` is a rough estimate of the proportion of uses who are using each fee level. (The exact proportion of transactions with each nonstandard fee level can be checked directly in `fee-clusters-by-week.csv` and `fee-clusters-by-day.csv`.) Below is a table for the fee clusters estimated from the 8-week period July 31, 2023 to September 24, 2023. | ||
|
||
**Estimated PPV, beta, and mu_C, in percent** | ||
| fee | PPV | beta | mu_C | | ||
|------------------------|-------|------|-------| | ||
| `500_520_fee_per_byte` | 37.94 | 1.69 | 38.63 | | ||
| `98_109_fee_per_byte` | 19.97 | 5.09 | 21.67 | | ||
| `29_32_fee_per_byte` | 62.19 | 0.36 | 61.41 | | ||
| `24_34_44_fee` | 36.59 | 3.00 | 40.93 | | ||
| `317_453_fee` | 41.10 | 0.34 | 38.20 | | ||
|
||
Note: It is believed that the wallet implementation creating `24_34_44_fee` transactions has been identified and a fix is being developed. | ||
|
||
The files `raw-fee-counts-by-week.csv`, `raw-fee-counts-by-day.csv`, `raw-fee-counts-by-week-prevalence-sort.csv`, and `raw-fee-counts-by-day-prevalence-sort.csv` tabulate transactions by fee per byte without grouping them into clusters. Each fee level to the nanonero precision is tabulated. The `prevalence-sort` orders columns so that the more common fee levels are closest to the left side of the table. | ||
|
||
The `example-tx-ids-by-fee.csv` file lists ten random IDs for transactions that pay each level of fee per byte in the `raw-fee-counts` files. If fewer than 10 transactions pay a given fee level, then all the transaction IDs are listed. Transaction IDs for transactions that pay 240600, 342450, 444300, 31720000, and 45300000 nanoneros fee total are listed at the top of the table with the `_fee_tx_id` suffix. Transaction IDs can be searched in `xmrchain.net` to view examples of transactions that pay each fee level. | ||
|
||
## Code | ||
|
||
The code to reproduce the data is in the `code` directory. | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,222 @@ | ||
|
||
|
||
# install.packages("data.table") | ||
# install.packages("lubridate") | ||
|
||
# To get the xmr.rings and output.index objects, run this first: | ||
# https://github.com/Rucknium/misc-research/blob/main/Monero-Effective-Ring-Size/xmr-ring-gathering.R | ||
# Setting initial block height to 1220516 will use over 150GB of RAM. | ||
# A smaller portion of the blockchain can be analyzed, but the full PPV | ||
# calculation cannot be done without all RingCT outputs that may have the | ||
# fee fungibility defect. | ||
|
||
|
||
|
||
xmr.rings[, fee_per_byte := tx_fee / tx_size_bytes] | ||
xmr.rings[, fee_per_byte_nanoneros := floor(fee_per_byte/1000)] | ||
|
||
fees <- output.index[, .( | ||
block_height = block_height[1], | ||
block_timestamp = block_timestamp[1], | ||
tx_fee = tx_fee[1], | ||
tx_size_bytes = tx_size_bytes[1], | ||
n.outputs = max(output_num)), | ||
by = tx_hash] | ||
|
||
fees[, fee_per_byte := tx_fee/tx_size_bytes] | ||
fees[, fee_per_byte_nanoneros := floor(fee_per_byte/1000)] | ||
|
||
fees[, block_timestamp_date := as.Date(as.POSIXct(block_timestamp, origin = "1970-01-01", tz = "UTC"))] | ||
|
||
fees[, block_timestamp_isoweek := paste0(lubridate::isoyear(as.POSIXct(block_timestamp, origin = "1970-01-01", tz = "UTC")), "-", | ||
formatC(lubridate::isoweek(as.POSIXct(block_timestamp, origin = "1970-01-01", tz = "UTC")), width = 2, flag = "0"))] | ||
|
||
|
||
fees <- fees[is.finite(fee_per_byte), ] | ||
# Removes coinbase transactions | ||
|
||
v16.fork.height <- 2689608 # 2022-08-14 | ||
|
||
|
||
fee.clusters.week <- fees[block_height >= v16.fork.height & n.outputs == 2, .( | ||
total_txs = .N, | ||
z500_520_fee_per_byte = sum(fee_per_byte_nanoneros %between% c(500, 520)), | ||
z98_109_fee_per_byte = sum(fee_per_byte_nanoneros %between% c(98, 109)), | ||
z29_32_fee_per_byte = sum(fee_per_byte_nanoneros %between% c(29, 32)), | ||
z24_34_44_fee = sum(tx_fee %in% c(240600000, 342450000, 444300000)), | ||
z317_453_fee = sum(tx_fee %in% c(31720000000, 45300000000)), | ||
z500_520_perc_fee_per_byte = 100*sum(fee_per_byte_nanoneros %between% c(500, 520)/.N), | ||
z98_109_perc_fee_per_byte = 100*sum(fee_per_byte_nanoneros %between% c(98, 109)/.N), | ||
z29_32_perc_fee_per_byte = 100*sum(fee_per_byte_nanoneros %between% c(29, 32)/.N), | ||
z24_34_44_perc_fee = 100*sum(tx_fee %in% c(240600000, 342450000, 444300000)/.N), | ||
z317_453_perc_fee = 100*sum(tx_fee %in% c(31720000000, 45300000000)/.N) | ||
), | ||
by = "block_timestamp_isoweek"] | ||
|
||
names(fee.clusters.week) <- gsub("z", "", names(fee.clusters.week)) | ||
|
||
|
||
|
||
fee.clusters.day <- fees[block_height >= v16.fork.height & n.outputs == 2, .( | ||
total_txs = .N, | ||
z500_520_fee_per_byte = sum(fee_per_byte_nanoneros %between% c(500, 520)), | ||
z98_109_fee_per_byte = sum(fee_per_byte_nanoneros %between% c(98, 109)), | ||
z29_32_fee_per_byte = sum(fee_per_byte_nanoneros %between% c(29, 32)), | ||
z24_34_44_fee = sum(tx_fee %in% c(240600000, 342450000, 444300000)), | ||
z317_453_fee = sum(tx_fee %in% c(31720000000, 45300000000)), | ||
z500_520_perc_fee_per_byte = 100*sum(fee_per_byte_nanoneros %between% c(500, 520)/.N), | ||
z98_109_perc_fee_per_byte = 100*sum(fee_per_byte_nanoneros %between% c(98, 109)/.N), | ||
z29_32_perc_fee_per_byte = 100*sum(fee_per_byte_nanoneros %between% c(29, 32)/.N), | ||
z24_34_44_perc_fee = 100*sum(tx_fee %in% c(240600000, 342450000, 444300000)/.N), | ||
z317_453_perc_fee = 100*sum(tx_fee %in% c(31720000000, 45300000000)/.N) | ||
), | ||
by = "block_timestamp_date"] | ||
|
||
names(fee.clusters.day) <- gsub("z", "", names(fee.clusters.day)) | ||
|
||
|
||
|
||
fee.freq <- fees[block_height >= v16.fork.height & n.outputs == 2, table(fee_per_byte_nanoneros)] | ||
|
||
raw.fee.sort.fee.week <- fees[block_height >= v16.fork.height & n.outputs == 2, | ||
c(total = sum(.N), | ||
lapply(names(fee.freq), FUN = function(x) sum(fee_per_byte_nanoneros == as.numeric(x) )) ), | ||
by = "block_timestamp_isoweek"] | ||
|
||
names(raw.fee.sort.fee.week)[-(1:2)] <- paste0(names(fee.freq), "_per_byte") | ||
|
||
raw.fee.sort.fee.day <- fees[block_height >= v16.fork.height & n.outputs == 2, | ||
c(total = sum(.N), | ||
lapply(names(fee.freq), FUN = function(x) sum(fee_per_byte_nanoneros == as.numeric(x) )) ), | ||
by = "block_timestamp_date"] | ||
|
||
names(raw.fee.sort.fee.day)[-(1:2)] <- paste0(names(fee.freq), "_per_byte") | ||
|
||
set.seed(314) | ||
|
||
exact.fees <- c(240600000, 342450000, 444300000, 31720000000, 45300000000) | ||
|
||
example.tx.hashes <- fees[, c( | ||
lapply(exact.fees, FUN = function(x) { | ||
y <- which(tx_fee == as.numeric(x)) | ||
if (length(y) == 1) { return(c(tx_hash[y], rep("", 9))) } | ||
y <- tx_hash[sample(y, size = min(c(10, length(y))))] | ||
c(y, rep("", 10 - length(y))) | ||
}), | ||
lapply(names(fee.freq), FUN = function(x) { | ||
y <- which(fee_per_byte_nanoneros == as.numeric(x)) | ||
if (length(y) == 1) { return(c(tx_hash[y], rep("", 9))) } | ||
y <- tx_hash[sample(y, size = min(c(10, length(y))))] | ||
c(y, rep("", 10 - length(y))) | ||
}) | ||
) | ||
] | ||
|
||
names(example.tx.hashes) <- c( | ||
paste0(exact.fees/1000, "_fee_tx_id"), | ||
paste0(names(fee.freq), "_fee_per_byte_tx_id") | ||
) | ||
|
||
example.tx.hashes <- t(example.tx.hashes) | ||
|
||
|
||
fee.freq <- sort(fee.freq, decreasing = TRUE) | ||
|
||
raw.fee.sort.prevalence.week <- fees[block_height >= v16.fork.height & n.outputs == 2, | ||
c(total = sum(.N), | ||
lapply(names(fee.freq), FUN = function(x) sum(fee_per_byte_nanoneros == as.numeric(x) )) ), | ||
by = "block_timestamp_isoweek"] | ||
|
||
names(raw.fee.sort.prevalence.week)[-(1:2)] <- paste0(names(fee.freq), "_per_byte") | ||
|
||
|
||
raw.fee.sort.prevalence.day <- fees[block_height >= v16.fork.height & n.outputs == 2, | ||
c(total = sum(.N), | ||
lapply(names(fee.freq), FUN = function(x) sum(fee_per_byte_nanoneros == as.numeric(x) )) ), | ||
by = "block_timestamp_date"] | ||
|
||
names(raw.fee.sort.prevalence.day)[-(1:2)] <- paste0(names(fee.freq), "_per_byte") | ||
|
||
|
||
|
||
write.csv(fee.clusters.week, file = "fee-clusters-by-week.csv", row.names = FALSE) | ||
write.csv(fee.clusters.day, file = "fee-clusters-by-day.csv", row.names = FALSE) | ||
|
||
write.csv(raw.fee.sort.fee.week, file = "raw-fee-counts-by-week.csv", row.names = FALSE) | ||
write.csv(raw.fee.sort.fee.day, file = "raw-fee-counts-by-day.csv", row.names = FALSE) | ||
|
||
write.csv(raw.fee.sort.prevalence.week, file = "raw-fee-counts-by-week-prevalence-sort.csv", row.names = FALSE) | ||
write.csv(raw.fee.sort.prevalence.day, file = "raw-fee-counts-by-day-prevalence-sort.csv", row.names = FALSE) | ||
|
||
write.csv(example.tx.hashes, file = "example-tx-ids-by-fee.csv") | ||
|
||
|
||
|
||
|
||
est.PPV <- function(criteria.type = "fee_per_byte", criteria.set, | ||
block.height.limits = c(0, Inf), fees, ring.size = 16) { | ||
|
||
if (criteria.type == "fee_per_byte") { | ||
|
||
beta.hat <- fees[block_height %between% block.height.limits & n.outputs == 2, | ||
mean(fee_per_byte_nanoneros %between% criteria.set)] | ||
|
||
tx.hashes.w.defects <- unique(fees[block_height %between% block.height.limits & | ||
n.outputs == 2 & fee_per_byte_nanoneros %between% criteria.set, tx_hash]) | ||
|
||
number.of.defects.per.ring <- xmr.rings[tx_hash %chin% tx.hashes.w.defects, | ||
.(n.ring.members.w.defect = sum(fee_per_byte_nanoneros %between% criteria.set, na.rm = TRUE)), | ||
by = c("tx_hash", "input_num")] | ||
|
||
} | ||
|
||
if (criteria.type == "fee") { | ||
|
||
beta.hat <- fees[block_height %between% block.height.limits & n.outputs == 2, | ||
mean(tx_fee %in% criteria.set)] | ||
|
||
tx.hashes.w.defects <- unique(fees[block_height %between% block.height.limits & | ||
n.outputs == 2 & tx_fee %in% criteria.set, tx_hash]) | ||
|
||
number.of.defects.per.ring <- xmr.rings[tx_hash %chin% tx.hashes.w.defects, | ||
.(n.ring.members.w.defect = sum(tx_fee %in% criteria.set, na.rm = TRUE)), | ||
by = c("tx_hash", "input_num")] | ||
|
||
} | ||
|
||
mu_D0.hat <- number.of.defects.per.ring[, mean(n.ring.members.w.defect == 0)] | ||
|
||
mu_C.hat <- 1 - mu_D0.hat/(1-beta.hat)^n | ||
|
||
PPV <- function(n, beta, mu_C) { | ||
d <- 1:n | ||
(1/n)*(1-beta)^n*(1-mu_C) + | ||
sum( (1/d) * dbinom(d-1, n-1, beta) * (mu_C+beta*(1-mu_C)) ) | ||
} | ||
# Formula for PPV estimator | ||
# https://github.com/Rucknium/misc-research/tree/main/Monero-Fungibility-Defect-Classifier/pdf | ||
|
||
c(PPV.hat = PPV(n = ring.size, beta = beta.hat, mu_C = mu_C.hat), | ||
beta.hat = beta.hat, | ||
mu_C.hat = mu_C.hat) | ||
|
||
} | ||
|
||
start.block <- 2941340 # First block of 2023-07-31 | ||
end.block <- 2981597 # Last block of 2023-09-24 | ||
|
||
100 * est.PPV(criteria.type = "fee_per_byte", criteria.set = c(500, 520), | ||
block.height.limits = c(start.block, end.block), fees, ring.size = 16) | ||
|
||
100 * est.PPV(criteria.type = "fee_per_byte", criteria.set = c(98, 109), | ||
block.height.limits = c(start.block, end.block), fees, ring.size = 16) | ||
|
||
100 * est.PPV(criteria.type = "fee_per_byte", criteria.set = c(29, 32), | ||
block.height.limits = c(start.block, end.block), fees, ring.size = 16) | ||
|
||
100 * est.PPV(criteria.type = "fee", criteria.set = c(240600000, 342450000, 444300000), | ||
block.height.limits = c(start.block, end.block), fees, ring.size = 16) | ||
|
||
100 * est.PPV(criteria.type = "fee", criteria.set = c(31720000000, 45300000000), | ||
block.height.limits = c(start.block, end.block), fees, ring.size = 16) | ||
|
Oops, something went wrong.