
Optimization potential for df on FAT? #21

Open

Mellvik opened this issue Aug 30, 2023 · 1 comment
Labels: enhancement

Mellvik (Owner) commented Aug 30, 2023

While wrapping up the work on issue #20, I noticed that doing a df on a mounted FAT volume generates an enormous number of unmaps/remaps of the same block. Enormous = 487.

Even on a small (17M) FAT volume, df is notoriously time consuming, even if significantly better today than a couple of years ago (on ELKS). I have been assuming that the reason was lots of disk reads to gather the required data, but 17 consecutive blocks isn't much (the metric seems to be 1 FAT block per MB for FAT16: 41 consecutive blocks on a 41M volume).
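
(Back-of-the-envelope, without having checked the fs code: a FAT16 entry is 2 bytes, so a 1K buffer block holds 512 entries; assuming 2K clusters, that covers 512 x 2K = 1M of volume per FAT block, which would explain the ~1 block per MB metric.)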

Watching the 487 maps/unmaps per block, and (admittedly) not having looked at the code yet, it seems there must be some optimization potential here: best case at the FS level, worst case in the df code itself.

Mellvik added the enhancement label Aug 30, 2023
ghaerr commented Aug 30, 2023

Are you using the newer list_buffer_status code that shows maps/remaps/unmaps? If so, it is important to differentiate between a remap and the others. A remap means the desired buffer was already present in L1 (not necessarily with b_mapcount > 0) and was reused without an L2<->L1 copy. The map and unmap counts show copies into and out of L1 respectively, and those take lots of time. It should be showing lots and lots of remaps and relatively few maps/unmaps.
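
Schematically, the accounting works something like this (a deliberately simplified sketch, not the actual buffer code; the names and the single-slot L1 "pool" are invented for illustration):

```c
/* Simplified sketch of L1/L2 buffer accounting -- illustrative names,
 * not the real TLVC identifiers. */
#include <stdio.h>

#define BLOCK_SIZE 1024

struct buffer {
    char *b_l1data;   /* L1 (near heap) copy, NULL when only in L2 */
    int   b_mapcount; /* active users of the L1 mapping */
};

static long maps, remaps, unmaps;
static char l1_slot[BLOCK_SIZE];  /* pretend L1 pool of one slot */

/* Make a buffer addressable: reuse the L1 copy if it is still there. */
char *buf_map(struct buffer *bh)
{
    if (bh->b_l1data) {
        remaps++;             /* cheap: no L2<->L1 copy */
    } else {
        bh->b_l1data = l1_slot;
        maps++;               /* slow: copy block in from L2 (XMS/far) */
    }
    bh->b_mapcount++;
    return bh->b_l1data;
}

/* Dropping the last user does NOT evict: the L1 copy stays around, so a
 * later buf_map() can be a remap even with b_mapcount back at 0. */
void buf_release(struct buffer *bh)
{
    if (bh->b_mapcount > 0)
        bh->b_mapcount--;
}

/* Eviction is what actually costs an unmap: copy L1 back out to L2. */
void buf_evict(struct buffer *bh)
{
    if (bh->b_l1data) {
        bh->b_l1data = NULL;  /* real code copies dirty data to L2 here */
        unmaps++;
    }
}

int main(void)
{
    struct buffer bh = {0};

    buf_map(&bh);  buf_release(&bh);   /* first touch: a map   */
    buf_map(&bh);  buf_release(&bh);   /* still in L1: a remap */
    buf_evict(&bh);                    /* pushed out: an unmap */
    printf("maps=%ld remaps=%ld unmaps=%ld\n", maps, remaps, unmaps);
    return 0;
}
```

The point being that a remap is nearly free, while every map and unmap moves a whole block between L2 and L1, and that is where the time goes.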

As you know, calculating disk free space on FAT filesystems got so bad that FAT32 introduced a new mechanism for it (the FSInfo sector), which isn't supported here. We use the original method, which has to scan the entire FAT table, thus mapping every single FAT block, likely many times, and that is probably what you're seeing.
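
In outline, the original method looks something like this (a simplified sketch of the classic FAT16 scan, not the actual fs code; `read_fat_block` stands in for mapping a FAT block through the buffer cache):

```c
/* Illustrative sketch of the classic FAT16 free-space scan -- not the
 * TLVC fs source. Every 2-byte FAT entry must be inspected, so every
 * FAT block has to be brought into L1 at least once. */
#include <stdio.h>
#include <stdint.h>

#define BLOCK_SIZE        1024
#define ENTRIES_PER_BLOCK (BLOCK_SIZE / 2)   /* FAT16: 2 bytes per entry */

/* Stand-in for mapping one FAT block through the buffer cache. */
static const uint16_t *read_fat_block(const uint16_t *fat, unsigned blk)
{
    return fat + (unsigned long)blk * ENTRIES_PER_BLOCK;
}

/* Count free clusters: a FAT16 entry of 0x0000 means "free".
 * Clusters 0 and 1 are reserved, so data clusters start at 2. */
static unsigned long fat16_count_free(const uint16_t *fat, unsigned long nclusters)
{
    unsigned long nfree = 0, cl = 2;

    while (cl < nclusters + 2) {
        unsigned blk = (unsigned)(cl / ENTRIES_PER_BLOCK);
        const uint16_t *entries = read_fat_block(fat, blk);

        /* walk every entry that lives in this one block */
        do {
            if (entries[cl % ENTRIES_PER_BLOCK] == 0)
                nfree++;
            cl++;
        } while (cl < nclusters + 2 && cl / ENTRIES_PER_BLOCK == blk);
        /* real code would release the buffer mapping here */
    }
    return nfree;
}

int main(void)
{
    /* a 17-block FAT, roughly the 17M volume case; all entries free */
    static uint16_t fat[17 * ENTRIES_PER_BLOCK];

    printf("free clusters: %lu\n",
           fat16_count_free(fat, 17UL * ENTRIES_PER_BLOCK - 2));
    return 0;
}
```

The pass over the FAT blocks is cheap while they stay resident in L1; it's when they get pushed out and revisited that the map/unmap copies pile up.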

> it seems there must be some optimization potential here: best case at the FS level, worst case in the df code itself.

The df code uses the new ustatfs system call, which in turn uses the same mechanism as reported by the mount command. IIRC there's an option as to whether to calculate FAT free space, since it's well known to be a very slow algorithm on larger hard drives. There is no way it can be sped up without moving to the new FAT32 method, which essentially runs the calculation in the background on DOS/Windows and then writes a snapshot block somewhere.
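
For reference, that FAT32 mechanism is the FSInfo sector: the filesystem keeps a running free-cluster count (plus a next-free hint) in a reserved sector, so computing disk free becomes a single sector read. A sketch of reading that snapshot, with offsets per the published spec (illustrative, not code from this project):

```c
/* Sketch of reading the FAT32 FSInfo sector's cached free-cluster count.
 * Offsets and signatures follow the published FAT32 spec; everything
 * else here is illustrative. */
#include <stdio.h>
#include <stdint.h>

#define FSI_LEADSIG  0x41615252UL   /* "RRaA" at offset 0   */
#define FSI_STRUCSIG 0x61417272UL   /* "rrAa" at offset 484 */

static uint32_t le32(const uint8_t *p)
{
    return (uint32_t)p[0] | (uint32_t)p[1] << 8 |
           (uint32_t)p[2] << 16 | (uint32_t)p[3] << 24;
}

/* Returns 1 and stores the cached count in *out if the sector is valid;
 * returns 0 on bad signatures or the 0xFFFFFFFF "unknown" marker, which
 * forces the slow full-FAT scan. */
static int fsinfo_free_count(const uint8_t sector[512], uint32_t *out)
{
    if (le32(sector) != FSI_LEADSIG || le32(sector + 484) != FSI_STRUCSIG)
        return 0;
    *out = le32(sector + 488);       /* FSI_Free_Count */
    return *out != 0xFFFFFFFFUL;
}

int main(void)
{
    uint8_t sec[512] = {0};
    uint32_t nfree;

    /* forge a minimal valid FSInfo sector claiming 12345 free clusters */
    sec[0] = 0x52; sec[1] = 0x52; sec[2] = 0x61; sec[3] = 0x41;
    sec[484] = 0x72; sec[485] = 0x72; sec[486] = 0x41; sec[487] = 0x61;
    sec[488] = 0x39; sec[489] = 0x30;            /* 12345 = 0x3039 */

    if (fsinfo_free_count(sec, &nfree))
        printf("free clusters (cached): %lu\n", (unsigned long)nfree);
    return 0;
}
```

The cost shifts to keeping the count current on every cluster allocation and free, which is what the background maintenance amounts to.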
