squashfs: fix received X bytes instead of expected minimal 40 on listing #201

ncw · 2023-12-18T17:58:35Z

When unpacking large squashfs archives you occasionally get this
error:

received 38 bytes instead of expected minimal 40

This is due to the inode changing type (ie it is actual an extended
inode) when read from the directory but the size not being updated.
Most of the time there is enough data for the larger inode but not
always.

I couldn't think of a way to test this.

I have an archive which demonstrates the problem you can download here: tensorflow.sqfs if you need it.

It was made with this script

#! /usr/bin/bash
ORG=${ORG:-tensorflow}
IMG=${IMG:-tensorflow}
TAG=${TAG:-latest-gpu-jupyter}
docker export $(docker create $ORG/$IMG:$TAG) -o $IMG.tar.gz
mkdir -p $IMG && tar xf $IMG.tar.gz -C $IMG
[ -f $IMG.sqfs ] && rm $IMG.sqfs
mksquashfs $IMG $IMG.sqfs  -comp zstd -Xcompression-level 3 -b 1M -no-xattrs -all-root

deitch · 2023-12-19T08:21:59Z

The following is the current process:

call getInode(), passing it the inode type
get the size from the passed type, uncompress it, get the header
If the actual type is not the expected type, use the actual type

I think this change of yours says:

OK, but if the actual type is different, then not only should the type be different, but the size should be different, too, so reread the inode using the newly-discovered size from the actual header.

Is that correct?

I couldn't think of a way to test this.
I have an archive
It was made with this script

Something in the directory structure must have triggered it. What exactly was it? Could we isolate that?

ncw · 2023-12-23T18:53:47Z

Is that correct?

Yes that is correct, though in step 4 we may have read enough data already. I'll see if I can create a squashfs which causes the problem - watch this space!

ncw · 2023-12-27T18:08:11Z

I managed to make this squashfs which demonstrates the problem.

tensorflow-nolinks.sqfs.zip

(I had to zip it for upload).

It is the tensorflow docker image but with all the files truncated to 0 size and all the symlinks removed.

It is 816K so not too big!

I could write a test which does a directory traversal of it and check that we can see all the files.

What do you think? A good enough test?

I didn't manage to make a synthetic test!

deitch · 2023-12-28T07:36:29Z

It is 816K so not too big!

It isn't, although I suspect we could make it smaller. The real question is what is triggering it.

Wouldn't a simple image with just one file (and therefore 1 inode, or maybe 2 if you count the directory), but where we call getInode() with the basic type work?

I am trying to understand how this is triggered. getInode() is called only from 3 places:

Read(), where it reads the filesystem from a file, and where it assumes that the root is a basic directory. Can the root be an extended directory? If so, that is a simple replication.
getDirectoryEntries(), where it reads the inode for each directory entry, taking the inode type from the directory entry. I cannot see why the type in the entry would be incorrect, but I guess that is possible.
hydrateDirectoryEntries(), where it gets the inode from the passed []*directoryEntryRaw, which are taken from the actual directory entries.

When you run into this problem with your sample above? What actual path is it taking? I assume it is not the first, but the second or third? What is the actual case triggering it?

ncw · 2023-12-30T17:25:40Z

Wouldn't a simple image with just one file (and therefore 1 inode, or maybe 2 if you count the directory), but where we call getInode() with the basic type work?

I managed to make a test image eventually! I think it only triggers when the directory reading is

at the end of the page
reads an extended type

I managed to get this to trigger by using 4k page sizes and 300 files with xattrs (to force them to be extended types).

I've added this to the commit.

In the process I discovered another bug! There was a small typo in the xattr parsing which I've also fixed - the new test doesn't pass without the fix.

deitch · 2023-12-31T13:37:32Z

I managed to get this to trigger by using 4k page sizes and 300 files with xattrs (to force them to be extended types).

All about the right combination to trigger the problem. Thank you, indeed, for tracking it down.

You have done so much for this PR, I hate to ask for more. Can I ask to expand the comment in buildtestsqs.sh to explain the corner case we are dealing with? No other requests, then we can merge this in.

When unpacking large squashfs archives you occasionally get this error: received 38 bytes instead of expected minimal 40 This is due to the inode changing type (ie it is actual an extended inode) when read from the directory but the size not being updated. Most of the time there is enough data for the larger inode but not always.

Before this fix the code crashed with errors like panic: runtime error: slice bounds out of range [282:270] [recovered] When reading xattrs. This was caused by a mixup of indices in the code. This fix is tested by TestSquashfsReadDirCornerCases and causes it to run clean.

ncw · 2023-12-31T17:09:39Z

You have done so much for this PR, I hate to ask for more. Can I ask to expand the comment in buildtestsqs.sh to explain the corner case we are dealing with? No other requests, then we can merge this in.

I've done that now :-) Let me know if you need any other changes.

deitch · 2023-12-31T17:44:59Z

Nope, looks great. Thank you!

ncw force-pushed the fix-large-squashfs branch from 7b0779d to fb28d9d Compare December 30, 2023 17:21

ncw added 2 commits December 31, 2023 17:07

ncw force-pushed the fix-large-squashfs branch from fb28d9d to fe4709c Compare December 31, 2023 17:08

deitch approved these changes Dec 31, 2023

View reviewed changes

deitch merged commit b20cf01 into diskfs:master Dec 31, 2023
19 checks passed

ncw mentioned this pull request Dec 31, 2023

squashfs: add configurable block cache #206

Merged

ncw deleted the fix-large-squashfs branch January 3, 2024 15:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

squashfs: fix received X bytes instead of expected minimal 40 on listing #201

squashfs: fix received X bytes instead of expected minimal 40 on listing #201

ncw commented Dec 18, 2023 •

edited

Loading

deitch commented Dec 19, 2023

ncw commented Dec 23, 2023

ncw commented Dec 27, 2023

deitch commented Dec 28, 2023

ncw commented Dec 30, 2023

deitch commented Dec 31, 2023

ncw commented Dec 31, 2023

deitch commented Dec 31, 2023

squashfs: fix received X bytes instead of expected minimal 40 on listing #201

squashfs: fix received X bytes instead of expected minimal 40 on listing #201

Conversation

ncw commented Dec 18, 2023 • edited Loading

deitch commented Dec 19, 2023

ncw commented Dec 23, 2023

ncw commented Dec 27, 2023

deitch commented Dec 28, 2023

ncw commented Dec 30, 2023

deitch commented Dec 31, 2023

ncw commented Dec 31, 2023

deitch commented Dec 31, 2023

ncw commented Dec 18, 2023 •

edited

Loading