[Toybox] [PATCH] cpio: fix misaligned header if readlink() returns unexpected value

enh enh at google.com
Mon Feb 8 10:44:14 PST 2021


On Mon, Feb 8, 2021 at 2:06 AM Yi-yo Chiang via Toybox <
toybox at lists.landley.net> wrote:

>
>
> On Mon, Feb 8, 2021 at 5:19 PM Rob Landley <rob at landley.net> wrote:
>
>> On 2/7/21 10:10 AM, Yi-Yo Chiang via Toybox wrote:
>> > If file type is symlink and readlink() fails or returns unexpected link
>> > size, then the file body wouldn't be written, resulting in a misaligned
>> > archive.
>>
>> Yup, misaligned archive is definitely a bug. I wonder if I can reproduce
>> it...
>>
>> > As explained in the comments,
>>
>> That's a bigger code comment than I'm checking in, FYI. (Large comments
>> go in
>> commit messages. In code, comments aren't executable and thus don't get
>> regression tested, so having big ones inline is dubious unless they have
>> a TODO
>> in them.)
>>
>> > there are some cases where the link size
>> > returned by lstat() and readlink() may be different.
>> > To accommodate this, we readlink() and update the value of st_size
>> > before writing the file header.
>>
>> I.E. the lstat() link size is not authoritative, the readlink() size is.
>>
>> > This may happen on Android if the userdata filesystem is encrypted.
>> >
>> > ---
>> >  tests/cpio.test   |  2 +-
>> >  toys/posix/cpio.c | 24 ++++++++++++++++++++----
>> >  2 files changed, 21 insertions(+), 5 deletions(-)
>> >
>> > diff --git a/tests/cpio.test b/tests/cpio.test
>> > index 11b3a5bf..a12edd09 100755
>> > --- a/tests/cpio.test
>> > +++ b/tests/cpio.test
>> > @@ -39,7 +39,7 @@ rm a bb ccc dddd
>> >
>> >  # archive dangling symlinks and empty files even if we cannot open them
>> >  touch a; chmod a-rwx a; ln -s a/cant b
>> > -toyonly testing "archives unreadable empty files" "cpio -o -H
>> newc|cpio -it" "a\nb\n" "" "a\nb\n"
>> > +toyonly testing "archives unreadable empty files" "cpio -o -H
>> newc|cpio -it" "a\nb\na\n" "" "a\nb\na\n"
>> >  chmod u+rw a; rm -f a b
>>
>> What does this change accomplish? The changed test still passes without
>> your
>> code change, so what did you actually demonstrate?
>>
>> Ah, this fails on your broken filesystem that's returning inconsistent
>> data. So
>> really you just need to swap the order of a and b so the symlink has an
>> entry
>> after it...
>>
>> > diff --git a/toys/posix/cpio.c b/toys/posix/cpio.c
>> > index 09c99ae1..200fb365 100644
>> > --- a/toys/posix/cpio.c
>> > +++ b/toys/posix/cpio.c
>> > @@ -230,6 +230,7 @@ void cpio_main(void)
>> >        unsigned nlen, error = 0, zero = 0;
>> >        int len, fd = -1;
>> >        ssize_t llen;
>> > +      char *lnkbuf = NULL;
>> >
>> >        len = getline(&name, &size, stdin);
>> >        if (len<1) break;
>> > @@ -242,6 +243,22 @@ void cpio_main(void)
>> >          continue;
>> >        }
>> >
>> > +      // It is possible that readlink() may fail or the actual link
>> size is
>> > +      // different from that indicated by lstat(). This may be due to
>> a race
>> > +      // condition (the link is removed/changed between the call to
>> lstat() and
>> > +      // readlink()),
>>
>> In which case it should get caught with the lstat failure above and
>> reported as
>> an error before we write out a header for it. (Better to avoid writing a
>> bad
>> entry at all than emit nonsense for padding reasons...)
>>
>> > or may happen on encrypted filesystem where lstat()
>> > +      // returns the size of encrypted link and readlink() returns the
>> size of
>> > +      // decrypted link.
>>
>> Still really seems like a filesystem bug to me, but we must work around
>> the bugs
>> that exist. (And it's not as broken as some of the crap reiserfs pulled
>> back in
>> the day...)
>
>
> Yeah I think this is a filesystem or kernel bug (fscrypt?) and the real
> good fix is for the kernel to report consistent lstat() results, but we
> have to remedy for existing devices out there that's running with this
> buggy lstat() :/
> I've raised a question regarding this issue to our kernel team.
>

i'll be curious to hear whether it's actually a bug, or a
deliberate obfuscation: "no, i'm not even going to tell you the _length_ of
this encrypted information because i worry you might be able to make
valid inferences from that".
https://en.wikipedia.org/wiki/Padding_(cryptography)#Traffic_analysis_and_protection_via_padding
kind of thing.


> > Thus we first readlink() and update the st_size, bail
>> > +      // out early if readlink() fails, then write the file header and
>> the link.
>> > +      if (S_ISLNK(st.st_mode)) {
>> > +        lnkbuf = xreadlink(name);
>> > +        if (!lnkbuf) {
>> > +          perror_msg("readlink '%s'", name);
>> > +          continue;
>> > +        }
>> > +        st.st_size = strlen(lnkbuf);
>> > +      }
>> > +
>> >        if (FLAG(no_preserve_owner)) st.st_uid = st.st_gid = 0;
>> >        if (!S_ISREG(st.st_mode) && !S_ISLNK(st.st_mode)) st.st_size = 0;
>> >        if (st.st_size >> 32) perror_msg("skipping >2G file '%s'", name);
>> > @@ -262,9 +279,7 @@ void cpio_main(void)
>> >          // Write out body for symlink or regular file
>> >          llen = st.st_size;
>> >          if (S_ISLNK(st.st_mode)) {
>> > -          if (readlink(name, toybuf, sizeof(toybuf)-1) == llen)
>> > -            xwrite(afd, toybuf, llen);
>> > -          else perror_msg("readlink '%s'", name);
>>
>> My original bug was that should have been perror_exit() because the file
>> was
>> corrupted otherwise. Or the xwrite() should have been unconditional so it
>> didn't
>> get misaligned. Ala I _should_ have written:
>>
>>   if (readlink(name, toybuf, sizeof(toybuf)-1) != llen)
>>     perror_msg("readlink '%s'", name);
>>   xwrite(afd, toybuf, llen);
>>
>> But doing the readlink with the open() and noticing failure before
>> writing the
>> header is the better fix, and then using the link string length to
>> override what
>> stat() says becomes a one line bug workaround for the new issue you hit...
>>
>> > +          xwrite(afd, lnkbuf, llen);
>> >          } else while (llen) {
>> >            nlen = llen > sizeof(toybuf) ? sizeof(toybuf) : llen;
>> >            llen -= nlen;
>> > @@ -276,7 +291,8 @@ void cpio_main(void)
>> >          llen = st.st_size & 3;
>> >          if (llen) xwrite(afd, &zero, 4-llen);
>> >        }
>> > -      close(fd);
>> > +      if (lnkbuf) free(lnkbuf);
>> > +      if (fd != -1) close(fd);
>>
>> free(0) and close(-1) are both defined as NOPs by posix.
>
>
> Good to know this!.
>
>
>> I usually test for the
>> close one because it's an exra syscall and noise in strace, but free(0)
>> is just
>> a libc call. And in fact, I can just modify xclose() to do the -1 test
>> for me...
>>
>> (I've gotten REALLY lax about using xclose because as far as I know the
>> only
>> time close() can EVER fail is on NFS, but I admit "not having to test for
>> -1" is
>> a reason to use it...)
>>
>> >      }
>> >      free(name);
>>
>> I checked in a slightly different fix. Does that solve your problem?
>>
>
> Yup, verified on my android test bench.
>
>
>>
>> Thanks,
>>
>> Rob
>>
>
>
> --
>
> Yi-yo Chiang
> Software Engineer
> yochiang at google.com
> _______________________________________________
> Toybox mailing list
> Toybox at lists.landley.net
> http://lists.landley.net/listinfo.cgi/toybox-landley.net
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.landley.net/pipermail/toybox-landley.net/attachments/20210208/96fcad70/attachment.htm>


More information about the Toybox mailing list