<div dir="ltr">(heh, the time i spent staring at gmail's progress bar as i attached the file make me feel more justified in my "get off my lawn!" attitude here. before now i'm not sure i even knew there _was_ a progress bar when you attach a file :-) )</div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, Aug 13, 2021 at 12:19 PM enh <<a href="mailto:enh@google.com">enh@google.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">ack. i'll send it to you directly, because i was on too many mailing lists in the 1990s to feel comfortable sending an 11MiB attachment to a list :-)<div><br></div><div>(the instructions to generate these files yourself by running the <a href="https://github.com/gavinhoward/bc" target="_blank">https://github.com/gavinhoward/bc</a> tests would be far shorter, but it would take too long for me to reverse-engineer how i got into that mess in the first place, so...)</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, Aug 13, 2021 at 3:12 AM Rob Landley <<a href="mailto:rob@landley.net" target="_blank">rob@landley.net</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On 8/12/21 4:27 PM, enh via Toybox wrote:<br>
> you know how you (rob) have repeatedly expressed your desire to have a different<br>
> diff implementation, and i've always either ignored you or claimed that the<br>
> existing one is good enough?<br>
> <br>
> well ... i finally hit a case where i can tell the difference. it turns out that<br>
> if you have 3 million lines in the files you're diffing, GNU diff can get<br>
> through that in less than 10s, busybox takes just under an hour (!), and toybox<br>
> takes just over an hour.<br>
> <br>
> i'm assuming you already knew of cases like this, but i'll keep my two 125MiB<br>
> files somewhere just in case. they compress pretty well, being _very_ repetitive<br>
> ASCII, but the zip file is still 11MiB so i won't post it without being asked.<br>
<br>
I'd love to get a copy of those just for personal development testing if I can.<br>
(Running problematic real world data through the thing is always preferable.)<br>
<br>
I'll add a TODO for figuring out how to reasonably have the test suite address<br>
the issue without checking in an 11 megabyte test file. :)<br>
<br>
Thanks,<br>
<br>
Rob<br>
</blockquote></div>
</blockquote></div>