<html><head></head><body>   <br><blockquote dir="auto" class="protonmail_quote" type="cite">> Null bytes aren't always "terminators". You can embed null bytes into data and still<br>> want to do utf8 processing with it.<br><br>that's questionable ... the desire to have ASCII NUL in utf-8<br>sequences (without breaking the "utf-8 sequences are usable as c<br>strings" property) is the main reason for the existence of "modified<br>utf-8".</blockquote><div dir="auto">Admittedly, that’s the first time I’ve heard of "modified utf-8". There seem to be different flavors for different languages (the Java one seems to be the most prominent), and since there is no standard, not everyone is going to use it.</div><div dir="auto"><br></div><div dir="auto">Still, U+0000 is a valid code point, and an undocumented special case for it that you have to watch out for is either a bug or a documentation error.</div><div dir="auto"><span dir="auto" style="color: var(--text-color); background: var(--bg-color);"><br></span></div><div dir="auto"><span dir="auto" style="color: var(--text-color); background: var(--bg-color);">—  Oliver Webb &lt;aquahobbyist@proton.me&gt;</span></div><div><br></div><div><br></div></body></html>