Changes between Initial Version and Version 1 of Ticket #3783, comment 9
- Timestamp:
- 03/09/17 00:34:08 (8 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Ticket #3783, comment 9
initial v1 21 21 }}} 22 22 23 which interprets the lone invalid UTF-8 byte as unsigned 169 and is assigned to the Unicode character, this is semantically the Latin-1 -> U TF-8conversion. The rest goes on as if this was read as a valid 1-byte character.23 which interprets the lone invalid UTF-8 byte as unsigned 169 and is assigned to the Unicode character, this is semantically the Latin-1 -> Unicode conversion. The rest goes on as if this was read as a valid 1-byte character. 24 24 25 25 The pattern of denoting and carrying invalid UTF-8 bytes as negative numbers is really weird to me. I'm inclined to say that the old design worked "accidentally". At least I don't recall ever thinking about it during my viewer rewrite.