-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incorrect decoding (truncation) of ByteString
, but not Text
#91
Comments
Here is the bug: tagsoup/src/Text/StringLike.hs Line 64 in 44c32d6
Which uses and that function truncates because it uses:
|
Thanks for the report - agreed that isn't ideal. What would you expect |
I've thought about this a lot. EDIT: Lol, github displays that as an emoji. |
Shoving UTF8 into a bytestring seems like you are treating the type as a different type (having such a type widely used in Haskell would be great, and maybe with text it will be one day). Removing the instance breaks it for people who want to use that, knowing its limitations - I probably wouldn't add such a instance today, but removing it seems too far. Documenting the caveats seems a good idea regardless though. |
👍 I still mostly consume |
There seems to be a bug in tagsoup, that makes it so that characters that do not fit
into latin1, when utf8-encoded as HTML entities, are truncated when parsed.
Here is an example:
The text was updated successfully, but these errors were encountered: