-
-
Notifications
You must be signed in to change notification settings - Fork 314
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
charset 1251 problem #509
Comments
Can I have an URL to reproduce this? |
https://feedster.ru/emb.php to check if cloudflare blocks |
Oooo, I too would like to see a solution for this. The Cyrillic is completely voided when the page isn't UTF-8. |
Embed uses this parser to convert the HTML to DOMDocuments and deal with encoding issues. The function is used here. I've forced the encoding in the second argument and it works fine: $this->document = !empty($html) ? Parser::parse($html, "windows-1251") : new DOMDocument(); So probably the automatic encoding detection of the parser doesn't work fine. Not sure how to fix it and I don't have much time to dig into this. If anyone wants to work on a PR to fix it, it's very appreciated. |
I haven't thoroughly tested this... but |
Hi. I encountered a problem with the encoding. Cyrillic does not display, how to collect data in the encoding 1251. tryed to use iconv, but Fatal error: Uncaught TypeError: iconv(): Argument #3 ($string) must be of type string, Embed\Extractor given
The text was updated successfully, but these errors were encountered: