Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incomplete Markdown Conversion: Missing MSRP Cap from URL Content #119

Open
nagendrakumar02 opened this issue Sep 23, 2024 · 2 comments
Open

Comments

@nagendrakumar02
Copy link

nagendrakumar02 commented Sep 23, 2024

I am using r.jina.ai to read the contents of the URL and I'm encountering an issue while trying to convert the contents of the URL to markdown format. It appears that some sections are missing from the conversion, particularly the MSRP cap details from this URL. Could you please look into this?

URL: https://portal.ct.gov/deep/air/mobile-sources/cheapr/cheapr---new-eligible-vehicles
Missing content (especially MSRP cap)
image

@nomagick
Copy link
Member

Hi.
This should be our default transformer @mozilla/readability not smart enough and removing your desired content.

Please try the other mode which does not tend to remove things:

curl https://r.jina.ai/https://portal.ct.gov/deep/air/mobile-sources/cheapr/cheapr---new-eligible-vehicles -H 'x-return-format: markdown'

@nagendrakumar02
Copy link
Author

Hi. This should be our default transformer @mozilla/readability not smart enough and removing your desired content.

Please try the other mode which does not tend to remove things:

curl https://r.jina.ai/https://portal.ct.gov/deep/air/mobile-sources/cheapr/cheapr---new-eligible-vehicles -H 'x-return-format: markdown'

Thanks! that worked! However running this gives me different results each time. Do you know if I can self-host this in AWS?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants