using LLM to detect website structure #133

berkbirkan · 2024-08-25T00:53:37Z

hello. first of all, i am not sure if this is the right place. i am quite new to this, so i apologize in advance if i am doing something wrong.

first of all, i need the following:

create a full-content RSS/JSON structure from a ready RSS feed url
create an rss feed from a website url (with or without AI)
pull a single content from a single url (e.g. a news or blog post url)

the items i listed above should be valid for all sites, because i am developing a product for the end user and everyone's needs will be different. from what i have seen in my tests, i can pull some sites without any problems, while some sites have problems. i think morss cannot detect these sites because they use very different structures than the expected structure. morss recognizes certain structures manually, but it cannot do this dynamically. can artificial intelligence be used to detect the structure of such sites and then pull content? (similar to what the scrapegraph-ai library does)

thank you

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

using LLM to detect website structure #133

using LLM to detect website structure #133

berkbirkan commented Aug 25, 2024

using LLM to detect website structure #133

using LLM to detect website structure #133

Comments

berkbirkan commented Aug 25, 2024