Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

抓取到的内容为cookies提醒,正常的文章抓取不到。 #123

Closed
linopluss opened this issue Nov 15, 2024 · 2 comments
Closed

Comments

@linopluss
Copy link

问题:抓取不到正常的文章,只抓取到了cookies提醒。

content 例子1
[from tcl] This site uses cookies to analyse site traffic, improve your experience and personalize ads or other contents. By clicking Accept or continuing to browse the site, you are agree to our use of cookies. See our Cookies Policy here.
Learn more

content 例子2
[from chiq] Our website uses cookies to enhance your browsing experience and to better understand how you use our site. By clicking 'Accept', you consent to the use of cookies, or by clicking
'Reject'
, you refuse the use of cookies.

content 例子3
[from haier] Thank you!
You have successfully subscribed to our newsletter and will receive a
confirmation email shortly. Our newsletter brings you an exclusive preview
of our latest products and promotions, plus design inspiration straight to
your inbox.

tags配置
image

url配置
image

@bigbrother666sh
Copy link
Member

针对这些要写专有网页提取器了,开源代码仓配备的通用网页提取器,仅支持一般常见的新闻网站,微信公众号文章

@bigbrother666sh
Copy link
Member

可以再试试看 V0.3.5版本,如果还是不能很好的抓取,请在 #136 中跟帖 url

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants