Merge cherry-picked commits from #33 (#43)
* update: component classifier functions

* add: banner component parser

* fix: return model as dict, require pydantic

* fix: nonetype has no attrs; ignore if no img

* fix: filter empty rso component

* update: modularized component classifier

* Bump to 0.3.2

* Replace dummy package bs4 with official name

* Add function to load external html files

* update: removed load_serp, see load_html in ws.webutils

---------

Co-authored-by: wanLo <[email protected]>
gitronald and wanLo authored Nov 14, 2023
1 parent f9692d9 commit 8243c12
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion WebSearcher/searchers.py
@@ -189,7 +189,7 @@ def save_serp(self, save_dir='.', append_to=''):
"""Save SERP to file
Args:
- save_dir (str, optional): Save results as `save_dir/{serp_id}.json`
+ save_dir (str, optional): Save results as `save_dir/{serp_id}.html`
append_to (str, optional): Append results to this file path
"""
assert self.html, "Must conduct a search first"
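
The docstring above describes two save modes: one HTML file per SERP, or appending to a shared file. A minimal sketch of that behavior follows, under stated assumptions: `save_serp_sketch` is a hypothetical standalone function (not the method in `WebSearcher/searchers.py`), and the JSON-lines layout used for `append_to` is an assumption, since the diff does not show the method body.

```python
import json
import os


def save_serp_sketch(html, serp_id, save_dir=".", append_to=""):
    """Hypothetical stand-in for the two save modes in the docstring.

    Returns the path that was written to.
    """
    assert html, "Must conduct a search first"

    if append_to:
        # Assumed format: one JSON object per line, appended to the file.
        record = {"serp_id": serp_id, "html": html}
        with open(append_to, "a", encoding="utf-8") as outfile:
            outfile.write(json.dumps(record) + "\n")
        return append_to

    # Default mode: write the raw HTML to save_dir/{serp_id}.html,
    # matching the corrected docstring.
    path = os.path.join(save_dir, f"{serp_id}.html")
    with open(path, "w", encoding="utf-8") as outfile:
        outfile.write(html)
    return path
```

Writing raw HTML is what makes the `.html` extension in the corrected docstring the accurate one; a `.json` extension would only fit if the file held a serialized record.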
2 changes: 1 addition & 1 deletion setup.py
@@ -27,6 +27,6 @@ def get_readme_descriptions(fp='README.md', s='#', stop_at=2):
long_description=long_description,
long_description_content_type='text/markdown',
packages=setuptools.find_packages(),
- install_requires=['requests','lxml','bs4','tldextract','brotli', 'pydantic'],
+ install_requires=['requests','lxml','beautifulsoup4','tldextract','brotli','pydantic'],
python_requires='>=3.6'
)
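
The rename in `install_requires` changes only the distribution name on PyPI; the import name stays `bs4`. A small sketch for checking both sides of that distinction is below. The helper `describe` is hypothetical, and it relies on `importlib.metadata`, which requires Python 3.8 or later (newer than the `python_requires='>=3.6'` floor shown above).

```python
from importlib import metadata, util


def describe(dist_name, import_name):
    """Report whether a distribution is installed and its module importable.

    dist_name is the name used with pip (e.g. 'beautifulsoup4');
    import_name is the name used in an import statement (e.g. 'bs4').
    """
    try:
        version = metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        version = None

    # find_spec returns None when no top-level module of that name exists.
    importable = util.find_spec(import_name) is not None
    return {"distribution": dist_name, "version": version, "importable": importable}


if __name__ == "__main__":
    # Output depends on the local environment, so none is shown here.
    print(describe("beautifulsoup4", "bs4"))
```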
