- Merged #40; updates the additional info selector, fixing #37 and #39
- Merged #33; added a fix to handle recent changes with newlines (
<br/>
tags) - Fix #34; catch case where
developer_id
can be missing.
- Fix CSS attribute selector soupsieve errors
- Correctly parse screenshot img src when placeholder is present (base64 encoded empty image)
- Revert category css selector (seems like it was reverted on the play store)
- Add support for parent category links to
category()
, e.g.GAME
- Pass through gl/hl params to multiple app detail requests
- Fix category css selector
- Fix screenshot css selector
- Update requests version
- Fix missing kwargs for
search
andsimilar
- Added manifest to fix install-option, and update requires
- Remove
**kwargs
in favor of specified kwargs - Update required
lxml
version
- Replace
grequests
withrequests-futures
- Include Python 3.7 in multiversion testing
- Moved parsing functions into utils
- Add test coverage
- Avoid using the
alt
attribute as a selector because specifying a language will change the value.
- Quick fix on import error in
scraper.py
- Add option to change the
hl
andgl
query parameters for specifying language and geolocation (country), respectively.
- Raise when passed a developer_id instead of the developer name to
.developer()
- None type check before calling
developer_address.strip()
- Set test to check >= because number of apps fetched non-deterministic since we actually scrape
- Fix Python3 compatibility (basestring, unicode())
- Use Python primitive types instead of BeautifulSoup4's
NavigableString
- Use requests[security] to fix SSL recursion errors
- Update parsing for details, collections, similar
- Fix and add tests for all of these
- Expose previously unused category list util as api
- Clean up some code here and there.
- Use absolute imports instead of relative.
- Switch README back to markdown now that PyPi supports it.
- Arabic in Current Version metadata fix
- Price not available (due to country restrictions?) and better handling with pre-register apps
- No description/description_html error handling, e.g. with app 'kumagames.onikuma' in detailed scrape
- Removed logging configs and replaced with nullhandler
- Pagination for developer method, using page tokens
- Added page token regex to settings for parsing out of response html script tags
- Added handling of 'pre-register' apps with more generalized selectors in basic card info and detailed parsing
- Fixed App detail URL and IAP range parsing with a proper selector and correctly traversing to the next next sibling, respectively
- Fixed error when developer email not available
- Added number of results and page number limit checks
- App ID raises exception when 404
- Python3 urllib.parse.quote_plus compatibility in scraper/utils
- Python3 import fixes
- No reviews (ratings) set to 0 instead of None
- Developer ID included when available
- Removed file logging
- Bugfix: AGE_RANGE params for fetching collections in the FAMILY category
- Bugfix: description_html doesn't include parent div anymore
- README examples updates
- Added urljoin import compatibility for Python 3
- Simplified parsing description to just use unicode