ScrapeBillTrack50

For each legislator in a list, do a Google search for the relevant Bill Track 50 webpage and then search that webpage for specific information.

Note: Right now the code is setup only to return the scheduler information for each legislator.

To report a bug or request a feature, please file an issue.

You may choose to use the latest image as a clean environment.
An alternative mode has you checkout the code locally, but use one of the Docker images to provide the necessary dependencies. In this case, the code within the image will automatically be replaced by your local version using a bind mount. This method is most useful for people wishing to develop a new feature for the repository, but who want to avoid installing the dependencies on their local machine. The resulting images will be copied back to the host machine. To run in this mode, a helper script has been developed to wrap up all of the Docker complexities. Simply run:
```
.docker/run.sh -C configs/assignments.py
```
Note: To use this running mode, you will need to have permission to bind mount the local directory and the local user will need permission to write to that directory as well. This is typically not a problem unless the repository has been checked out inside a restricted area of the operating system or the permissions on the directory have been changed.

Dependencies

Required dependencies:

Python 3
magiconfig (GitHub): Used to read Python configuration files.
- Can be installed using the command pip3 install --no-cache-dir magiconfig
googlesearch-python (GitHub): Used to search Google for the correct Bill Track 50 webpage
- Can be installed using the command pip3 install --no-cache-dir googlesearch-python
urllib3 (GitHub): Used to get the HTML from a given webpage
- Can be installed using the command pip3 install --no-cache-dir urllib3
beautifulsoup4: Used to parse the HTML from a given Bill Track 50 webpage
- Can be installed using the command pip3 install --no-cache-dir beautifulsoup4

There is a script available to make sure all of the needed dependencies are installed:

python3 check_for_dependencies.py

Optional dependencies:

PyLint (GitHub, PyPI): Used for linting Python modules.
pytest (GitHub, PyPI): Used for unit testing Python modules.

Contributing

Pull requests are welcome, but please submit a proposal issue first, as the library is in active development.

Current maintainers:

Alexx Perloff

Unit Testing

Unit testing is performed using PyTest. You are of course allowed to install this programs locally. However, a shell script has been setup to make this procedure as easy as possible.

To run the python unit/integration tests, you will need to have PyTest installed. To create a local virtual environment with PyTest installed, use the following commands from within the repository's base directory:

./test/pytest_control.sh -s

You only have to run that command when setting up the virtual environment the first time. You can then run the tests by using the command:

./test/pytest_control.sh

You should see an output similar to:

======================================================== test session starts ========================================================
platform darwin -- Python 3.9.10, pytest-7.0.1, pluggy-1.0.0
rootdir: <path to ScrapeBillTrack50>
collected 2 items

test/test.py ..                                                                                                               [100%]

======================================================== 2 passed in 13.34s =========================================================

You can pass addition options to PyTest using the -o flag. For example, you could run the following command to increase the verbosity of PyTest:

./test/pytest_control.sh -o '--verbosity=3'

Other helpful pytest options include:

-rP: To see the output of successful tests. This is necessary because by default all of the output from the various tests is captured by PyTest.
-rx: To see the output of failed tests (default).
-k <testname>: Will limit the tests run to just the test(s) specified. The <testname> can be a class of tests or the name of a specific unit test function.

To remove the virtual environment use the command:

./test/pytest_control.sh -r

which will simply remove the test/venv directory.

Linting

Linting is done using PyLint. The continuous integration jobs on GitHub will run these linters as part of the PR validation process. You may as well run them in advance in order to shorten the code review cycle. PyLint can be run as part of the Python unit testing process using the command:

test/pytest_control.sh -l

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.docker		.docker
.github		.github
configs		configs
test		test
.gitignore		.gitignore
.pylintrc		.pylintrc
README.md		README.md
ScrapeBillTrack50.py		ScrapeBillTrack50.py
check_for_dependencies.py		check_for_dependencies.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ScrapeBillTrack50

Table of Contents

Installation

Local Installation

Available Docker Images

Command Line Interface

Basic example

Options

Using Docker

Dependencies

Contributing

Unit Testing

Linting

About

Releases

Packages

Languages

aperloff/ScrapeBillTrack50

Folders and files

Latest commit

History

Repository files navigation

ScrapeBillTrack50

Table of Contents

Installation

Local Installation

Available Docker Images

Command Line Interface

Basic example

Options

Using Docker

Dependencies

Contributing

Unit Testing

Linting

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages