Skip to content

Commit

Permalink
Fix typos and sync README -> docs/intro.rst
Browse files Browse the repository at this point in the history
  • Loading branch information
Thamme Gowda committed Aug 29, 2023
1 parent a77c0d5 commit b50ae95
Show file tree
Hide file tree
Showing 3 changed files with 32 additions and 24 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# Sotastream
# Sotastream
[![image](http://img.shields.io/pypi/v/sotastream.svg)](https://pypi.python.org/pypi/sotastream/)
[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](./LICENSE)
[![Read the Docs](https://img.shields.io/readthedocs/sotastream.svg)](https://sotastream.readthedocs.io/)
Expand All @@ -23,9 +23,9 @@ pip install sotastream
# To begin, clone the repository:
git clone https://github.com/marian-nmt/sotastream
cd sotastream
# option 1:
# option 1:
python -m pip install .
# option 2: install in --editable model
# option 2: install in --editable mode
python -m pip install -e .
```

Expand Down
7 changes: 6 additions & 1 deletion docs/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,4 +25,9 @@ python -m build --sdist --wheel -o dist/
twine upload -r testpypi dist/*
twine upload -r pypi dist/*

```
```


## Update Docs

Go to https://readthedocs.org/projects/sotastream/ and click/touch "Build" button.
43 changes: 23 additions & 20 deletions docs/introduction.rst
Original file line number Diff line number Diff line change
Expand Up @@ -10,31 +10,37 @@ uses `infinibatch <https://github.com/microsoft/infinibatch>`_ internally to gen
shuffled training data and provides a means for on-the-fly data
manipulation, augmentation, mixing, and sampling.

Cloning and initialization
--------------------------

To begin, clone the repository:

::

git clone https://github.com/marian-nmt/sotastream
Setup
-----

To install from PyPI (https://pypi.org/project/sotastream/)

You can then install it as follows.

.. code:: bash
cd sotastream
pip install sotastream
*Developer Setup:*

.. code:: bash
# To begin, clone the repository:
git clone https://github.com/marian-nmt/sotastream
cd sotastream
# option 1:
python -m pip install .
python -m pip install --no-deps . # install without dependencies
# option 2: install in --editable mode
python -m pip install -e .
If you already have your own version of requirements, add
``--no-deps / --no-dependencies`` flag to skip installing dependencies.
*Entry points*
* As a module: `python -m sotastream`
* As a bin in your $PATH: `sotastream`

Entry points \* As a module: ``python -m sotastream`` \* As a bin in
your $PATH: ``sotastream`` \* Via path to script:
``python path/to/cli.py``. For convenience, cli.py is in the root of
repository

Development
-----------
Expand Down Expand Up @@ -94,11 +100,8 @@ sotastream will split them to checksummed folders under

python -m sotastream example parallel.tsv.gz backtrans.tsv.gz

(The garbage file is assumed to have just a single column of data, which
is copied).

There are currently two main pipelines: “default”, and “wmt”. These vary
according to the data sources they take as well as the other options
There are currently two main pipelines: “default”, and “wmt”.
These vary according to the data sources they take as well as the other options
available to them.

There are global options that control behavioral aspects such as
Expand All @@ -116,7 +119,7 @@ can see these by running
# see wmt pipeline options
python -m sotastream wmt -h

Dont cross the streams!
Don't cross the streams!
------------------------

Sotastream workflows build a directed acyclic graph (DAG) consisting of
Expand Down

0 comments on commit b50ae95

Please sign in to comment.