Initial release version 0.1.0 + update docs

PiperOrigin-RevId: 625552588
google-deepmind · Apr 17, 2024 · 42df7b5 · 42df7b5
1 parent da61623
commit 42df7b5
Show file tree

Hide file tree

Showing 4 changed files with 140 additions and 14 deletions.
diff --git a/README.md b/README.md
@@ -1,8 +1,120 @@
 # Penzai
 
-Penzai is a JAX research toolkit for inspecting, patching, and visualizing
-neural networks. More details coming soon.
+> **盆 ("pen", tray) 栽 ("zai", planting)** - *an ancient Chinese art of forming
+  trees and landscapes in miniature, also called penjing and an ancestor of the
+  Japanese art of bonsai.*
 
+Penzai is a JAX library for writing models as legible, functional pytree data
+structures, along with tools for visualizing, modifying, and analyzing them.
+Penzai focuses on **making it easy to do stuff with models after they have been
+trained**, making it a great choice for research involving reverse-engineering
+or ablating model components, inspecting and probing internal activations,
+performing model surgery, debugging architectures, and more. (But if you just
+want to build and train a model, you can do that too!)
+
+Penzai is structured as a collection of modular tools, designed together but
+each useable independently:
+
+* `penzai.nn` (`pz.nn`): A declarative combinator-based neural network
+  library and an alternative to other neural network libraries like Flax, Haiku,
+  Keras, or Equinox, which exposes the full structure of your model's
+  forward pass in the model pytree. This means you can see everything your model
+  does by pretty printing it, and inject new runtime logic with `jax.tree_util`.
+  Like Equinox, there's no magic: models are just callable pytrees under the
+  hood.
+
+* `penzai.treescope` (`pz.ts`): A superpowered interactive Python
+  pretty-printer, which works as a drop-in replacement for the ordinary
+  IPython/Colab renderer. It's designed to help understand Penzai models and
+  other deeply-nested JAX pytrees, with built-in support for visualizing
+  arbitrary-dimensional NDArrays.
+
+* `penzai.core.selectors` (`pz.select`): A pytree swiss-army-knife,
+  generalizing JAX's `.at[...].set(...)` syntax to arbitrary type-driven
+  pytree traversals, and making it easy to do complex rewrites or
+  on-the-fly patching of Penzai models and other data structures.
+
+* `penzai.core.named_axes` (`pz.nx`): A lightweight named axis system which
+  lifts ordinary JAX functions to vectorize over named axes, and allows you to
+  seamlessly switch between named and positional programming styles without
+  having to learn a new array API.
+
+* `penzai.data_effects` (`pz.de`): An opt-in system for side arguments, random
+  numbers, and state variables that is built on pytree traversal and puts you
+  in control, without getting in the way of writing or using your model.
+
+Documentation on Penzai can be found at penzai.readthedocs.io.
+
+
+## Getting Started
+
+If you haven't already installed JAX, you should do that first, since the
+installation process depends on your platform. You can find instructions in the
+[JAX documentation](https://jax.readthedocs.io/en/latest/installation.html).
+Afterward, you can install Penzai using
+
+```
+pip install penzai
+```
+
+and import it using
+
+```
+import penzai
+from penzai import pz
+```
+
+(`penzai.pz` is an *alias namespace*, which makes it easier to reference
+common Penzai objects.)
+
+When working in an Colab or IPython notebook, we recommend also configuring
+Penzai as the default pretty printer, and enabling some utilities for
+interactive use:
+
+```
+pz.ts.register_as_default()
+pz.ts.register_autovisualize_magic()
+pz.enable_interactive_context()
+
+# Optional: enables automatic array visualization
+pz.ts.active_autovisualizer.set_interactive(pz.ts.ArrayAutovisualizer())
+```
+
+Here's how you could initialize and visualize a simple neural network:
+
+```
+from penzai.example_models import simple_mlp
+mlp = pz.nn.initialize_parameters(
+    simple_mlp.MLP.from_config([8, 32, 32, 8]),
+    jax.random.key(42),
+)
+
+# Models and arrays are visualized automatically when you output them from a
+# Colab/IPython notebook cell:
+mlp
+```
+
+Here's how you could capture and extract the activations after the elementwise
+nonlinearities:
+
+```
+mlp_with_captured_activations = pz.de.CollectingSideOutputs.handling(
+    pz.select(mlp)
+    .at_instances_of(pz.nn.Elementwise)
+    .insert_after(pz.de.TellIntermediate())
+)
+
+output, intermediates = mlp_with_captured_activations(
+  pz.nx.ones({"features": 8})
+)
+```
+
+To learn more about how to build and manipulate neural networks with Penzai,
+we recommend starting with the ["How to Think in Penzai" tutorial][], or one
+of the other tutorials in the [Penzai documentation][].
+
+["How to Think in Penzai" tutorial]: https://penzai.readthedocs.io/en/stable/notebooks/how_to_think_in_penzai.html
+[Penzai documentation]: https://penzai.readthedocs.io
 
 
 ---

diff --git a/docs/index.rst b/docs/index.rst
@@ -11,8 +11,12 @@ Penzai
   Japanese art of bonsai.*
 
 Penzai is a JAX library for writing models as legible, functional pytree data
-structures, with tools that make it easy to visualize, edit, and analyze them
-both before and after they are trained.
+structures, along with tools for visualizing, modifying, and analyzing them.
+Penzai focuses on **making it easy to do stuff with models after they have been
+trained**, making it a great choice for research involving reverse-engineering
+or ablating model components, inspecting and probing internal activations,
+performing model surgery, debugging architectures, and more. (But if you just
+want to build and train a model, you can do that too!)
 
 It is structured as a collection of modular tools, designed together but each
 useable independently:
@@ -89,21 +93,31 @@ interactive use::
   pz.ts.register_autovisualize_magic()
   pz.enable_interactive_context()
 
+  # Optional: enables automatic array visualization
+  pz.ts.active_autovisualizer.set_interactive(pz.ts.ArrayAutovisualizer())
 
-You can then automatically visualize arrays by using the ``%%autovisualize``
-IPython cell magic. Alternatively, you can turn on array autovisualization
-by default using ::
+Here's how you could initialize and visualize a simple neural network::
 
-  pz.ts.active_autovisualizer.set_interactive(pz.ts.ArrayAutovisualizer())
+  from penzai.example_models import simple_mlp
+  mlp = pz.nn.initialize_parameters(
+      simple_mlp.MLP.from_config([8, 32, 32, 8]),
+      jax.random.key(42),
+  )
 
+  # Models and arrays are visualized automatically when you output them from a
+  # Colab/IPython notebook cell:
+  mlp
 
-We recommend starting with the
+To learn more about how to build and manipulate neural networks with Penzai,
+we recommend starting with the
 :doc:`"How to Think in Penzai" <notebooks/how_to_think_in_penzai>`
 notebook, which gives a high-level overview of how to think about and use Penzai
 models. Afterward, you coould:
 
-* Take a look at one of the example notebooks to see how you can use Penzai to visualize and modify pretrained models.
-* Or, read through the guides in the left sidebar to learn more about each of Penzai's components.
+* Take a look at one of the example notebooks to see how you can use Penzai to
+  visualize and modify pretrained models.
+* Or, read through the guides in the left sidebar to learn more about each of
+  Penzai's components.
 
 
 .. toctree::

diff --git a/notebooks/how_to_think_in_penzai.ipynb b/notebooks/how_to_think_in_penzai.ipynb
@@ -925,7 +925,7 @@
     "id": "q7n-su7n93Ur"
    },
    "source": [
-    "This pattern applies to layers that are designed for hot-swapping. For instance, the `KVCachingAttention` block defines a classmethod `.from_uncached` that converts an `Attention` block into a `KVCachingAttention`, which takes ownership of the children of that `Attention` block and then discards the original block."
+    "This pattern also applies to layers that are designed for hot-swapping. For instance, the `KVCachingAttention` block defines a classmethod `.from_uncached` that converts an `Attention` block into a `KVCachingAttention`, which takes ownership of the children of that `Attention` block and then discards the original block."
    ]
   },
   {

diff --git a/penzai/__init__.py b/penzai/__init__.py
@@ -12,6 +12,6 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 
-"""Penzai: a JAX research toolkit for inspecting and patching neural networks."""
+"""A JAX research toolkit for building, editing, and visualizing neural networks."""
 
-__version__ = '0.0.1+dev'
+__version__ = '0.1.0'