[Proposal] Documentation: Map the Act Names to the Transformer #644

JuVogt · 2024-06-21T08:40:11Z

Proposal

Create a figure that maps the act names to the transformer architecture.

Motivation

Names are just conventions. I find it hard to get the exact position within the transformer block just from the act name. I.e. the resid_pre might be before the split happens or before the merge happens. So I put it in context to the other act names and work by exclusion process or modify it to see what values will change.

Pitch

I suggest using the images from the Vasvani paper and adding labeled arrows pointing to the hook positions.

Alternatives

A list or table of (act name, description) pairs.

Checklist

I have checked that there is no similar issue in the repo (required)

bryce13950 · 2024-06-26T00:29:09Z

@JuVogt Do you have time to handle this issue?

tjbai · 2024-07-23T05:20:10Z

I could put together something this week as a first PR for this project

JuVogt · 2024-07-30T09:30:18Z

I am willing to contribute as well, but I am currently out of time, sorry for that. I can come back after I finish my thesis at the end of the year and design something, but a first sketch would definitely help. Maybe I could then add a list with the act names including some more information about i.e. the dimensions and calculations behind it if someone already contributed a sketch or vice versa.

Also, I could add some more documentation with minimal examples beside the colabs that I think would help me in the beginning.

bryce13950 added documentation Improvements or additions to documentation complexity-moderate Moderately complicated issues for people who have intermediate experience with the code labels Jun 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Proposal] Documentation: Map the Act Names to the Transformer #644

[Proposal] Documentation: Map the Act Names to the Transformer #644

JuVogt commented Jun 21, 2024

bryce13950 commented Jun 26, 2024

tjbai commented Jul 23, 2024

JuVogt commented Jul 30, 2024

[Proposal] Documentation: Map the Act Names to the Transformer #644

[Proposal] Documentation: Map the Act Names to the Transformer #644

Comments

JuVogt commented Jun 21, 2024

Proposal

Motivation

Pitch

Alternatives

Checklist

bryce13950 commented Jun 26, 2024

tjbai commented Jul 23, 2024

JuVogt commented Jul 30, 2024