
unify how to freeze some parameters for coca pre-training #526

Closed

Conversation

zhangtemplar
Contributor

Summary:

  1. We already support freezing the vision encoder; as experiments progress, we want to freeze other parts of CoCa as well, e.g., the text decoder. This diff provides a unified way of freezing/unfreezing modules, the same way we do for linear probing or fine-tuning.
  2. Add a configuration option to use an MLP instead of an attention pooler for the vision adapter.
  3. For the output projection in the text decoder, change bias=False to bias=True. Many other places, e.g., the LP head, Ember's output module, and LLaVA, use bias=True (the default value in Linear).

Differential Revision:
D54559503

Privacy Context Container: 303860477774201
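The unified freezing described in item 1 can be sketched in plain PyTorch: toggle `requires_grad` on all parameters of a chosen submodule. This is a minimal illustration only; the module names (`vision_encoder`, `text_decoder`) and the helper are hypothetical stand-ins, not the actual API added by this diff.

```python
import torch
from torch import nn

def set_requires_grad(module: nn.Module, requires_grad: bool) -> None:
    """Freeze or unfreeze every parameter of a module in place."""
    for p in module.parameters():
        p.requires_grad_(requires_grad)

# Toy stand-in for a CoCa-style model; module names are illustrative only.
model = nn.ModuleDict({
    "vision_encoder": nn.Linear(8, 8),
    "text_decoder": nn.Linear(8, 8),
})

# Freeze the text decoder while leaving the vision encoder trainable.
set_requires_grad(model["text_decoder"], False)

trainable = [n for n, p in model.named_parameters() if p.requires_grad]
```

With this pattern, which parts of the model are frozen becomes a config choice rather than hard-coded logic, which is what makes it reusable across linear probing, fine-tuning, and pre-training.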

Mar 13, 2024: @facebook-github-bot added the label CLA Signed (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed).
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D54559503

zhangtemplar added a commit to zhangtemplar/multimodal that referenced this pull request Mar 14, 2024
…search#526)


zhangtemplar added a commit to zhangtemplar/multimodal that referenced this pull request Mar 14, 2024
…search#526)


@codecov-commenter

codecov-commenter commented Mar 14, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 75.62%. Comparing base (dbeed97) to head (88933e9).

@@           Coverage Diff           @@
##             main     #526   +/-   ##
=======================================
  Coverage   75.61%   75.62%           
=======================================
  Files         234      234           
  Lines       16122    16126    +4     
=======================================
+ Hits        12191    12195    +4     
  Misses       3931     3931           


zhangtemplar added a commit to zhangtemplar/multimodal that referenced this pull request Mar 20, 2024
…search#526)


zhangtemplar added a commit to zhangtemplar/multimodal that referenced this pull request Mar 21, 2024
…search#526)


zhangtemplar added a commit to zhangtemplar/multimodal that referenced this pull request Mar 29, 2024
…search#526)



zhangtemplar added a commit to zhangtemplar/multimodal that referenced this pull request Apr 8, 2024
Summary:

1. For the output projection in the text decoder, change bias=False to bias=True. Many other places, e.g., the LP head, Ember's output module, and LLaVA, use bias=True (the default value in Linear).
2. Add a configuration option to use an MLP instead of an attention pooler for the vision adapter.

Differential Revision:
D55897450

Privacy Context Container: 303860477774201
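Item 2 above (MLP instead of an attention pooler for the vision adapter) can be sketched as a per-token MLP that preserves the token dimension. This is a hypothetical illustration; the function name `build_vision_adapter`, the config key `pooler_type`, and the layer sizes are assumptions, not the API introduced by this diff.

```python
import torch
from torch import nn

def build_vision_adapter(pooler_type: str, dim: int = 32, hidden_dim: int = 64) -> nn.Module:
    # Hypothetical config switch: "mlp" builds a simple per-token MLP;
    # an "attention" branch would return an attention pooler instead.
    if pooler_type == "mlp":
        return nn.Sequential(
            nn.Linear(dim, hidden_dim),
            nn.GELU(),
            nn.Linear(hidden_dim, dim),
        )
    raise ValueError(f"unsupported pooler_type: {pooler_type}")

adapter = build_vision_adapter("mlp")
tokens = torch.randn(2, 10, 32)  # (batch, sequence, dim)
out = adapter(tokens)            # shape is preserved: (2, 10, 32)
```

Unlike an attention pooler, this MLP keeps one output per input token rather than pooling the sequence down to a fixed number of queries.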
facebook-github-bot pushed a commit that referenced this pull request Apr 25, 2024
Summary:
Pull Request resolved: #527

Pull Request resolved: #526

1. For the output projection in the text decoder, change bias=False to bias=True. Many other places, e.g., the LP head, Ember's output module, and LLaVA, use bias=True (the default value in Linear).
2. Add a configuration option to use an MLP instead of an attention pooler for the vision adapter.

Reviewed By: Bellaktris

Differential Revision:
D55897450

Privacy Context Container: 303860477774201

fbshipit-source-id: 8e012b0c3d37566364f216dbfa8aec389142afe1
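The bias change in item 1 amounts to relying on `nn.Linear`'s default rather than opting out of it. A minimal sketch (the 512/1000 dimensions are arbitrary illustration, not the model's actual sizes):

```python
import torch
from torch import nn

# bias=True is nn.Linear's default, matching the LP head, Ember's output
# module, and LLaVA as cited in the summary above.
proj_with_bias = nn.Linear(512, 1000)             # bias defaults to True
proj_without_bias = nn.Linear(512, 1000, bias=False)

assert proj_with_bias.bias is not None and proj_with_bias.bias.shape == (1000,)
assert proj_without_bias.bias is None
```

Adding the bias introduces one extra parameter per output unit; for an output projection onto a vocabulary it lets each logit carry a learned offset independent of the input.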