Code for our EACL 2023 paper: UDApter - Efficient Domain Adaptation Using Adapters
Domadapter trains adapters for domain adaptation in NLP. The idea is to combine principles of unsupervised domain adaptation with parameter-efficient fine-tuning to make domain adaptation more efficient.
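Unsupervised domain adaptation here relies on minimizing a divergence between source- and target-domain representations (see the `divergences` module). As a generic illustration only, not the repo's implementation, here is a minimal biased estimate of squared Maximum Mean Discrepancy (MMD) with an RBF kernel:

```python
import numpy as np

def rbf_kernel(x, y, gamma=1.0):
    # Pairwise squared Euclidean distances -> RBF kernel matrix
    d2 = ((x[:, None, :] - y[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def mmd2(source, target, gamma=1.0):
    # Biased estimate of squared MMD between two feature batches.
    # A small value means the two domains look alike in kernel space.
    k_ss = rbf_kernel(source, source, gamma).mean()
    k_tt = rbf_kernel(target, target, gamma).mean()
    k_st = rbf_kernel(source, target, gamma).mean()
    return k_ss + k_tt - 2.0 * k_st
```

During adaptation, a divergence like this is added to the task loss so the adapter learns features that transfer across domains.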
The weights are now available at https://huggingface.co/domadapter
- Python >= 3.8
- Poetry for dependency and environment management
- direnv (Optional) - For automatically exporting environment variables
We use environment variables to store certain paths:
- Pretrained transformer models (`PT_MODELS_CACHE_DIR`)
- Datasets (`DATASET_CACHE_DIR`)
- Experiments and results (`OUTPUT_DIR`)

Set the following variables in the `.envrc` file:
export DATASET_CACHE_DIR=""
export PT_MODELS_CACHE_DIR=""
export OUTPUT_DIR=""
Then run `source .envrc`, or let direnv export the variables automatically.
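For example, a filled-in `.envrc` might look like this (the paths are illustrative; point them at any writable locations):

```shell
# Illustrative values only
export DATASET_CACHE_DIR="$HOME/.cache/domadapter/datasets"
export PT_MODELS_CACHE_DIR="$HOME/.cache/domadapter/models"
export OUTPUT_DIR="$HOME/domadapter_outputs"
```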
- Run `poetry install` to install the dependencies
- Use `poetry shell` to activate a virtual environment
- Commit your `poetry.lock` file if you install any new library
Note: We have tested this on a Linux machine. On an Apple M1 MacBook you might run into errors installing scipy, scikit-learn, etc.
- Run `domadapter download mnli` to download the `mnli` dataset
- Run `domadapter download sa` to download the `amazon` dataset
Using Google Colab?
!pip install pytorch-lightning==1.4.2 datasets transformers pandas click wandb numpy rich
You can run this on a CPU; it takes only a couple of minutes.
Joint Domain-Task Adapter for MNLI
domadapter results mnli-joint-dt --source [source_domain] --target [target_domain]
For MNLI, the source and target domains can be one of Fiction, Slate, Government, Telephone, Travel.
Other models and other datasets are coming soon.
See the `scripts` folder to train models. For example, to train the Joint-DT :electric_plug: model on the MNLI dataset, run `bash train_joint_da_ta_mnli.sh`.
├── commands (domadapter terminal commands)
├── datamodules (PyTorch Lightning DataModules to load the SA and MNLI datasets)
├── divergences (Different Divergence measures)
├── models (All the models listed in the paper)
├── orchestration (Instantiates the model, dataset, trainer and runs the experiments)
├── scripts (Bash scripts to run experiments)
├── utils (Useful utilities)
└── console.py (Python `Rich` console to pretty print everything)
@misc{https://doi.org/10.48550/arxiv.2302.03194,
doi = {10.48550/ARXIV.2302.03194},
url = {https://arxiv.org/abs/2302.03194},
author = {Malik, Bhavitvya and Kashyap, Abhinav Ramesh and Kan, Min-Yen and Poria, Soujanya},
keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences},
title = {UDApter -- Efficient Domain Adaptation Using Adapters},
publisher = {arXiv},
year = {2023},
copyright = {Creative Commons Attribution 4.0 International}
}