Few-Shot Slot Tagging and NER

This is the PyTorch implementation of the paper: Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding. Su Zhu, Ruisheng Cao, Lu Chen and Kai Yu. [PDF]

We achieve a new state-of-the-art of few-shot slot tagging on SNIPS and NER datasets.

Requirements

pytorch >= 1.4.0
transformers >= 3.3.1

Data

Download the SNIPS and NER dataset formatted as episodes.

❱❱❱ bash prepare_data.sh

Data statistic:

❱❱❱ python3 data_utils/data_statistic.py --data_path data/xval_ner_shot_1_out_1

Training & Validation & Evaluation

We try ten different random seeds (999, 189, 114, 929, 290, 848, 538, 874, 295, 266) and report average F1 scores in the paper .
For other data splits, please change the dataset path as "xval_snips_shot_{1,5}_out_{1,2,3,4,5,6,7}" and "xval_ner_shot_{1,5}_out_{1,2,3,4}".

ProtoNet+CDT+VP:

bash run_few_shot_slot_tagger_protoNet_with_pure_bert.sh \
    --matching_similarity_y ctx \
    --matching_similarity_type xy1 \
    --matching_similarity_function dot \
    --test_finetune false \
    --dataset_name HIT_ner_shot_5_out_1 \
    --dataset_path ./data/xval_ner_shot_5_out_1 \
    --random_seed 999 \
    --model_removed no

L-ProtoNet+CDT+VP:

bash run_few_shot_slot_tagger_protoNet_with_pure_bert.sh \
    --matching_similarity_y ctx_desc \
    --matching_similarity_type xy1 \
    --matching_similarity_function dot \
    --test_finetune false \
    --dataset_name HIT_ner_shot_5_out_1 \
    --dataset_path ./data/xval_ner_shot_5_out_1 \
    --random_seed 999 \
    --model_removed no

ProtoNet+CDT+VPB:

bash run_few_shot_slot_tagger_protoNet_with_pure_bert.sh \
    --matching_similarity_y ctx \
    --matching_similarity_type xy \
    --matching_similarity_function euclidean2 \
    --test_finetune false \
    --dataset_name HIT_ner_shot_5_out_1 \
    --dataset_path ./data/xval_ner_shot_5_out_1 \
    --random_seed 999 \
    --model_removed no

You can refer to an example of saved log file which is produced by the above script.

L-ProtoNet+CDT+VPB:

bash run_few_shot_slot_tagger_protoNet_with_pure_bert.sh \
    --matching_similarity_y ctx_desc \
    --matching_similarity_type xy \
    --matching_similarity_function euclidean2 \
    --test_finetune false \
    --dataset_name HIT_ner_shot_5_out_1 \
    --dataset_path ./data/xval_ner_shot_5_out_1 \
    --random_seed 999 \
    --model_removed no

If you want to keep fine-tuning the model with the support set of the target domain after pre-training on source domains, please set "--test_finetune true".

Citation

This code has been written using PyTorch >= 1.4.0. If you use any source codes included in this toolkit in your work, please cite the following paper. The bibtex is listed below:

@article{zhu2020vector,
  title={Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding},
  author={Zhu, Su and Cao, Ruisheng and Chen, Lu and Yu, Kai},
  journal={arXiv preprint arXiv:2009.09568},
  year={2020}
}
@InProceedings{zhu2021few,
author="Zhu, Su and Chen, Lu and Cao, Ruisheng and Chen, Zhi and Miao, Qingliang and Yu, Kai",
title="Few-Shot NLU with Vector Projection Distance and Abstract Triangular CRF",
booktitle="Natural Language Processing and Chinese Computing",
year="2021",
pages="505--516",
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
data		data
data_utils		data_utils
models		models
scripts		scripts
utils		utils
.gitignore		.gitignore
README.md		README.md
example_log_file.txt		example_log_file.txt
path.sh		path.sh
prepare_data.sh		prepare_data.sh
run_few_shot_slot_tagger_protoNet_with_pure_bert.sh		run_few_shot_slot_tagger_protoNet_with_pure_bert.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Few-Shot Slot Tagging and NER

Requirements

Data

Training & Validation & Evaluation

Citation

About

Releases

Packages

Languages

sz128/few_shot_slot_tagging_and_NER

Folders and files

Latest commit

History

Repository files navigation

Few-Shot Slot Tagging and NER

Requirements

Data

Training & Validation & Evaluation

Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages