Create new task and schema for dialogue system datasets #172

Closed
holylovenia opened this issue Dec 10, 2023 · 5 comments · Fixed by #237
Labels: enhancement, help wanted, pr-ready

Comments

@holylovenia
Contributor

Relates to #53 and an existing dataloader (i.e., CoD).

Question for thought: since the Wizard-of-Oz (WOZ) format is quite popular for task-oriented dialogue systems, should we make our schema similar to it? (cc: @SamuelCahyawijaya @sabilmakbar)

@holylovenia added the enhancement and help wanted labels on Dec 10, 2023
@sabilmakbar
Collaborator

sabilmakbar commented Dec 10, 2023

Hi @holylovenia, are the CoD and xPersona dataloaders included in this revamp, too?

*CoD has already been mentioned.

@sabilmakbar
Collaborator

I think there are a few more dataloaders that will use this schema:

  1. Create dataset loader for GlobalWoZ V2.0 (#83)
  2. Create dataset loader for IndoSMD (#54)

Therefore, shall we make this a high-priority task?

@sabilmakbar
Collaborator

btw @holylovenia, do you happen to know the dataset link (or at least a schema explanation) for the original WoZ paper?

I also read a few other relevant articles/papers worth a look:

  1. WoZ Framework (the wiki article on the WoZ framework that the WoZ paper refers to; from the same inventor, Kelley)
  2. MultiWoZ
  3. Evaluation using MultiWoZ

@holylovenia
Contributor Author

Hi @sabilmakbar, sorry for the late reply. Yes, let's make it a high priority. As far as I know, the original dataset should be this.

@dehanalkautsar
Collaborator

Hi @holylovenia @sabilmakbar @SamuelCahyawijaya. I've implemented the schema based on the link above (https://huggingface.co/datasets/woz_dialogue). The PR is #237. Please let me know as soon as possible if there are any mistakes, because I need this schema to implement #53, #54, and #83. Thanks!
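
For anyone skimming this thread, here is a rough sketch of what a WOZ-style record for a task-oriented dialogue could look like. It is not the schema from #237. The class and field names (`DialogueTurn`, `Dialogue`, `belief_state`, `system_acts`, `final_belief_state`) are hypothetical and only loosely modelled on WOZ-style data, where each turn pairs a system utterance with a user utterance plus slot information. The sample utterances are made up for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class DialogueTurn:
    """One exchange: what the system said, then what the user replied."""
    turn_idx: int
    system_utterance: str
    user_utterance: str
    # Assumption: slots newly expressed in this turn, as slot -> value.
    belief_state: Dict[str, str] = field(default_factory=dict)
    # Assumption: system dialogue acts as plain strings.
    system_acts: List[str] = field(default_factory=list)


@dataclass
class Dialogue:
    """A full dialogue made of ordered turns."""
    dialogue_idx: int
    turns: List[DialogueTurn] = field(default_factory=list)

    def to_flat_pairs(self) -> List[Tuple[str, str]]:
        """Return (system, user) utterance pairs in turn order."""
        return [(t.system_utterance, t.user_utterance) for t in self.turns]

    def final_belief_state(self) -> Dict[str, str]:
        """Merge per-turn slots; later turns override earlier ones."""
        state: Dict[str, str] = {}
        for turn in self.turns:
            state.update(turn.belief_state)
        return state


# Made-up example data, not taken from any dataset.
example = Dialogue(
    dialogue_idx=0,
    turns=[
        DialogueTurn(0, "", "I want a cheap restaurant.", {"price range": "cheap"}),
        DialogueTurn(1, "Which area?", "The north, please.",
                     {"area": "north"}, ["request(area)"]),
    ],
)

print(example.final_belief_state())
```

Keeping per-turn slot updates and deriving the cumulative state on demand is just one possible design; a schema could equally store the full belief state at every turn.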

@sabilmakbar added the pr-ready label on Dec 28, 2023
3 participants