Create dataset loader for Vietnamese Hate and Offensive Spans Detection (ViHOS) #218

SamuelCahyawijaya · 2023-12-26T03:26:38Z

Dataloader name: vihos/vihos.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?vihos

Dataset	vihos
Description	This dataset consists of human-annotated hateful and offensive spans in Vietnamese Facebook and Youtube comments. Each comment has a corresponding list of indices indicating the characters included in these hate and offensive spans. Individual words and syllables are also tagged as inside or outside spans using the Inside-Outside-Beginning (IOB) tagging representation.
Subsets	-
Languages	vie
Tasks	Hate Speech Detection
License	MIT (mit)
Homepage	https://github.com/phusroyal/ViHOS
HF URL	-
Paper URL	https://aclanthology.org/2023.eacl-main.47

The text was updated successfully, but these errors were encountered:

elyanah-aco · 2023-12-26T06:07:20Z

#self-assign

github-actions · 2024-01-10T02:06:47Z

Hi, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.

elyanah-aco · 2024-01-18T14:11:06Z

@holylovenia @SamuelCahyawijaya @sabilmakbar

Was planning to implement ABUSIVE_LANGUAGE_PREDICTION task here.

Would like your thoughts on whether dataset can also support SPAN_BASED_ABSA task or not. Spans are BIO-tagged, but all spans would be labelled "offensive".

github-actions · 2024-02-04T02:02:23Z

Hi @, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.

holylovenia · 2024-02-25T08:49:24Z

Spans are BIO-tagged

Hi @elyanah-aco, sorry I missed your question. Doesn't the sequence labeling version of the data use B- and I- for offensive words and O for others? This is my assumption based on a quick look of the data.

SamuelCahyawijaya added this to SEACrowd Data Hub Dec 26, 2023

SamuelCahyawijaya converted this from a draft issue Dec 26, 2023

github-actions bot assigned elyanah-aco Dec 26, 2023

github-actions bot added the staled-issue label Jan 10, 2024

github-actions bot removed the staled-issue label Jan 19, 2024

github-actions bot added the staled-issue label Feb 4, 2024

github-actions bot removed the staled-issue label Feb 26, 2024

github-actions bot added the staled-issue label Mar 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create dataset loader for Vietnamese Hate and Offensive Spans Detection (ViHOS) #218

Create dataset loader for Vietnamese Hate and Offensive Spans Detection (ViHOS) #218

SamuelCahyawijaya commented Dec 26, 2023

elyanah-aco commented Dec 26, 2023

github-actions bot commented Jan 10, 2024

elyanah-aco commented Jan 18, 2024

github-actions bot commented Feb 4, 2024

holylovenia commented Feb 25, 2024

Create dataset loader for Vietnamese Hate and Offensive Spans Detection (ViHOS) #218

Create dataset loader for Vietnamese Hate and Offensive Spans Detection (ViHOS) #218

Comments

SamuelCahyawijaya commented Dec 26, 2023

elyanah-aco commented Dec 26, 2023

github-actions bot commented Jan 10, 2024

elyanah-aco commented Jan 18, 2024

github-actions bot commented Feb 4, 2024

holylovenia commented Feb 25, 2024