
2018, EMNLP, Reducing Gender Bias in Abusive Language Detection #12

Open
Rounique opened this issue Oct 18, 2021 · 8 comments
Assignees
Labels
literature-review Summary of the paper related to the work

Comments

@Rounique
Contributor

No description provided.

@Rounique Rounique self-assigned this Oct 18, 2021
@Rounique Rounique added the literature-review Summary of the paper related to the work label Oct 18, 2021

@hosseinfani
Member

@Rounique
your summary?

@Rounique
Contributor Author

As I mentioned, I haven't done the summaries yet. I'll do them after I finish this week's assignments.

@Rounique
Contributor Author

#11

@hosseinfani
Member

@Rounique
Any update?

@Rounique
Contributor Author

Rounique commented Nov 26, 2021

Title: Reducing Gender Bias in Abusive Language Detection
Venue: EMNLP
Year: 2018

Introduction
As the use of social media and online platforms increases, people share their ideas and words more and more. Automatic detection of abusive language therefore plays an important role, since abusive language can lead to cyber-bullying, personal trauma, hate crime, and discrimination. Using machine learning and NLP to detect abusive language automatically is useful for many websites and social media services.

In this paper, gender bias is measured on models trained with abusive language datasets, and methods are introduced for mitigating these biases. Bias is measured with a generated unbiased test set, and the mitigation methods are: (1) debiased word embeddings, (2) gender-swap data augmentation, and (3) fine-tuning with a larger, less biased corpus.

Datasets:
Sexist Tweets and Abusive Tweets.

Measuring Gender Biases
It is not possible to measure gender bias on the dataset the model was trained on, since the test split will follow the same biases. Therefore, it is necessary to generate an unbiased test set.
The test set generated in this work contains 1,152 samples (576 pairs), built by filling templates with common gender-identity pairs (e.g., male/female, man/woman). The templates contain both neutral and offensive nouns and adjectives from the vocabulary, keeping neutral and abusive samples balanced.
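The template-filling step can be sketched as follows (a minimal illustration; the template strings and identity pairs below are invented, not the paper's actual lists):

```python
from itertools import product

# Gender-identity pairs used to fill the templates (illustrative subset).
identity_pairs = [("man", "woman"), ("boy", "girl"), ("he", "she")]

# Templates with a slot for the identity term; the paper mixes neutral and
# offensive words so the set stays balanced (mild placeholders here).
templates = [
    "You are a {} and you are smart",   # neutral
    "You are a {} and you are awful",   # offensive placeholder
]

def build_test_set(templates, identity_pairs):
    """Fill each template with both members of each identity pair,
    yielding matched (male, female) sentence pairs."""
    samples = []
    for tmpl, (m, f) in product(templates, identity_pairs):
        samples.append((tmpl.format(m), tmpl.format(f)))
    return samples

pairs = build_test_set(templates, identity_pairs)
print(len(pairs))  # 2 templates x 3 identity pairs = 6 pairs (12 samples)
```

A model is then considered unbiased to the extent that it scores both members of each pair the same.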

Mitigating Bias

Debiased Word Embeddings (DE)
An algorithm that corrects word embeddings by removing gender-stereotypical information from them.
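The debiasing step can be illustrated by the core projection of hard-debiasing (in the style of Bolukbasi et al., 2016): remove each word vector's component along a gender direction. A toy sketch with plain Python lists and invented numbers:

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def scale(v, s):
    return [a * s for a in v]

def sub(u, v):
    return [a - b for a, b in zip(u, v)]

def normalize(v):
    n = dot(v, v) ** 0.5
    return [a / n for a in v]

def neutralize(word_vec, gender_dir):
    """Remove the component of word_vec along the gender direction,
    leaving the rest of the vector unchanged."""
    g = normalize(gender_dir)
    return sub(word_vec, scale(g, dot(word_vec, g)))

# Toy 3-d example: the gender direction (e.g. he - she) is made up here;
# in practice it is estimated from several gendered word pairs.
gender_dir = [1.0, 0.0, 0.0]
v = [0.7, 0.2, 0.5]            # embedding of a gender-neutral word
v_debiased = neutralize(v, gender_dir)
print(v_debiased)               # component along gender_dir projected to 0
```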

Gender Swap (GS)
Here, male entities are identified and swapped with equivalent female entities, and vice versa. This simple method removes the correlation between gender and the classification decision and has proven effective for correcting gender biases.
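A minimal sketch of the swap, assuming a small hand-made lexicon of gendered pairs (the real method uses a much fuller list):

```python
import re

# Illustrative gendered-pair lexicon; each entry maps to its counterpart
# in both directions.
PAIRS = {"he": "she", "man": "woman", "boy": "girl", "father": "mother"}
SWAP = {**PAIRS, **{v: k for k, v in PAIRS.items()}}

def gender_swap(sentence):
    """Replace each gendered token with its opposite-gender counterpart,
    preserving capitalization of the first letter."""
    def repl(m):
        w = m.group(0)
        swapped = SWAP.get(w.lower(), w)
        return swapped.capitalize() if w[0].isupper() else swapped
    return re.sub(r"[A-Za-z]+", repl, sentence)

print(gender_swap("He is a man"))  # -> "She is a woman"
```

For augmentation, the training set is extended with the swapped copy of each sentence, so gendered words appear equally often in both classes.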

Bias fine-tuning (FT)
A method that uses transfer learning from a less biased corpus to reduce bias: a model is first trained on a larger, less biased source corpus and then fine-tuned on the target corpus.
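The two-stage procedure can be sketched with a toy logistic-regression classifier (the features, corpora, and hyperparameters below are all made up for illustration; the paper fine-tunes a neural classifier):

```python
import math

def sigmoid(z):
    return 1 / (1 + math.exp(-z))

def sgd_train(weights, data, lr=0.5, epochs=200):
    """One training stage of a tiny logistic-regression classifier."""
    for _ in range(epochs):
        for x, y in data:
            p = sigmoid(sum(w * xi for w, xi in zip(weights, x)))
            weights = [w - lr * (p - y) * xi for w, xi in zip(weights, x)]
    return weights

# Toy binary features: [bias, contains_abusive_word, mentions_gender_word].
# Stage 1: pretrain on the larger, less biased source corpus (invented data
# in which the label depends only on the abusive-word feature).
source = [([1, 1, 0], 1), ([1, 0, 0], 0), ([1, 1, 1], 1), ([1, 0, 1], 0)]
# Stage 2: fine-tune on the smaller target corpus, starting from stage-1 weights.
target = [([1, 1, 0], 1), ([1, 0, 0], 0)]

w = sgd_train([0.0, 0.0, 0.0], source)       # pretraining
w = sgd_train(w, target, lr=0.1, epochs=50)  # fine-tuning continues from w

def predict(weights, x):
    return sigmoid(sum(wi * xi for wi, xi in zip(weights, x))) > 0.5

print(predict(w, [1, 1, 1]))  # abusive regardless of the gender mention
```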

Metric used:
AUC
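AUC can be read as the probability that a randomly chosen abusive sample scores above a randomly chosen non-abusive one; a small reference implementation (the scores in the example are made up):

```python
def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly,
    counting ties as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

print(auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]))  # 0.75
```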

Conclusion
The proposed methods are found to reduce gender bias by up to 90-98%, improving the robustness of the models.

Future Work
Increasing classification performance and reducing the bias at the same time.

Code
https://github.com/conversationai/unintended-ml-bias-analysis

@Rounique
Contributor Author

The summary is added.

@hosseinfani
Member

@Rounique
please explore their codebase; there is more good info there:
https://perspectiveapi.com/how-it-works/
https://conversationai.github.io/

@hosseinfani hosseinfani transferred this issue from fani-lab/OpeNTF Mar 12, 2022