Link to paper
The full paper is available here.
You can also find the paper on PapersWithCode here.
Abstract
- Language models are not treating diverse demographic groups fairly.
- Most research on fairness has been focused on English.
- This paper looks at fairness in multilingual and non-English contexts.
- Current research is limited and cannot be scaled across languages and cultures.
Paper Content
Introduction
- Language models are susceptible to spurious correlations and encoding biases
- Representational harms refer to groups being misrepresented or underrepresented
- Allocational harms refer to inequitable distribution of resources and opportunities
- Bias can occur in multiple steps of the pipeline
- Most work on fairness in NLP is Anglo-centric
- NLP systems can reinforce and reproduce social and racial hierarchies
- Insufficient documentation of harms from unfair models
- Interplay between privacy, efficiency, and fairness in NLP is understudied
Metrics for measurement
- Monolingual systems have biases and challenges
- Bias in NLP models can be quantified with intrinsic and extrinsic measures
- WEAT and SEAT are commonly used metrics
- StereoSet and CrowS-Pair measure stereotypical proclivity
- Blodgett et al. (2021) points out flaws in data quality
- Other intrinsic measures have been proposed
- Recent studies compare metrics and provide insights into their potential drawbacks
Intrinsic vs extrinsic evaluation
- Intrinsic measures indicate the existence of representational bias in systems.
- Little work has been done on addressing bias in extrinsic evaluation.
- Extrinsic evaluation measures are important for bias mitigation objectives.
- Evaluating fairness on downstream task’s outputs allows us to gauge how a particular demographic may be affected by biases.
- Monolingual studies have unanswered questions and inconclusive results.
- Template-based data may be unreliable and natural sentence prompts should be used.
Fairness from the lens of multiple social dimensions
- Existing literature focuses on gender bias, not other dimensions like race and religion.
- Evaluation metrics should be able to evaluate harms over intersectional identities.
- Previous research has emphasized the importance of fairness evaluation and mitigation over intersectional identities.
- Most fairness measures do not account for intersectionality of identities.
- Need more work in benchmarking debiasing techniques to assess their effectiveness.
Linguistic aspects
- Linguistic variations between languages pose additional problems in multilingual NLP
- Gender has multiple definitions in linguistic terms
- Grammatical gender can affect bias in multilingual and monolingual spaces
- Referential gender deals with terms that address a person’s gender
- Lexical gender deals with terms that describe gender
- Bio-social gender involves a mixture of phenotypic traits, gender expression, and identity
- Variations in pronoun complexity and linguistic forms can lead to pronoun changes based on social dynamic
Grammatically gendered languages
- Linguistics recognizes multiple forms of gender
- Grammatically gendered languages have 2-20 forms of gender divisions
- English is not a grammatically gendered language
- Zhou et al. (2019) examines bias from the view of grammatically gendered languages
- Gonen et al. (2019) show that grammatical gender affects word representations
- Cultural context must be incorporated when curating data for languages
- Hovy and Yang (2021) claim NLP literature oversimplifies information content
- España-Bonet and Barrón-Cedeño (2022) advocate for culturally-sensitive datasets
- Kaneko et al. (2022) proposes a way to generate parallel corpora for other languages
Multilingual models
- Multilingual spaces allow words to be mapped to their equivalents in other languages
- Training multilingual language models can improve cross-lingual performance on low-resource languages
- Performance across languages begins to dip as the number of languages increases
- Few studies explore the impact of multilingual training on biases
An outline of fairness evaluation in the context of multilinguality
- Several datasets have been created for multilingual evaluation
- Zhao et al. (2020) quantified biases in multilingual spaces
- Factors that influence bias include language’s linguistic properties, target language used for alignment, and transfer learning
- Huang et al. (2020) released first multilingual Twitter corpus for hate speech detection
- Liang et al. (2020) proposed a novel debiasing method
- Câmara et al. (2022) studied gender and ethnicity in multilingual language models
- Kaneko et al. (2022) proposed a method to use parallel corpora to evaluate bias
- Wolfe and Caliskan (2022) and Wolfe et al. (2022) studied fairness in multimodal contexts
- Wang et al. (2022) studied fairness from a multilingual view in multimodal representations
- Talat et al. (2022) expressed criticism over primary data source for multilingual large language models being English
An outline of fairness mitigation in the context of multilinguality
- Multilingual spaces are composed of different languages with different properties
- Balancing corpus and transferring to a gendered language’s embedding space can reduce bias
- FEDA domain adaption technique can be used to mitigate bias in multilingual text classification
- Adversarial training, token masking, and instance weighting can reduce impact of data instances that lead to bias
- Zero-shot debiasing may be beneficial, but further study is needed to confirm
Problems in multilingual evaluation and mitigation
- Lack of datasets and literature for evaluation across tasks
- Monolingual research not replicated in multilingual setting
- Datasets neglect less-represented demographics
- Variations in annotator information can result in bias
- No work on evaluating bias in language contact settings
- Models may not learn useful representations of marginalized identities
Culture
- Language and culture are intrinsically linked.
- NLP research has historically focused on information content, not contextual information.
- Hovy and Yang (2021) proposed a taxonomy of 7 social factors to incorporate into models.
- Variations in language can be controlled by the speaker and receiver.
- Systems can discriminate against marginalized communities.
- Cultural taboos and stereotypes can be highly localized.
- Gender is often treated as a binary variable, but this does not reflect real-world settings.
Moving towards inclusive systems in all languages
- Multilingualism presents challenges in terms of fairness
- Current solutions may not be viable
- Language and social aspects are ever-evolving
- Social interactions from diverse linguistic backgrounds need to be studied
- Models can exhibit bias based on linguistic variations
- LLMs are often biased towards certain demographic strata
- Interdisciplinary and multicultural teams are needed to identify and mitigate bias
- Better evaluation benchmarks and data collection are needed
- Zero-shot techniques should be used with care in fairness-critical applications
- Multilingual models need to incorporate shared value systems
Limitations
- Examines fairness literature in languages other than English
- Identifies bias measurement and mitigation strategies
- Notes that authors have a heterogeneous background which may lead to missing diverse perspectives on linguistic and cultural aspects of bias