Kocyigit, Muhammed Yusuf, Anietie Andy, and Derry Wijaya. “A Novel Method for Analysing Racial Bias: Collection of Person Level References.” arXiv preprint arXiv:2310.15847 (2023).
Long-term exposure to biased content in literature or media can significantly influence people’s perceptions of reality,
leading to the development of implicit biases that are difficult
to detect and address (Gerbner 1998). In this study, we propose a novel method to analyze the differences in representation between two groups and use it examine the representation of African Americans and White Americans in books
between 1850 to 2000 with the Google Books dataset (Goldberg and Orwant 2013). By developing better tools to understand differences in representation, we aim to contribute to
the ongoing efforts to recognize and mitigate biases. To improve upon the more common phrase-based (men, women,
white, black, etc) methods to differentiate context (Tripodi
et al. 2019; Lucy, Tadimeti, and Bamman 2022), we propose
collecting a comprehensive list of historically significant figures and using their names to select relevant context. This
novel approach offers a more accurate and nuanced method
for detecting implicit biases through reducing the risk of selection bias. We create group representations for each decade
and analyze them in an aligned semantic space (Hamilton,
Leskovec, and Jurafsky 2016). We further support our results
by assessing the time-adjusted toxicity (Bassignana, Basile,
and Patti 2018) in the context for each group and identifying
the semantic axes (Lucy, Tadimeti, and Bamman 2022) that
exhibit the most significant differences between the groups
across decades. We support our method by showing that our
proposed method can capture known socio-political changes
accurately and our findings indicate that while the relative
number of African American names mentioned in books have
increased over time, the context surrounding them remains
more toxic than white Americans.
@article{kocyigit2023novel, title={A Novel Method for Analysing Racial Bias: Collection of Person Level References}, author={Kocyigit, Muhammed Yusuf and Andy, Anietie and Wijaya, Derry}, journal={arXiv preprint arXiv:2310.15847}, year={2023} }
Kuwanto, Garry, Eno-Abasi E. Urua, Priscilla Amondi Amuok, Shamsuddeen Hassan Muhammad, Anuoluwapo Aremu, Verrah Otiende, Loice Emma Nanyanga et al. “Mitigating Translationese in Low-resource Languages: The Storyboard Approach.” In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 11349-11360. 2024.
Low-resource languages often face challenges in acquiring high-quality language data due to the reliance on
translation-based methods, which can introduce the translationese effect. This phenomenon results in translated
sentences that lack fluency and naturalness in the target language. In this paper, we propose a novel approach
for data collection by leveraging storyboards to elicit more fluent and natural sentences. Our method involves
presenting native speakers with visual stimuli in the form of storyboards and collecting their descriptions without
direct exposure to the source text. We conducted a comprehensive evaluation comparing our storyboard-based
approach with traditional text translation-based methods in terms of accuracy and fluency. Human annotators
and quantitative metrics were used to assess translation quality. The results indicate a preference for text translation in terms of accuracy, while our method demonstrates worse accuracy but better fluency in the language focused.
@inproceedings{kuwanto2024mitigating, title={Mitigating Translationese in Low-resource Languages: The Storyboard Approach}, author={Kuwanto, Garry and Urua, Eno-Abasi E and Amuok, Priscilla Amondi and Muhammad, Shamsuddeen Hassan and Aremu, Anuoluwapo and Otiende, Verrah and Nanyanga, Loice Emma and Nyoike, Teresiah W and Akpan, Aniefon D and Ab Udouboh, Nsima and others}, booktitle={Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)}, pages={11349--11360}, year={2024} }
A Andy, G Sherman, SC Guntuku, Plos one 17 (9), e0273636, 2022
J Tong, M Kearney, A Andy, A Naik, M Sharma, D Boateng, E Akpek, …, 2022 Annual Research Meeting, 2022
A Andy, R Kriz, SC Guntuku, DT Wijaya, C Callison-Burch, Proceedings of the Thirteenth Language Resources and Evaluation Conference …, 2022
E Nwafor, A Andy, Proceedings of the Thirteenth Language Resources and Evaluation Conference …, 2022
A Andy, S Liu, D Ippolito, R Kriz, C Callison-Burch, D Wijaya, arXiv preprint arXiv:2203.08931, 2022
JKC Tong, E Akpek, A Naik, M Sharma, D Boateng, A Andy, RM Merchant, …, JAMA network open 5 (2), e220715-e220715, 2022
K El-Jack, K Henderson, AU Andy, L Southwick, International Journal of Medical Students 10 (4), 370-374, 2022
A Andy, PloS one 16 (9), e0257791, 2021
A Andy, U Andy, JMIR cancer 7 (3), e29555, 2021
J Tong, AU Andy, RM Merchant, RR Kelz, JAMA Network Open 4 (9), e2126118-e2126118, 2021
A Andy, JMIR Formative Research 5 (7), e28738, 2021
A Andy, B Chu, R Fathy, B Bennett, D Stokes, SC Guntuku, Proceedings of the 12th International Workshop on Health Text Mining and …, 2021
AU Andy, SC Guntuku, S Adusumalli, DA Asch, PW Groeneveld, …, JMIR cardio 5 (1), e24473, 2021
A Andy, International Workshop on Health Intelligence, 65-74, 2021
DT Wijaya, A Andy, C Callison-Burch, Association for Computational Linguistics, 2020
A Andy, S Guntuku, arXiv preprint arXiv:2011.05103, 2020
A Andy, B Chu, R Fathy, B Bennet, D Stokes, S Guntuku
K Ryskina, A Andy, K Manges, K Foley, R Werner, R Merchant, 2020 Virtual Annual Research Meeting, 2020
DC Stokes, A Andy, SC Guntuku, LH Ungar, RM Merchant, Journal of general internal medicine 35 (7), 2244-2247, 2020
KL Ryskina, AU Andy, KA Manges, KA Foley, RM Werner, RM Merchant, JAMA network open 3 (5), e204682-e204682, 2020
A Andy, D Wijaya, D Ippolito, C Callison-Burch, Proceedings of the Second Workshop on Storytelling, 112–116, 2019
A Andy, M Dredze, M Rwebangira, C Callison-Burch, Proceedings of the 3rd Workshop on Noisy User-generated Text, 40-44, 2017
A Andy, M Rwebangira, S Sekine, Proceedings of the Open Knowledge Base and Question Answering Workshop …, 2016
A Andy, S Sekine, M Rwebangira, M Dredze, Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT), 51-60, 2016
J Tong, WC MSHPc, Health Aff 35 (4), 697-705, 2016
A Andy, M Robert, M Chouikha, International Journal of Computer Applications 108 (18), 2014
AU Andy, AT Abayomi, CM Kennefick, International Journal of Applied Electromagnetics and Mechanics 32 (3), 133-143, 2010
C Liu, J Qu, Y Song, AU Andy, 2009 IEEE International Conference on Bioinformatics and Biomedicine …, 2009