CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies
"Globally, people express pride, celebrate, and respect cultural diversity, while acknowledging and working towards reducing cultural bias"
------ CultureBank
In this project, we explore the cultural awarenesss in language models. To this end, we introduce CultureBank, a knowledge base built upon users' self-narratives with 12k cultural descriptors sourced from TikTok and 11k from Reddit. With CultureBank, we evaluate different LLMs' cultural awareness, and identify areas for improvement. We also fine-tune a language model on CultureBank: experiments show that it achieves better performances on two downstream cultural tasks in a zero-shot setting. Finally, we offer recommendations based on our findings for future culturally aware language technologies.