
Conference: Workshop on Natural Language Processing and Computational Social Sciences

Is Wikipedia succeeding in reducing gender bias? Assessing changes in gender bias in Wikipedia using word embeddings



Abstract

Large text corpora used for creating word embeddings (vectors which represent word meanings) often contain stereotypical gender biases. As a result, such unwanted biases will typically also be present in word embeddings derived from such corpora and in downstream applications in the field of natural language processing (NLP). To minimize the effect of gender bias in these settings, more insight is needed into where and how biases manifest themselves in the text corpora employed. This paper contributes by showing how gender bias in word embeddings from Wikipedia has developed over time. Quantifying the gender bias over time shows that art-related words have become more female biased. Family and science words have stereotypical biases towards female and male words, respectively. These biases seem to have decreased since 2006, but these changes are not more extreme than those seen in random sets of words. Career-related words are more strongly associated with male than with female words; this difference has become smaller only in recently written articles. These developments provide additional understanding of what can be done to make Wikipedia more gender neutral and of how important time of writing can be when considering biases in word embeddings trained from Wikipedia or from other text corpora.
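
The abstract does not say which bias measure the paper uses. As a rough illustration of what "quantifying gender bias" in word embeddings can look like, the sketch below assumes a simple association score: for each target word, the mean cosine similarity to a set of male words minus the mean cosine similarity to a set of female words. The function names and the 2-dimensional toy vectors are hypothetical and are not taken from the paper or from Wikipedia data.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_similarity(word_vec, attribute_vecs):
    """Average cosine similarity of one word to a set of attribute words."""
    return sum(cosine(word_vec, a) for a in attribute_vecs) / len(attribute_vecs)


def gender_bias(target_vecs, male_vecs, female_vecs):
    """Assumed bias score (not necessarily the paper's measure).

    Positive values mean the target set sits closer to the male words,
    negative values mean it sits closer to the female words.
    """
    scores = [
        mean_similarity(t, male_vecs) - mean_similarity(t, female_vecs)
        for t in target_vecs
    ]
    return sum(scores) / len(scores)


# Hypothetical toy embeddings: axis 0 leans "male", axis 1 leans "female".
male = [[1.0, 0.0]]
female = [[0.0, 1.0]]
career = [[0.9, 0.1], [0.8, 0.3]]
family = [[0.2, 0.9]]

career_bias = gender_bias(career, male, female)  # positive: male-leaning
family_bias = gender_bias(family, male, female)  # negative: female-leaning
```

Computing a score like this on embeddings trained from text written in different periods, and comparing the change against the change seen for random word sets, would mirror the kind of over-time comparison the abstract describes.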
