Experiment 2: Biosbias
Basic information
- Task: Predicting the occupation of a given bio paragraph
- Dataset: Biosbias
- Classes: Surgeon or Nurse
- Train/Dev/Test examples: 3832 / 1277 / 1278
- Problem: Due to the gender imbalance in each occupation, a classifier usually exploits gender information when making predictions. As a result, bios of female surgeons and male nurses are often misclassified.
- For more details, please see Section 6 of the paper.
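The problem described above can be made concrete by measuring the error rate separately for each (occupation, gender) group. The following is a minimal sketch, not the evaluation code used in the experiment: the function name `error_rate_by_group`, the record layout, and the toy records are all assumptions made for illustration.

```python
from collections import defaultdict


def error_rate_by_group(records):
    """Return {(occupation, gender): error rate} for labelled predictions.

    Each record is a (true_occupation, gender, predicted_occupation) tuple.
    """
    totals = defaultdict(int)
    errors = defaultdict(int)
    for true_occ, gender, pred_occ in records:
        group = (true_occ, gender)
        totals[group] += 1
        if pred_occ != true_occ:
            errors[group] += 1
    return {group: errors[group] / totals[group] for group in totals}


# Made-up toy predictions for illustration only (not results from the experiment).
toy_records = [
    ("surgeon", "M", "surgeon"),
    ("surgeon", "M", "surgeon"),
    ("surgeon", "F", "nurse"),
    ("surgeon", "F", "surgeon"),
    ("nurse", "F", "nurse"),
    ("nurse", "F", "nurse"),
    ("nurse", "M", "surgeon"),
    ("nurse", "M", "nurse"),
]

rates = error_rate_by_group(toy_records)
for group in sorted(rates):
    print(group, rates[group])
```

A classifier that leans on gender cues would be expected to show higher error rates for the ("surgeon", "F") and ("nurse", "M") groups than for the other two, as in the toy data.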
Word Clouds & Annotations
Model 1: Biosbias2_CNN_20200510171008
Feature 0 | Feature 1 | Feature 2 | Feature 3 | Feature 4 | Feature 5 | Feature 6 | Feature 7 | Feature 8 | Feature 9 | Feature 10 | Feature 11 | Feature 12 | Feature 13 | Feature 14 | Feature 15 | Feature 16 | Feature 17 | Feature 18 | Feature 19 | Feature 20 | Feature 21 | Feature 22 | Feature 23 | Feature 24 | Feature 25 | Feature 26 | Feature 27 | Feature 28 | Feature 29 |
Model weights: - surgeon = 0.327 - nurse = 0.115 | Model weights: - surgeon = 0.244 - nurse = 0.497 | Model weights: - surgeon = -0.290 - nurse = 0.226 | Model weights: - surgeon = -0.023 - nurse = 0.052 | Model weights: - surgeon = -0.355 - nurse = -0.482 | Model weights: - surgeon = -0.387 - nurse = 0.168 | Model weights: - surgeon = 0.224 - nurse = 0.446 | Model weights: - surgeon = -0.400 - nurse = 0.076 | Model weights: - surgeon = -0.341 - nurse = 0.038 | Model weights: - surgeon = -0.212 - nurse = 0.167 | Model weights: - surgeon = -0.332 - nurse = -0.075 | Model weights: - surgeon = 0.481 - nurse = 0.254 | Model weights: - surgeon = -0.286 - nurse = 0.199 | Model weights: - surgeon = 0.149 - nurse = -0.137 | Model weights: - surgeon = 0.368 - nurse = -0.150 | Model weights: - surgeon = 0.086 - nurse = 0.216 | Model weights: - surgeon = 0.219 - nurse = -0.273 | Model weights: - surgeon = -0.329 - nurse = 0.024 | Model weights: - surgeon = 0.233 - nurse = -0.195 | Model weights: - surgeon = -0.346 - nurse = 0.155 | Model weights: - surgeon = -0.191 - nurse = -0.042 | Model weights: - surgeon = -0.272 - nurse = 0.232 | Model weights: - surgeon = 0.454 - nurse = 0.238 | Model weights: - surgeon = 0.451 - nurse = -0.029 | Model weights: - surgeon = 0.094 - nurse = 0.374 | Model weights: - surgeon = -0.341 - nurse = -0.057 | Model weights: - surgeon = 0.124 - nurse = 0.449 | Model weights: - surgeon = 0.197 - nurse = -0.204 | Model weights: - surgeon = -0.360 - nurse = 0.414 | Model weights: - surgeon = 0.134 - nurse = -0.155 |
Human answers: - surgeon = 5 - nurse = 0 - It could be either = 5 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 1 - It could be either = 9 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 1 - It could be either = 9 | Human answers: - surgeon = 0 - nurse = 4 - It could be either = 6 | Human answers: - surgeon = 0 - nurse = 3 - It could be either = 7 | Human answers: - surgeon = 0 - nurse = 4 - It could be either = 6 | Human answers: - surgeon = 0 - nurse = 1 - It could be either = 9 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 8 - nurse = 1 - It could be either = 1 | Human answers: - surgeon = 0 - nurse = 1 - It could be either = 9 | Human answers: - surgeon = 0 - nurse = 9 - It could be either = 1 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 1 - nurse = 3 - It could be either = 6 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 0 - It could be either = 10 | Human answers: - surgeon = 1 - nurse = 4 - It could be either = 5 | Human answers: - surgeon = 1 - nurse = 1 - It could be either = 8 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 9 - nurse = 1 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 4 - It could be either = 6 | Human answers: - surgeon = 0 - nurse = 7 - It could be either = 3 | Human answers: - surgeon = 1 - nurse = 1 - It could be either = 8 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 7 - nurse = 0 - It could be either = 3 |
Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled |
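To illustrate how the "Model weights" and "Decision" rows might interact, here is a minimal sketch. It assumes that a "Disabled" feature simply contributes nothing to the final linear layer, which this page does not spell out. The weights and MTurk decisions for features 0 to 3 are copied from the Model 1 table above; the feature activations, the function name `class_scores`, and the omission of a bias term are assumptions made for illustration.

```python
# Final-layer weights of Model 1 for features 0-3, copied from the table above.
weights = {
    "surgeon": [0.327, 0.244, -0.290, -0.023],
    "nurse": [0.115, 0.497, 0.226, 0.052],
}

# MTurk decisions for the same features: Disabled, Enabled, Disabled, Enabled.
mturk_enabled = [False, True, False, True]


def class_scores(features, weights, enabled):
    """Linear class scores computed from the enabled features only (no bias)."""
    scores = {}
    for cls, class_weights in weights.items():
        scores[cls] = sum(
            w * f for w, f, on in zip(class_weights, features, enabled) if on
        )
    return scores


# Hypothetical feature activations for one bio (not taken from the experiment).
features = [1.0, 0.5, 2.0, 0.0]

scores_all = class_scores(features, weights, [True] * len(features))
scores_masked = class_scores(features, weights, mturk_enabled)
print(scores_all)
print(scores_masked)
```

With features 0 and 2 masked out, only feature 1 (which all ten annotators associated with nurse) still affects the scores in this toy input.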
Model 2: Biosbias2_CNN_20200510172123
Feature 0 | Feature 1 | Feature 2 | Feature 3 | Feature 4 | Feature 5 | Feature 6 | Feature 7 | Feature 8 | Feature 9 | Feature 10 | Feature 11 | Feature 12 | Feature 13 | Feature 14 | Feature 15 | Feature 16 | Feature 17 | Feature 18 | Feature 19 | Feature 20 | Feature 21 | Feature 22 | Feature 23 | Feature 24 | Feature 25 | Feature 26 | Feature 27 | Feature 28 | Feature 29 |
Model weights: - surgeon = 0.220 - nurse = -0.444 | Model weights: - surgeon = 0.410 - nurse = -0.166 | Model weights: - surgeon = 0.415 - nurse = -0.326 | Model weights: - surgeon = -0.259 - nurse = 0.298 | Model weights: - surgeon = 0.283 - nurse = -0.427 | Model weights: - surgeon = -0.063 - nurse = 0.232 | Model weights: - surgeon = -0.197 - nurse = -0.080 | Model weights: - surgeon = -0.008 - nurse = 0.299 | Model weights: - surgeon = -0.089 - nurse = 0.061 | Model weights: - surgeon = 0.301 - nurse = 0.207 | Model weights: - surgeon = -0.227 - nurse = -0.005 | Model weights: - surgeon = -0.464 - nurse = -0.270 | Model weights: - surgeon = 0.222 - nurse = 0.033 | Model weights: - surgeon = -0.033 - nurse = -0.162 | Model weights: - surgeon = 0.188 - nurse = 0.432 | Model weights: - surgeon = 0.296 - nurse = -0.217 | Model weights: - surgeon = 0.002 - nurse = 0.357 | Model weights: - surgeon = -0.334 - nurse = 0.011 | Model weights: - surgeon = 0.434 - nurse = -0.263 | Model weights: - surgeon = -0.098 - nurse = 0.216 | Model weights: - surgeon = -0.059 - nurse = -0.416 | Model weights: - surgeon = -0.346 - nurse = 0.336 | Model weights: - surgeon = 0.050 - nurse = -0.170 | Model weights: - surgeon = -0.414 - nurse = 0.048 | Model weights: - surgeon = -0.108 - nurse = -0.332 | Model weights: - surgeon = 0.197 - nurse = -0.261 | Model weights: - surgeon = -0.377 - nurse = -0.067 | Model weights: - surgeon = -0.034 - nurse = 0.356 | Model weights: - surgeon = -0.195 - nurse = 0.267 | Model weights: - surgeon = 0.271 - nurse = -0.221 |
Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 4 - nurse = 0 - It could be either = 6 | Human answers: - surgeon = 0 - nurse = 0 - It could be either = 10 | Human answers: - surgeon = 1 - nurse = 0 - It could be either = 9 | Human answers: - surgeon = 0 - nurse = 2 - It could be either = 8 | Human answers: - surgeon = 2 - nurse = 1 - It could be either = 7 | Human answers: - surgeon = 0 - nurse = 0 - It could be either = 10 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 9 - It could be either = 1 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 9 - nurse = 0 - It could be either = 1 | Human answers: - surgeon = 0 - nurse = 6 - It could be either = 4 | Human answers: - surgeon = 6 - nurse = 0 - It could be either = 4 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 9 - It could be either = 1 | Human answers: - surgeon = 4 - nurse = 1 - It could be either = 5 | Human answers: - surgeon = 0 - nurse = 5 - It could be either = 5 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 5 - nurse = 0 - It could be either = 5 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 4 - nurse = 0 - It could be either = 6 | Human answers: - surgeon = 5 - nurse = 0 - It could be either = 5 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 9 - It could be either = 1 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 8 - nurse = 0 - It could be either = 2 |
Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled |
Model 3: Biosbias2_CNN_20200510173215
Feature 0 | Feature 1 | Feature 2 | Feature 3 | Feature 4 | Feature 5 | Feature 6 | Feature 7 | Feature 8 | Feature 9 | Feature 10 | Feature 11 | Feature 12 | Feature 13 | Feature 14 | Feature 15 | Feature 16 | Feature 17 | Feature 18 | Feature 19 | Feature 20 | Feature 21 | Feature 22 | Feature 23 | Feature 24 | Feature 25 | Feature 26 | Feature 27 | Feature 28 | Feature 29 |
Model weights: - surgeon = 0.076 - nurse = -0.178 | Model weights: - surgeon = 0.423 - nurse = -0.411 | Model weights: - surgeon = -0.467 - nurse = -0.102 | Model weights: - surgeon = 0.070 - nurse = 0.226 | Model weights: - surgeon = -0.190 - nurse = -0.442 | Model weights: - surgeon = -0.367 - nurse = 0.063 | Model weights: - surgeon = -0.352 - nurse = 0.018 | Model weights: - surgeon = 0.013 - nurse = 0.369 | Model weights: - surgeon = -0.435 - nurse = -0.108 | Model weights: - surgeon = 0.296 - nurse = 0.200 | Model weights: - surgeon = -0.229 - nurse = 0.109 | Model weights: - surgeon = 0.175 - nurse = 0.043 | Model weights: - surgeon = 0.388 - nurse = -0.143 | Model weights: - surgeon = 0.426 - nurse = -0.235 | Model weights: - surgeon = 0.176 - nurse = -0.151 | Model weights: - surgeon = -0.178 - nurse = 0.051 | Model weights: - surgeon = 0.201 - nurse = -0.256 | Model weights: - surgeon = -0.035 - nurse = -0.140 | Model weights: - surgeon = 0.395 - nurse = 0.155 | Model weights: - surgeon = 0.355 - nurse = -0.183 | Model weights: - surgeon = -0.323 - nurse = 0.203 | Model weights: - surgeon = -0.274 - nurse = -0.400 | Model weights: - surgeon = -0.325 - nurse = 0.409 | Model weights: - surgeon = 0.102 - nurse = -0.303 | Model weights: - surgeon = -0.054 - nurse = -0.362 | Model weights: - surgeon = -0.278 - nurse = 0.121 | Model weights: - surgeon = 0.339 - nurse = -0.240 | Model weights: - surgeon = 0.281 - nurse = -0.448 | Model weights: - surgeon = 0.405 - nurse = -0.297 | Model weights: - surgeon = -0.132 - nurse = 0.274 |
Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 1 - nurse = 1 - It could be either = 8 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 7 - It could be either = 3 | Human answers: - surgeon = 0 - nurse = 2 - It could be either = 8 | Human answers: - surgeon = 0 - nurse = 0 - It could be either = 10 | Human answers: - surgeon = 1 - nurse = 7 - It could be either = 2 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 6 - nurse = 0 - It could be either = 4 | Human answers: - surgeon = 2 - nurse = 0 - It could be either = 8 | Human answers: - surgeon = 3 - nurse = 0 - It could be either = 7 | Human answers: - surgeon = 1 - nurse = 0 - It could be either = 9 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 8 - nurse = 0 - It could be either = 2 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 8 - nurse = 0 - It could be either = 2 | Human answers: - surgeon = 1 - nurse = 9 - It could be either = 0 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 10 - It could be either = 0 | Human answers: - surgeon = 9 - nurse = 0 - It could be either = 1 | Human answers: - surgeon = 3 - nurse = 0 - It could be either = 7 | Human answers: - surgeon = 1 - nurse = 2 - It could be either = 7 | Human answers: - surgeon = 4 - nurse = 0 - It could be either = 6 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 10 - nurse = 0 - It could be either = 0 | Human answers: - surgeon = 0 - nurse = 9 - It could be either = 1 |
Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Disabled | Decision: - MTurk: Disabled - One: Enabled | Decision: - MTurk: Enabled - One: Enabled | Decision: - MTurk: Enabled - One: Disabled | Decision: - MTurk: Enabled - One: Disabled |
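In the three tables above, the listed MTurk decision coincides with a simple majority rule over the ten human answers: a feature is Enabled when one occupation receives more than half of the votes, and Disabled otherwise. This is a pattern read off the listed numbers, not a rule stated on this page, so the sketch below should be taken as an assumption; the function name `mturk_decision` is hypothetical. The vote counts and decisions in `examples` are copied from the Model 1 and Model 2 tables.

```python
def mturk_decision(surgeon, nurse, either):
    """Enabled if one occupation gets a strict majority of all votes (assumed rule)."""
    total = surgeon + nurse + either
    return "Enabled" if 2 * max(surgeon, nurse) > total else "Disabled"


# (model, feature) -> ((surgeon, nurse, either) votes, listed MTurk decision)
examples = {
    (1, 0): ((5, 0, 5), "Disabled"),
    (1, 1): ((0, 10, 0), "Enabled"),
    (1, 13): ((8, 1, 1), "Enabled"),
    (1, 20): ((1, 4, 5), "Disabled"),
    (1, 25): ((0, 7, 3), "Enabled"),
    (2, 14): ((0, 6, 4), "Enabled"),
    (2, 18): ((4, 1, 5), "Disabled"),
    (2, 19): ((0, 5, 5), "Disabled"),
}

for key, (votes, listed) in examples.items():
    print(key, votes, mturk_decision(*votes), listed)
```

The "One" decisions do not follow this rule (for example, Model 1 Feature 0 is Disabled under MTurk but Enabled under One), so they are not covered by the sketch.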
Results
Results (Average ± SD) of Experiment 2: Biosbias, CNNs. Boldface numbers are the best scores in each column. They are also underlined if they are significantly better than the scores of all the other models (based on an approximate randomization test with α = 0.05).
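For readers unfamiliar with the significance test named in the caption, the following is a minimal sketch of a paired approximate randomization test. It assumes the compared statistic is the difference in the number of correctly classified test examples between two models; the exact statistic and number of trials used in the experiment are not stated here, and the function name and toy inputs are hypothetical.

```python
import random


def approximate_randomization(correct_a, correct_b, trials=2000, seed=0):
    """Two-sided approximate randomization test on paired 0/1 correctness lists.

    correct_a[i] and correct_b[i] say whether model A and model B classified
    test example i correctly. Returns an estimated p-value.
    """
    if len(correct_a) != len(correct_b):
        raise ValueError("both models must be scored on the same examples")
    rng = random.Random(seed)
    observed = abs(sum(correct_a) - sum(correct_b))
    at_least_as_extreme = 0
    for _ in range(trials):
        diff = 0
        for a, b in zip(correct_a, correct_b):
            if rng.random() < 0.5:  # randomly swap the two models' outputs
                a, b = b, a
            diff += a - b
        if abs(diff) >= observed:
            at_least_as_extreme += 1
    # Add-one smoothing keeps the estimate strictly above zero.
    return (at_least_as_extreme + 1) / (trials + 1)


# Made-up correctness vectors for illustration only.
p_same = approximate_randomization([1, 0, 1, 1] * 10, [1, 0, 1, 1] * 10)
p_diff = approximate_randomization([1] * 50, [0] * 50)
print(p_same, p_diff)
```

A difference would be called significant at α = 0.05 when the returned p-value is below 0.05.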
Downloads
- Word clouds and annotations
- The dataset of this experiment as well as other experiments can be downloaded here.
- If you want to use the original trained models in the experiments, please contact Piyawat (pl1515 [at] imperial [dot] ac [dot] uk).