Effects of Algorithmic Flagging on Fairness: Quasi-experimental Evidence from Wikipedia

dc.contributor.authorTeBlunthuis, Nathan
dc.contributor.authorHill, Benjamin Mako
dc.contributor.authorHalfaker, Aaron
dc.date.accessioned2026-01-05T20:38:13Z
dc.date.available2026-01-05T20:38:13Z
dc.date.issued2021-04-22
dc.description.abstractOnline community moderators often rely on social signals such as whether or not a user has an account or a profile page as clues that users may cause problems. Reliance on these clues can lead to "overprofiling'' bias when moderators focus on these signals but overlook the misbehavior of others. We propose that algorithmic flagging systems deployed to improve the efficiency of moderation work can also make moderation actions more fair to these users by reducing reliance on social signals and making norm violations by everyone else more visible. We analyze moderator behavior in Wikipedia as mediated by RCFilters, a system which displays social signals and algorithmic flags, and estimate the causal effect of being flagged on moderator actions. We show that algorithmically flagged edits are reverted more often, especially those by established editors with positive social signals, and that flagging decreases the likelihood that moderation actions will be undone. Our results suggest that algorithmic flagging systems can lead to increased fairness in some contexts but that the relationship is complex and contingent.
dc.identifier.urihttps://hdl.handle.net/1773/54485
dc.rightsAttribution-ShareAlike 3.0 United Statesen
dc.rights.urihttp://creativecommons.org/licenses/by-sa/3.0/us/
dc.titleEffects of Algorithmic Flagging on Fairness: Quasi-experimental Evidence from Wikipedia
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
3449130-stripped.pdf
Size:
497.83 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.6 KB
Format:
Item-specific license agreed upon to submission
Description: