Stephanie Eckman
Stephanie Eckman
Home
AI Research
Survey Research
Talks
Publications
Contact
CV
Light
Dark
Automatic
data quality
Data Quality in Data Science
Data scientists are increasingly turning their attention to the collection of high quality training data. Those of us with expertise in data collection can apply lessons in web survey design to them.
Aug 5, 2021 10:00 AM
Stephanie Eckman
Project
Slides
Video
stephnie
Annotation Sensitivity: Training Data Collection Methods Affect Model Performance
Small changes in how you ask annotators to label data can dramatically change your model’s behavior. We tested 5 versions of a hate speech labeling task and found significant differences in model performance.
Christoph Kern
,
Stephanie Eckman
,
Jacob Beck
,
Rob Chew
,
Bolei Ma
,
Frauke Kreuter
PDF
Project
DOI
Improving Labeling Through Social Science Insights: Preliminary Results and Research Agenda
How you design a labeling interface affects the labels you get. We show that task structure, ordering, and annotator backgrounds all shape training data quality.
Jacob Beck
,
Stephanie Eckman
,
Rob Chew
,
Frauke Kreuter
PDF
Project
DOI
«
Cite
×