Skip to main content

Showing 1–40 of 40 results for author: Bernstein, M S

  1. arXiv:2406.19571  [pdf, other

    cs.SI cs.CY

    Reranking Social Media Feeds: A Practical Guide for Field Experiments

    Authors: Tiziano Piccardi, Martin Saveski, Chenyan Jia, Jeffrey Hancock, Jeanne L. Tsai, Michael S. Bernstein

    Abstract: Social media plays a central role in shaping public opinion and behavior, yet performing experiments on these platforms and, in particular, on feed algorithms is becoming increasingly challenging. This article offers practical recommendations to researchers developing and deploying field experiments focused on real-time re-ranking of social media feeds. This article is organized around two contrib… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM

    Authors: Michelle S. Lam, Janice Teoh, James Landay, Jeffrey Heer, Michael S. Bernstein

    Abstract: Data analysts have long sought to turn unstructured text data into meaningful concepts. Though common, topic modeling and clustering focus on lower-level keywords and require significant interpretative work. We introduce concept induction, a computational process that instead produces high-level concepts, defined by explicit inclusion criteria, from unstructured text. For a dataset of toxic online… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: To appear at CHI 2024

  3. arXiv:2404.04204  [pdf, other

    cs.CL cs.HC

    Social Skill Training with Large Language Models

    Authors: Diyi Yang, Caleb Ziems, William Held, Omar Shaikh, Michael S. Bernstein, John Mitchell

    Abstract: People rely on social skills like conflict resolution to communicate effectively and to thrive in both work and personal life. However, practice environments for social skills are typically out of reach for most people. How can we make social skill training more available, accessible, and inviting? Drawing upon interdisciplinary research from communication and psychology, this perspective paper id… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2402.05388  [pdf, other

    cs.HC cs.SI

    Form-From: A Design Space of Social Media Systems

    Authors: Amy X. Zhang, Michael S. Bernstein, David R. Karger, Mark S. Ackerman

    Abstract: Social media systems are as varied as they are pervasive. They have been almost universally adopted for a broad range of purposes including work, entertainment, activism, and decision making. As a result, they have also diversified, with many distinct designs differing in content type, organization, delivery mechanism, access control, and many other dimensions. In this work, we aim to characterize… ▽ More

    Submitted 23 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Journal ref: Proc. ACM Hum.-Comput. Interact. 8, CSCW1, Article 167 (April 2024), 47 pages

  5. arXiv:2402.03715  [pdf, other

    cs.LG cs.AI cs.CL

    Clarify: Improving Model Robustness With Natural Language Corrections

    Authors: Yoonho Lee, Michelle S. Lam, Helena Vasconcelos, Michael S. Bernstein, Chelsea Finn

    Abstract: In supervised learning, models are trained to extract correlations from a static dataset. This often leads to models that rely on high-level misconceptions. To prevent such misconceptions, we must necessarily provide additional information beyond the training data. Existing methods incorporate forms of additional instance-level supervision, such as labels for spurious features or additional labele… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  6. arXiv:2309.12309  [pdf, other

    cs.HC cs.AI cs.CL

    Rehearsal: Simulating Conflict to Teach Conflict Resolution

    Authors: Omar Shaikh, Valentino Chai, Michele J. Gelfand, Diyi Yang, Michael S. Bernstein

    Abstract: Interpersonal conflict is an uncomfortable but unavoidable fact of life. Navigating conflict successfully is a skill -- one that can be learned through deliberate practice -- but few have access to effective training or feedback. To expand this access, we introduce Rehearsal, a system that allows users to rehearse conflicts with a believable simulated interlocutor, explore counterfactual "what if?… ▽ More

    Submitted 29 February, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: CHI 2024

  7. Cura: Curation at Social Media Scale

    Authors: Wanrong He, Mitchell L. Gordon, Lindsay Popowski, Michael S. Bernstein

    Abstract: How can online communities execute a focused vision for their space? Curation offers one approach, where community leaders manually select content to share with the community. Curation enables leaders to shape a space that matches their taste, norms, and values, but the practice is often intractable at social media scale: curators cannot realistically sift through hundreds or thousands of submissi… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: CSCW 2023

  8. arXiv:2307.13912  [pdf, other

    cs.HC cs.AI

    Embedding Democratic Values into Social Media AIs via Societal Objective Functions

    Authors: Chenyan Jia, Michelle S. Lam, Minh Chau Mai, Jeff Hancock, Michael S. Bernstein

    Abstract: Can we design artificial intelligence (AI) systems that rank our social media feeds to consider democratic values such as mitigating partisan animosity as part of their objective functions? We introduce a method for translating established, vetted social scientific constructs into AI objective functions, which we term societal objective functions, and demonstrate the method with application to the… ▽ More

    Submitted 14 February, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted to CSCW 2024 and will be published in Proc. ACM Hum.-Comput. Interact. 8, CSCW1, Article 163 (April 2024)

    Journal ref: Proceedings of the ACM: Human-Computer Interaction, 8, CSCW1, Article 163 (2024)

  9. arXiv:2305.09038  [pdf

    cs.HC

    Characterizing Image Accessibility on Wikipedia across Languages

    Authors: Elisa Kreiss, Krishna Srinivasan, Tiziano Piccardi, Jesus Adolfo Hermosillo, Cynthia Bennett, Michael S. Bernstein, Meredith Ringel Morris, Christopher Potts

    Abstract: We make a first attempt to characterize image accessibility on Wikipedia across languages, present new experimental results that can inform efforts to assess description quality, and offer some strategies to improve Wikipedia's image accessibility.

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: Presented at Wiki Workshop 2023

  10. arXiv:2304.03442  [pdf, other

    cs.HC cs.AI cs.LG

    Generative Agents: Interactive Simulacra of Human Behavior

    Authors: Joon Sung Park, Joseph C. O'Brien, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein

    Abstract: Believable proxies of human behavior can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication to prototyping tools. In this paper, we introduce generative agents--computational software agents that simulate believable human behavior. Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; t… ▽ More

    Submitted 5 August, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  11. arXiv:2303.02884  [pdf, other

    cs.HC cs.AI cs.LG

    Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design

    Authors: Michelle S. Lam, Zixian Ma, Anne Li, Izequiel Freitas, Dakuo Wang, James A. Landay, Michael S. Bernstein

    Abstract: Machine learning practitioners often end up tunneling on low-level technical details like model architectures and performance metrics. Could early model development instead focus on high-level questions of which factors a model ought to pay attention to? Inspired by the practice of sketching in design, which distills ideas to their minimal representation, we introduce model sketching: a technical… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

    Comments: To appear at CHI 2023

  12. arXiv:2301.13431  [pdf, other

    cs.HC cs.CY cs.DL

    Breaking Out of the Ivory Tower: A Large-scale Analysis of Patent Citations to HCI Research

    Authors: Hancheng Cao, Yujie Lu, Yuting Deng, Daniel A. McFarland, Michael S. Bernstein

    Abstract: What is the impact of human-computer interaction research on industry? While it is impossible to track all research impact pathways, the growing literature on translational research impact measurement offers patent citations as one measure of how industry recognizes and draws on research in its inventions. In this paper, we perform a large-scale measurement study primarily of 70,000 patent citatio… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: accepted to CHI 2023

  13. arXiv:2208.13094  [pdf, other

    cs.HC

    Measuring the Prevalence of Anti-Social Behavior in Online Communities

    Authors: Joon Sung Park, Joseph Seering, Michael S. Bernstein

    Abstract: With increasing attention to online anti-social behaviors such as personal attacks and bigotry, it is critical to have an accurate accounting of how widespread anti-social behaviors are. In this paper, we empirically measure the prevalence of anti-social behavior in one of the world's most popular online community platforms. We operationalize this goal as measuring the proportion of unmoderated co… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: This work will appear in the Proc. ACM Hum.-Comput. Interact. 6, CSCW (CSCW'22)

  14. arXiv:2208.04024  [pdf, other

    cs.HC

    Social Simulacra: Creating Populated Prototypes for Social Computing Systems

    Authors: Joon Sung Park, Lindsay Popowski, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein

    Abstract: Social computing prototypes probe the social behaviors that may arise in an envisioned system design. This prototyping practice is currently limited to recruiting small groups of people. Unfortunately, many challenges do not arise until a system is populated at a larger scale. Can a designer understand how a social system might behave when populated, and make adjustments to the design before the s… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: This work will appear in the 35th Annual ACM Symposium on User Interface Software and Technology (UIST '22)

  15. arXiv:2207.04369  [pdf, other

    stat.AP

    Balancing Producer Fairness and Efficiency via Prior-Weighted Rating System Design

    Authors: Thomas Ma, Michael S. Bernstein, Ramesh Johari, Nikhil Garg

    Abstract: Online marketplaces use rating systems to promote the discovery of high-quality products. However, these systems also lead to high variance in producers' economic outcomes: a new producer who sells high-quality items, may unluckily receive one low rating early on, negatively impacting their future popularity. We investigate the design of rating systems that balance the goals of identifying high-qu… ▽ More

    Submitted 25 November, 2023; v1 submitted 9 July, 2022; originally announced July 2022.

    Comments: 12 pages, 8 figures, submitted to TheWebConf 2024

  16. arXiv:2204.05439  [pdf, other

    cs.HC cs.SI

    A Web-Scale Analysis of the Community Origins of Image Memes

    Authors: Durim Morina, Michael S. Bernstein

    Abstract: Where do the most popular online cultural artifacts such as image memes originate? Media narratives suggest that cultural innovations often originate in peripheral communities and then diffuse to the mainstream core; behavioral science suggests that intermediate network positions that bridge between the periphery and the core are especially likely to originate many influential cultural innovations… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: CSCW 2022

    Journal ref: Proc. ACM Hum.-Comput. Interact. 6, CSCW1, Article 74 (April 2022), 25 pages

  17. Comparing the Perceived Legitimacy of Content Moderation Processes: Contractors, Algorithms, Expert Panels, and Digital Juries

    Authors: Christina A. Pan, Sahil Yakhmi, Tara P. Iyer, Evan Strasnick, Amy X. Zhang, Michael S. Bernstein

    Abstract: While research continues to investigate and improve the accuracy, fairness, and normative appropriateness of content moderation processes on large social media platforms, even the best process cannot be effective if users reject its authority as illegitimate. We present a survey experiment comparing the perceived institutional legitimacy of four popular content moderation processes. We conducted a… ▽ More

    Submitted 6 October, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: This paper will appear at CSCW 2022

  18. arXiv:2202.02950  [pdf, other

    cs.HC cs.AI cs.LG

    Jury Learning: Integrating Dissenting Voices into Machine Learning Models

    Authors: Mitchell L. Gordon, Michelle S. Lam, Joon Sung Park, Kayur Patel, Jeffrey T. Hancock, Tatsunori Hashimoto, Michael S. Bernstein

    Abstract: Whose labels should a machine learning (ML) algorithm learn to emulate? For ML tasks ranging from online comment toxicity to misinformation detection to medical diagnosis, different groups in society may have irreconcilable disagreements about ground truth labels. Supervised ML today resolves these label disagreements implicitly using majority vote, which overrides minority groups' labels. We intr… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: To appear at CHI 2022

  19. A "Distance Matters" Paradox: Facilitating Intra-Team Collaboration Can Harm Inter-Team Collaboration

    Authors: Xinlan Emily Hu, Rebecca Hinds, Melissa A. Valentine, Michael S. Bernstein

    Abstract: By identifying the socio-technical conditions required for teams to work effectively remotely, the Distance Matters framework has been influential in CSCW since its introduction in 2000. Advances in collaboration technology and practices have since brought teams increasingly closer to achieving these conditions. This paper presents a ten-month ethnography in a remote organization, where we observe… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: Accepted at CSCW 2022 (The 25th ACM Conference on Computer-Supported Cooperative Work and Social Computing)

    Journal ref: Proc. ACM Hum.-Comput. Interact. 6, CSCW1, Article 48 (April 2022), 36 pages

  20. arXiv:2112.08279  [pdf, other

    cs.CY

    Crowdsourcing County-Level Data on Early COVID-19 Policy Interventions in the United States: Technical Report

    Authors: Jacob Ritchie, Mark Whiting, Sorathan Chaturapruek, J. D. Zamfirescu-Pereira, Madhav Marathe, Achla Marathe, Stephen Eubank, Michael S. Bernstein

    Abstract: Beginning in April 2020, we gathered partial county-level data on non-pharmaceutical interventions (NPIs) implemented in response to the COVID-19 pandemic in the United States, using both volunteer and paid crowdsourcing. In this report, we document the data collection process and summarize our results, to increase the utility of our open data and inform the design of future rapid crowdsourcing da… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: Includes survey instrument

  21. arXiv:2108.07258  [pdf, other

    cs.LG cs.AI cs.CY

    On the Opportunities and Risks of Foundation Models

    Authors: Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh , et al. (89 additional authors not shown)

    Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their cap… ▽ More

    Submitted 12 July, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html

  22. arXiv:2106.11521  [pdf, other

    cs.CY

    ESR: Ethics and Society Review of Artificial Intelligence Research

    Authors: Michael S. Bernstein, Margaret Levi, David Magnus, Betsy Rajala, Debra Satz, Charla Waeiss

    Abstract: Artificial intelligence (AI) research is routinely criticized for its real and potential impacts on society, and we lack adequate institutional responses to this criticism and to the responsibility that it reflects. AI research often falls outside the purview of existing feedback mechanisms such as the Institutional Review Board (IRB), which are designed to evaluate harms to human subjects rather… ▽ More

    Submitted 9 July, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Revision: credit to the Microsoft Research Ethics Review Program

    ACM Class: K.4

  23. Understanding the Representation and Representativeness of Age in AI Data Sets

    Authors: Joon Sung Park, Michael S. Bernstein, Robin N. Brewer, Ece Kamar, Meredith Ringel Morris

    Abstract: A diverse representation of different demographic groups in AI training data sets is important in ensuring that the models will work for a large range of users. To this end, recent efforts in AI fairness and inclusion have advocated for creating AI data sets that are well-balanced across race, gender, socioeconomic status, and disability status. In this paper, we contribute to this line of work by… ▽ More

    Submitted 6 May, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: 9 pages

    Journal ref: In Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society (AIES '21)

  24. arXiv:2101.11743  [pdf, other

    cs.HC

    Not Now, Ask Later: Users Weaken Their Behavior Change Regimen Over Time, But Expect To Re-Strengthen It Imminently

    Authors: Geza Kovacs, Zhengxuan Wu, Michael S. Bernstein

    Abstract: How effectively do we adhere to nudges and interventions that help us control our online browsing habits? If we have a temporary lapse and disable the behavior change system, do we later resume our adherence, or has the dam broken? In this paper, we investigate these questions through log analyses of 8,000+ users on HabitLab, a behavior change platform that helps users reduce their time online. We… ▽ More

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: To appear in ACM CHI Conference on Human Factors in Computing Systems (CHI '21), May 8-13, 2021, Yokohama, Japan

    ACM Class: H.5.2

  25. arXiv:2010.07292  [pdf, other

    cs.CY cs.CL

    My Team Will Go On: Differentiating High and Low Viability Teams through Team Interaction

    Authors: Hancheng Cao, Vivian Yang, Victor Chen, Yu Jin Lee, Lydia Stone, N'godjigui Junior Diarrassouba, Mark E. Whiting, Michael S. Bernstein

    Abstract: Understanding team viability -- a team's capacity for sustained and future success -- is essential for building effective teams. In this study, we aggregate features drawn from the organizational behavior literature to train a viability classification model over a dataset of 669 10-minute text conversations of online teams. We train classifiers to identify teams at the top decile (most viable team… ▽ More

    Submitted 3 November, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: CSCW 2020 Honorable Mention Award

    Journal ref: Proc. ACM Hum.-Comput. Interact. 4, CSCW3, Article 230 (December 2020)

  26. PolicyKit: Building Governance in Online Communities

    Authors: Amy X. Zhang, Grant Hugh, Michael S. Bernstein

    Abstract: The software behind online community platforms encodes a governance model that represents a strikingly narrow set of governance possibilities focused on moderators and administrators. When online communities desire other forms of government, such as ones that take many members' opinions into account or that distribute power in non-trivial ways, communities must resort to laborious manual effort. I… ▽ More

    Submitted 17 August, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: to be published in ACM UIST 2020

    ACM Class: H.5.3

  27. arXiv:1910.10143  [pdf, other

    cs.LG cs.CV stat.ML

    Establishing an Evaluation Metric to Quantify Climate Change Image Realism

    Authors: Sharon Zhou, Alexandra Luccioni, Gautier Cosne, Michael S. Bernstein, Yoshua Bengio

    Abstract: With success on controlled tasks, generative models are being increasingly applied to humanitarian applications [1,2]. In this paper, we focus on the evaluation of a conditional generative model that illustrates the consequences of climate change-induced flooding to encourage public interest and awareness on the issue. Because metrics for comparing the realism of different modes in a conditional g… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: Accepted to the NeurIPS 2019 Workshop, Tackling Climate Change with Machine Learning

    MSC Class: 68T45

  28. arXiv:1904.06722  [pdf, other

    cs.CY cs.HC econ.GN

    Boomerang: Rebounding the Consequences of Reputation Feedback on Crowdsourcing Platforms

    Authors: Snehalkumar, S. Gaikwad, Durim Morina, Adam Ginzberg, Catherine Mullings, Shirish Goyal, Dilrukshi Gamage, Christopher Diemert, Mathias Burton, Sharon Zhou, Mark Whiting, Karolina Ziulkoski, Alipta Ballav, Aaron Gilbee, Senadhipathige S. Niranga, Vibhor Sehgal, Jasmine Lin, Leonardy Kristianto, Angela Richmond-Fuller, Jeff Regino, Nalin Chhibber, Dinesh Majeti, Sachin Sharma, Kamila Mananova, Dinesh Dhakal , et al. (13 additional authors not shown)

    Abstract: Paid crowdsourcing platforms suffer from low-quality work and unfair rejections, but paradoxically, most workers and requesters have high reputation scores. These inflated scores, which make high-quality work and workers difficult to find, stem from social pressure to avoid giving negative feedback. We introduce Boomerang, a reputation system for crowdsourcing that elicits more accurate feedback b… ▽ More

    Submitted 14 April, 2019; originally announced April 2019.

    ACM Class: H.5.3; H.1.2; J.4; K.4.4; K.4.3

    Journal ref: Proceedings of the 29th Annual Symposium on User Interface Software and Technology, 2016

  29. arXiv:1904.01121  [pdf, other

    cs.CV cs.HC cs.LG

    HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models

    Authors: Sharon Zhou, Mitchell L. Gordon, Ranjay Krishna, Austin Narcomey, Li Fei-Fei, Michael S. Bernstein

    Abstract: Generative models often use human evaluations to measure the perceived quality of their outputs. Automated metrics are noisy indirect proxies, because they rely on heuristics or pretrained embeddings. However, up until now, direct human evaluation strategies have been ad-hoc, neither standardized nor validated. Our work establishes a gold standard human benchmark for generative realism. We constru… ▽ More

    Submitted 31 October, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: https://hype.stanford.edu

  30. Mechanical Novel: Crowdsourcing Complex Work through Reflection and Revision

    Authors: Joy Kim, Sarah Sterman, Allegra Argent Beal Cohen, Michael S. Bernstein

    Abstract: Crowdsourcing systems accomplish large tasks with scale and speed by breaking work down into independent parts. However, many types of complex creative work, such as fiction writing, have remained out of reach for crowds because work is tightly interdependent: changing one part of a story may trigger changes to the overall plot and vice versa. Taking inspiration from how expert authors write, we p… ▽ More

    Submitted 8 November, 2016; originally announced November 2016.

    ACM Class: H.5.3

  31. Mosaic: Designing Online Creative Communities for Sharing Works-in-Progress

    Authors: Joy Kim, Maneesh Agrawala, Michael S. Bernstein

    Abstract: Online creative communities allow creators to share their work with a large audience, maximizing opportunities to showcase their work and connect with fans and peers. However, sharing in-progress work can be technically and socially challenging in environments designed for sharing completed pieces. We propose an online creative community where sharing process, rather than showcasing outcomes, is t… ▽ More

    Submitted 8 November, 2016; originally announced November 2016.

    ACM Class: H.5.3

  32. Crowd Guilds: Worker-led Reputation and Feedback on Crowdsourcing Platforms

    Authors: Mark E. Whiting, Dilrukshi Gamage, Snehalkumar S. Gaikwad, Aaron Gilbee, Shirish Goyal, Alipta Ballav, Dinesh Majeti, Nalin Chhibber, Angela Richmond-Fuller, Freddie Vargus, Tejas Seshadri Sarma, Varshine Chandrakanthan, Teogenes Moura, Mohamed Hashim Salih, Gabriel Bayomi Tinoco Kalejaiye, Adam Ginzberg, Catherine A. Mullings, Yoni Dayan, Kristy Milland, Henrique Orefice, Jeff Regino, Sayna Parsi, Kunz Mainali, Vibhor Sehgal, Sekandar Matin , et al. (3 additional authors not shown)

    Abstract: Crowd workers are distributed and decentralized. While decentralization is designed to utilize independent judgment to promote high-quality results, it paradoxically undercuts behaviors and institutions that are critical to high-quality work. Reputation is one central example: crowdsourcing systems depend on reputation scores from decentralized workers and requesters, but these scores are notoriou… ▽ More

    Submitted 28 February, 2017; v1 submitted 4 November, 2016; originally announced November 2016.

    Comments: 12 pages, 6 figures, 1 table. To be presented at CSCW2017

    ACM Class: H.5.3

    Journal ref: ACM Conference on Computer Supported Cooperative Work and Social Computing. ACM, New York, NY, USA, 1902-1913

  33. A Glimpse Far into the Future: Understanding Long-term Crowd Worker Quality

    Authors: Kenji Hata, Ranjay Krishna, Li Fei-Fei, Michael S. Bernstein

    Abstract: Microtask crowdsourcing is increasingly critical to the creation of extremely large datasets. As a result, crowd workers spend weeks or months repeating the exact same tasks, making it necessary to understand their behavior over these long periods of time. We utilize three large, longitudinal datasets of nine million annotations collected from Amazon Mechanical Turk to examine claims that workers… ▽ More

    Submitted 1 November, 2016; v1 submitted 15 September, 2016; originally announced September 2016.

    Comments: 10 pages, 11 figures, accepted CSCW 2017

    ACM Class: H.5.3

  34. arXiv:1603.08832  [pdf, other

    cs.CL cs.SI

    Shirtless and Dangerous: Quantifying Linguistic Signals of Gender Bias in an Online Fiction Writing Community

    Authors: Ethan Fast, Tina Vachovsky, Michael S. Bernstein

    Abstract: Imagine a princess asleep in a castle, waiting for her prince to slay the dragon and rescue her. Tales like the famous Sleeping Beauty clearly divide up gender roles. But what about more modern stories, borne of a generation increasingly aware of social constructs like sexism and racism? Do these stories tend to reinforce gender stereotypes, or counter them? In this paper, we present a technique t… ▽ More

    Submitted 29 March, 2016; originally announced March 2016.

    Comments: in ICWSM 2016

  35. arXiv:1602.07332  [pdf, other

    cs.CV cs.AI

    Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

    Authors: Ranjay Krishna, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen, Yannis Kalantidis, Li-Jia Li, David A. Shamma, Michael S. Bernstein, Fei-Fei Li

    Abstract: Despite progress in perceptual tasks such as image classification, computers still perform poorly on cognitive tasks such as image description and question answering. Cognition is core to tasks that involve not just recognizing, but reasoning about our visual world. However, models used to tackle the rich content in images for cognitive tasks are still being trained using the same datasets designe… ▽ More

    Submitted 23 February, 2016; originally announced February 2016.

    Comments: 44 pages, 37 figures

  36. arXiv:1602.06634  [pdf, other

    cs.HC

    Atelier: Repurposing Expert Crowdsourcing Tasks as Micro-internships

    Authors: Ryo Suzuki, Niloufar Salehi, Michelle S. Lam, Juan C. Marroquin, Michael S. Bernstein

    Abstract: Expert crowdsourcing marketplaces have untapped potential to empower workers' career and skill development. Currently, many workers cannot afford to invest the time and sacrifice the earnings required to learn a new skill, and a lack of experience makes it difficult to get job offers even if they do. In this paper, we seek to lower the threshold to skill development by repurposing existing tasks o… ▽ More

    Submitted 21 February, 2016; originally announced February 2016.

    Comments: CHI 2016

    ACM Class: H.5.3

  37. Embracing Error to Enable Rapid Crowdsourcing

    Authors: Ranjay Krishna, Kenji Hata, Stephanie Chen, Joshua Kravitz, David A. Shamma, Li Fei-Fei, Michael S. Bernstein

    Abstract: Microtask crowdsourcing has enabled dataset advances in social science and machine learning, but existing crowdsourcing schemes are too expensive to scale up with the expanding volume of data. To scale and widen the applicability of crowdsourcing, we present a technique that produces extremely rapid judgments for binary and categorical labels. Rather than punishing all errors, which causes workers… ▽ More

    Submitted 14 February, 2016; originally announced February 2016.

    Comments: 10 pages, 7 figures, CHI '16, CHI: ACM Conference on Human Factors in Computing Systems (2016)

    ACM Class: H.5.m

  38. arXiv:1508.07053  [pdf, other

    cs.HC

    SentenceRacer: A Game with a Purpose for Image Sentence Annotation

    Authors: Kenji Hata, Sherman Leung, Ranjay Krishna, Michael S. Bernstein, Li Fei-Fei

    Abstract: Recently datasets that contain sentence descriptions of images have enabled models that can automatically generate image captions. However, collecting these datasets are still very expensive. Here, we present SentenceRacer, an online game that gathers and verifies descriptions of images at no cost. Similar to the game hangman, players compete to uncover words in a sentence that ultimately describe… ▽ More

    Submitted 27 August, 2015; originally announced August 2015.

    Comments: 2 pages, 2 figures, 2 tables, potential CSCW poster submission

  39. arXiv:1409.3174  [pdf, other

    cs.HC cs.PL cs.SI stat.AP

    Designing and Deploying Online Field Experiments

    Authors: Eytan Bakshy, Dean Eckles, Michael S. Bernstein

    Abstract: Online experiments are widely used to compare specific design alternatives, but they can also be used to produce generalizable knowledge and inform strategic decision making. Doing so often requires sophisticated experimental designs, iterative refinement, and careful logging and analysis. Few tools exist that support these needs. We thus introduce a language for online field experiments called Pl… ▽ More

    Submitted 10 September, 2014; originally announced September 2014.

    Comments: Proceedings of the 23rd international conference on World wide web, 283-292

    ACM Class: H.5.3

  40. arXiv:1204.2995  [pdf, other

    cs.SI cs.HC physics.soc-ph

    Analytic Methods for Optimizing Realtime Crowdsourcing

    Authors: Michael S. Bernstein, David R. Karger, Robert C. Miller, Joel Brandt

    Abstract: Realtime crowdsourcing research has demonstrated that it is possible to recruit paid crowds within seconds by managing a small, fast-reacting worker pool. Realtime crowds enable crowd-powered systems that respond at interactive speeds: for example, cameras, robots and instant opinion polls. So far, these techniques have mainly been proof-of-concept prototypes: research has not yet attempted to und… ▽ More

    Submitted 13 April, 2012; originally announced April 2012.

    Comments: Presented at Collective Intelligence conference, 2012

    Report number: CollectiveIntelligence/2012/12