research-article
Free Access
Abstract
Errors are a natural part of predictive algorithms, but may discourage users from relying on algorithms. We conduct two experiments to demonstrate that reliance on a predictive algorithm following a substantial error is affected by (i) when the error occurs and (ii) how the algorithm is used in the decision-making process. We find that the impact of an error on reliance depends on whether the error occurs early (i.e., when users first start using the algorithm) or late (i.e., after users have used the algorithm for an extended period). While an early error results in substantial and persistent reliance reduction, a late error affects reliance only temporarily and to a lesser extent. However, when users have more control over how to use the algorithm’s predictions, error timing ceases to have a significant impact. Our work advances the understanding of algorithm aversion and informs the practical design of algorithmic decision-making systems.
- Ahmed Abbasi, Hsinchun Chen, and Arab Salem. 2008. Sentiment analysis in multiple languages: Feature selection for opinion classification in web forums. ACM Transactions on Information Systems (TOIS) 26, 3 (2008), 1–34.Google Scholar
Digital Library
- Michael D Abràmoff, Philip T Lavin, Michele Birch, Nilay Shah, and James C Folk. 2018. Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices. NPJ digital medicine 1, 1 (2018), 1–8.Google Scholar
- Ritu Agarwal and Jayesh Prasad. 1998. A conceptual and operational definition of personal innovativeness in the domain of information technology. Information systems research 9, 2 (1998), 204–215.Google Scholar
- Ashish Arora, Jonathan P Caulkins, and Rahul Telang. 2006. Research note: Sell first, fix later: Impact of patching on software quality. Management Science 52, 3 (2006), 465–471.Google Scholar
Digital Library
- Solomon E Asch. 1946. Forming impressions of personality.The Journal of Abnormal and Social Psychology 41, 3(1946), 258.Google Scholar
- Timothy W Bickmore and Rosalind W Picard. 2005. Establishing and maintaining long-term human-computer relationships. ACM Transactions on Computer-Human Interaction (TOCHI) 12, 2 (2005), 293–327.Google Scholar
Digital Library
- John Bohannon. 2016. Mechanical Turk upends social sciences. (2016).Google Scholar
- Noah Castelo, Maarten W Bos, and Donald R Lehmann. 2019. Task-dependent algorithm aversion. Journal of Marketing Research 56, 5 (2019), 809–825.Google Scholar
Cross Ref
- Joseph Cazier, Benjamin Shao, and Robert St Louis. 2017. Value congruence, trust, and their effects on purchase intention and reservation price. ACM Transactions on Management Information Systems (TMIS) 8, 4(2017), 1–28.Google Scholar
Digital Library
- Bogeum Choi, Austin Ward, Yuan Li, Jaime Arguello, and Robert Capra. 2019. The effects of task complexity on the use of different types of information in a search assistance tool. ACM Transactions on Information Systems (TOIS) 38, 1 (2019), 1–28.Google Scholar
Digital Library
- Robyn M Dawes. 1971. A case study of graduate admissions: Application of three principles of human decision making.American psychologist 26, 2 (1971), 180.Google Scholar
- Robyn M Dawes. 1979. The robust beauty of improper linear models in decision making.American psychologist 34, 7 (1979), 571.Google Scholar
- Maria De-Arteaga, Riccardo Fogliato, and Alexandra Chouldechova. 2020. A Case for Humans-in-the-Loop: Decisions in the Presence of Erroneous Algorithmic Scores. arXiv preprint arXiv:2002.08035(2020).Google Scholar
- Berkeley J Dietvorst and Soaham Bharti. 2020. People reject algorithms in uncertain decision domains because they have diminishing sensitivity to forecasting error. Psychological science 31, 10 (2020), 1302–1314.Google Scholar
- Berkeley J Dietvorst, Joseph P Simmons, and Cade Massey. 2015. Algorithm aversion: People erroneously avoid algorithms after seeing them err.Journal of Experimental Psychology: General 144, 1(2015), 114.Google Scholar
- Berkeley J Dietvorst, Joseph P Simmons, and Cade Massey. 2018. Overcoming algorithm aversion: People will use imperfect algorithms if they can (even slightly) modify them. Management Science 64, 3 (2018), 1155–1170.Google Scholar
Digital Library
- Robert Fildes and Paul Goodwin. 2007. Against your better judgment? How organizations can improve their use of management judgment in forecasting. Interfaces 37, 6 (2007), 570–576.Google Scholar
Digital Library
- Ben Green and Yiling Chen. 2019. The principles and limits of algorithm-in-the-loop decision making. Proceedings of the ACM on Human-Computer Interaction 3, CSCW(2019), 1–24.Google Scholar
Digital Library
- William M Grove, David H Zald, Boyd S Lebow, Beth E Snitz, and Chad Nelson. 2000. Clinical versus mechanical prediction: a meta-analysis.Psychological assessment 12, 1 (2000), 19.Google Scholar
- Junius Gunaratne, Lior Zalmanson, and Oded Nov. 2018. The persuasive power of algorithmic and crowdsourced advice. Journal of Management Information Systems 35, 4 (2018), 1092–1120.Google Scholar
Cross Ref
- Kotaro Hara, Abigail Adams, Kristy Milland, Saiph Savage, Chris Callison-Burch, and Jeffrey P Bigham. 2018. A data-driven analysis of workers’ earnings on Amazon Mechanical Turk. In Proceedings of the 2018 CHI conference on human factors in computing systems. 1–14.Google Scholar
Digital Library
- Michael P Haselhuhn, Maurice E Schweitzer, and Alison M Wood. 2010. How implicit beliefs influence trust recovery. Psychological Science 21, 5 (2010), 645–648.Google Scholar
Cross Ref
- Kevin Anthony Hoff and Masooda Bashir. 2015. Trust in automation: Integrating empirical evidence on factors that influence trust. Human factors 57, 3 (2015), 407–434.Google Scholar
Cross Ref
- Panagiotis G Ipeirotis. 2010. Analyzing the amazon mechanical turk marketplace. XRDS: Crossroads, The ACM magazine for students 17, 2(2010), 16–21.Google Scholar
- Leanna Ireland. 2019. Who errs? Algorithm aversion, the source of judicial error, and public support for self-help behaviors. Journal of Crime and Justice(2019), 1–19.Google Scholar
- Ryan Kennedy, Philip Waggoner, and Matthew Ward. 2018. Trust in Public Policy Algorithms. Available at SSRN 3339475(2018).Google Scholar
- Taemie Kim and Pamela Hinds. 2006. Who should I blame? Effects of autonomy and transparency on attributions in human-robot interaction. In ROMAN 2006-The 15th IEEE International Symposium on Robot and Human Interactive Communication. IEEE, 80–85.Google Scholar
Cross Ref
- Daniël Lakens, Anne M Scheel, and Peder M Isager. 2018. Equivalence testing for psychological research: A tutorial. Advances in Methods and Practices in Psychological Science 1, 2(2018), 259–269.Google Scholar
Cross Ref
- Hyung Koo Lee, Jong Seok Lee, and Mark Keil. 2018. Using perspective-taking to de-escalate launch date commitment for products with known software defects. Journal of Management Information Systems 35, 4 (2018), 1251–1276.Google Scholar
Cross Ref
- John Lee and Neville Moray. 1992. Trust, control strategies and allocation of function in human-machine systems. Ergonomics 35, 10 (1992), 1243–1270.Google Scholar
Cross Ref
- John D Lee and Neville Moray. 1994. Trust, self-confidence, and operators’ adaptation to automation. International journal of human-computer studies 40, 1 (1994), 153–184.Google Scholar
- John D Lee and Katrina A See. 2004. Trust in automation: Designing for appropriate reliance. Human factors 46, 1 (2004), 50–80.Google Scholar
- Qing Li, Yuanzhu Chen, Li Ling Jiang, Ping Li, and Hsinchun Chen. 2016. A tensor-based information framework for predicting the stock market. ACM Transactions on Information Systems (TOIS) 34, 2 (2016), 1–30.Google Scholar
Digital Library
- Kai H Lim, Izak Benbasat, and Lawrence M Ward. 2000. The role of multimedia in changing first impression bias. Information Systems Research 11, 2 (2000), 115–136.Google Scholar
Digital Library
- Yu-Kai Lin, Hsinchun Chen, Randall A Brown, Shu-Hsing Li, and Hung-Jen Yang. 2017. Healthcare predictive analytics for risk profiling in chronic care: A Bayesian multitask learning approach.MIS Quarterly 41, 2 (2017).Google Scholar
- Jennifer M Logg, Julia A Minson, and Don A Moore. 2019. Algorithm appreciation: People prefer algorithmic to human judgment. Organizational Behavior and Human Decision Processes 151 (2019), 90–103.Google Scholar
Cross Ref
- Chiara Longoni, Andrea Bonezzi, and Carey K Morewedge. 2019. Resistance to medical artificial intelligence. Journal of Consumer Research 46, 4 (2019), 629–650.Google Scholar
Cross Ref
- Robert B Lount, Chen-Bo Zhong, Niro Sivanathan, and J Keith Murnighan. 2008. Getting off on the wrong foot: The timing of a breach and the restoration of trust. Personality and Social Psychology Bulletin 34, 12 (2008), 1601–1612.Google Scholar
Cross Ref
- Xueming Luo, Siliang Tong, Zheng Fang, and Zhe Qu. 2019. Frontiers: Machines vs. humans: The impact of artificial intelligence chatbot disclosure on customer purchases. Marketing Science 38, 6 (2019), 937–947.Google Scholar
Digital Library
- Dietrich Manzey, Juliane Reichenbach, and Linda Onnasch. 2012. Human performance consequences of automated decision aids: The impact of degree of automation and system experience. Journal of Cognitive Engineering and Decision Making 6, 1 (2012), 57–87.Google Scholar
Cross Ref
- Scott Mayer McKinney, Marcin Sieniek, Varun Godbole, Jonathan Godwin, Natasha Antropova, Hutan Ashrafian, Trevor Back, Mary Chesus, Greg C Corrado, Ara Darzi, et al. 2020. International evaluation of an AI system for breast cancer screening. Nature 577, 7788 (2020), 89–94.Google Scholar
- D Harrison Mcknight, Michelle Carter, Jason Bennett Thatcher, and Paul F Clay. 2011. Trust in a specific technology: An investigation of its components and measures. ACM Transactions on management information systems (TMIS) 2, 2(2011), 1–25.Google Scholar
- Mahsan Nourani, Donald R Honeycutt, Jeremy E Block, Chiradeep Roy, Tahrima Rahman, Eric D Ragan, and Vibhav Gogate. 2020. Investigating the Importance of First Impressions and Explainable AI with Interactive Video Analysis. In Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems. 1–8.Google Scholar
- Mahsan Nourani, Joanie King, and Eric Ragan. 2020. The role of domain expertise in user trust and the impact of first impressions with intelligent systems. In Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, Vol. 8. 112–121.Google Scholar
Cross Ref
- Paul A Pavlou and Angelika Dimoka. 2006. The nature and role of feedback text comments in online marketplaces: Implications for trust building, price premiums, and seller differentiation. Information Systems Research 17, 4 (2006), 392–414.Google Scholar
Digital Library
- Andrew Prahl and Lyn Van Swol. 2017. Understanding algorithm aversion: When is advice from automation discounted?Journal of Forecasting 36, 6 (2017), 691–702.Google Scholar
- Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. 2016. ” Why should i trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 1135–1144.Google Scholar
Digital Library
- Richard M Ryan and Edward L Deci. 2000. Self-determination theory and the facilitation of intrinsic motivation, social development, and well-being.American psychologist 55, 1 (2000), 68.Google Scholar
- Julian Sanchez. 2006. Factors that affect trust and reliance on an automated aid. Georgia Institute of Technology.Google Scholar
- Nada R Sanders and Karl B Manrodt. 2003. The efficacy of using judgmental versus quantitative forecasting methods in practice. Omega 31, 6 (2003), 511–522.Google Scholar
Cross Ref
- James Schaffer, John O’Donovan, James Michaelis, Adrienne Raglin, and Tobias Höllerer. 2019. I can do better than your AI: expertise and explanations. In Proceedings of the 24th International Conference on Intelligent User Interfaces. 240–251.Google Scholar
Digital Library
- Oliver Schilke, Martin Reimann, and Karen S Cook. 2013. Effect of relationship experience on trust recovery following a breach. Proceedings of the National Academy of Sciences 110, 38(2013), 15236–15241.Google Scholar
Cross Ref
- Robert P Schumaker and Hsinchun Chen. 2009. Textual analysis of stock market prediction using breaking financial news: The AZFin text system. ACM Transactions on Information Systems (TOIS) 27, 2 (2009), 1–19.Google Scholar
Digital Library
- Maurice E Schweitzer and Gérard P Cachon. 2000. Decision bias in the newsvendor problem with a known demand distribution: Experimental evidence. Management Science 46, 3 (2000), 404–420.Google Scholar
Cross Ref
- Donghui Shi, Jian Guan, Jozef Zurada, and Andrew Manikas. 2017. A data-mining approach to identification of risk factors in safety management systems. Journal of management information systems 34, 4 (2017), 1054–1081.Google Scholar
Cross Ref
- John J Skowronski and Donal E Carlston. 1989. Negativity and extremity biases in impression formation: A review of explanations.Psychological bulletin 105, 1 (1989), 131.Google Scholar
- Scott I Vrieze and William M Grove. 2009. Survey on the use of clinical and mechanical prediction methods in clinical psychology.Professional Psychology: Research and Practice 40, 5(2009), 525.Google Scholar
- Weiquan Wang and Izak Benbasat. 2008. Attributions of trust in decision support technologies: A study of recommendation agents for e-commerce. Journal of Management Information Systems 24, 4 (2008), 249–273.Google Scholar
Digital Library
- Weiquan Wang and Izak Benbasat. 2016. Empirical assessment of alternative designs for enhancing different types of trusting beliefs in online recommendation agents. Journal of Management Information Systems 33, 3 (2016), 744–775.Google Scholar
- Jenna Wiens and Erica S Shenoy. 2018. Machine learning for healthcare: on the verge of a major shift in healthcare epidemiology. Clinical Infectious Diseases 66, 1 (2018), 149–153.Google Scholar
Cross Ref
- Michael Yeomans, Anuj Shah, Sendhil Mullainathan, and Jon Kleinberg. 2019. Making sense of recommendations. Journal of Behavioral Decision Making 32, 4 (2019), 403–414.Google Scholar
Cross Ref
- Kun Yu, Shlomo Berkovsky, Dan Conway, Ronnie Taib, Jianlong Zhou, and Fang Chen. 2016. Trust and reliance based on system accuracy. In Proceedings of the 2016 Conference on User Modeling Adaptation and Personalization. 223–227.Google Scholar
Digital Library
- Kun Yu, Shlomo Berkovsky, Ronnie Taib, Dan Conway, Jianlong Zhou, and Fang Chen. 2017. User trust dynamics: An investigation driven by differences in system performance. In Proceedings of the 22nd International Conference on Intelligent User Interfaces. 307–317.Google Scholar
Digital Library
Index Terms
-
When Algorithms Err: Differential Impact of Early vs. Late Errors on Users’ Reliance on Algorithms
-