Differential Impact of Early vs. Late Errors on Users’ Reliance on Algorithms by rntn

Abstract

Errors are a natural part of predictive algorithms, but may discourage users from relying on algorithms. We conduct two experiments to demonstrate that reliance on a predictive algorithm following a substantial error is affected by (i) when the error occurs and (ii) how the algorithm is used in the decision-making process. We find that the impact of an error on reliance depends on whether the error occurs early (i.e., when users first start using the algorithm) or late (i.e., after users have used the algorithm for an extended period). While an early error results in substantial and persistent reliance reduction, a late error affects reliance only temporarily and to a lesser extent. However, when users have more control over how to use the algorithm’s predictions, error timing ceases to have a significant impact. Our work advances the understanding of algorithm aversion and informs the practical design of algorithmic decision-making systems.

When Algorithms Err: Differential Impact of Early vs. Late Errors on Users’ Reliance on Algorithms
