"I'm categorizing LLM as a productivity tool": Examining ethics of LLM use in HCI research practices

2403.19876

Published 4/1/2024 by Shivani Kapania, Ruiyi Wang, Toby Jia-Jun Li, Tianshi Li, Hong Shen

Abstract

Large language models are increasingly applied in real-world scenarios, including research and education. These models, however, come with well-known ethical issues, which may manifest in unexpected ways in human-computer interaction research due to the extensive engagement with human subjects. This paper reports on research practices related to LLM use, drawing on 16 semi-structured interviews and a survey conducted with 50 HCI researchers. We discuss the ways in which LLMs are already being utilized throughout the entire HCI research pipeline, from ideation to system development and paper writing. While researchers described nuanced understandings of ethical issues, they were rarely or only partially able to identify and address those ethical concerns in their own projects. This lack of action and reliance on workarounds was explained through the perceived lack of control and distributed responsibility in the LLM supply chain, the conditional nature of engaging with ethics, and competing priorities. Finally, we reflect on the implications of our findings and present opportunities to shape emerging norms of engaging with large language models in HCI research.

Introduction

The paper discusses the increasing use of large language models (LLMs) in human-computer interaction (HCI) research and the associated ethical concerns. LLMs are being integrated throughout the research process, from ideation to data analysis and writing. Researchers perceive LLMs as enabling new possibilities for building tools, generating ideas, and simplifying workflows. However, the paper also highlights potential ethical issues, such as harmful outputs, privacy violations, intellectual integrity concerns, and overtrust in LLMs.

While HCI researchers acknowledge these ethical considerations, they often struggle to identify and address them effectively. Reasons include perceived lack of control over the LLM supply chain, lack of established best practices, and competing priorities taking precedence over ethical concerns.

The paper calls for engaging with institutional review boards (IRBs), redesigning informed consent processes, developing tools to interrupt the LLM supply chain, providing learning opportunities on the ethics of LLM use in HCI, and shifting academic incentives to prioritize ethical considerations in research.

Overall, the paper underscores the importance of foregrounding research ethics as LLMs become integrated into HCI research practices.

Related Work

The paper discusses the increasing use of large language models (LLMs) in human-computer interaction (HCI) research for various purposes, such as research ideation, data generation and analysis, system design and development, and more. It highlights the long-standing ethical considerations in HCI research, including responsible conduct in human subjects studies, privacy, informed consent, institutional review boards (IRBs), and mitigating biases. The paper acknowledges the ethical challenges brought by emerging technologies like AI, necessitating the development of new ethical guidelines and bridging the gap between ethics and AI practices.

The paper then discusses the ethical risks and harms associated with LLMs, such as discrimination and exclusion from biased training data, privacy violations, hallucinations leading to misinformation, malicious activities like scams and phishing, anthropomorphization leading to manipulation, and increasing inequality and job displacement. The authors note recent research efforts to assess and mitigate these harms, including privacy-preserving strategies, combating misinformation, prevention measures for misuse, and improving transparency and explainability.

However, the paper identifies a gap in understanding the unique ethical challenges faced by HCI researchers when integrating LLMs into their research projects. It aims to explore these challenges and examine how researchers manage them in practice.

Methods

The paper describes a mixed-method study to examine how HCI researchers apply large language models (LLMs) in their research workflows and their ethical considerations for using LLM-based tools. The study involved conducting a survey with 50 respondents to gather broad perspectives, followed by semi-structured interviews with 16 HCI researchers to investigate their approaches to ethical considerations in more detail.

The survey aimed to identify how HCI researchers use LLMs and any ethical challenges they encountered. It covered questions about their LLM usage, ethical considerations, and demographic information. The survey responses were analyzed using descriptive statistics and qualitative analysis of open-ended questions.
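As a rough illustration of what descriptive statistics over such survey answers can look like, the sketch below computes the share of respondents who selected each research stage in a multi-select question. The responses, stage labels, and the `stage_shares` helper are hypothetical and invented for this example; they are not the paper's data or analysis code.

```python
from collections import Counter

# Hypothetical multi-select answers: each inner list holds the research
# stages one respondent reported using LLMs for. NOT data from the paper.
responses = [
    ["ideation", "paper writing"],
    ["data analysis"],
    ["ideation", "system building", "paper writing"],
    ["paper writing"],
]


def stage_shares(answers):
    """Return {stage: fraction of respondents who selected that stage}."""
    # set(answer) ensures a respondent is counted at most once per stage.
    counts = Counter(stage for answer in answers for stage in set(answer))
    total = len(answers)
    return {stage: count / total for stage, count in counts.items()}


shares = stage_shares(responses)
# Print stages from most to least frequently selected.
for stage, share in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{stage}: {share:.0%}")
```

Counts like these only summarize the closed-ended questions; the open-ended answers would still need the qualitative analysis the paper describes.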

The interviews focused on LLM use across the research workflow, specific ethical considerations, the process of navigating ethical concerns, the role of IRBs and ethical frameworks, and incentives and accountability. Participants were recruited through various channels, and the interviews were conducted online. The qualitative data from the interviews were analyzed using reflexive thematic analysis.

The study employed a rigorous approach to data collection and analysis, involving multiple coders, iterative coding, and regular team discussions to define themes. Together, the survey and interviews were intended to provide a holistic view of LLM use practices and ethical considerations among HCI researchers.

Findings

The paper discusses how HCI researchers are using large language models (LLMs) such as ChatGPT, the GPT-4 API, Bard, and Bing Chat at various stages of their research workflow, including ideation, literature review, study design, data analysis, system building, evaluation, and paper writing. Researchers perceive LLMs as enabling new possibilities in their work.

However, researchers express several ethical concerns regarding LLM use, such as potential harms from biased or harmful outputs, privacy violations from data leaks, ambiguity around intellectual integrity and authorship, overtrust and overreliance on LLMs, and environmental and societal impacts.

The paper outlines four main approaches researchers take to navigate these ethical concerns:

Conditional and reactive engagement, where they address ethics based on perceived risk levels or react after issues arise rather than proactively.
Limited disclosure practices, with researchers often not formally reporting LLM use to study participants, IRBs, or in publications, likening LLMs to productivity tools.
Restricting LLM use, carefully verifying outputs before use, and reflecting through group discussions to mitigate risks.
Delaying responsibility by expressing uncertainty over who is accountable for ethical issues and postponing addressing concerns to avoid hindering innovation.

Overall, while aware of potential ethical pitfalls, researchers currently lack clear guidelines and tend to employ ad-hoc strategies that do not directly address underlying ethical challenges.

Discussion

The paper discusses the implications of HCI researchers' approaches to using large language models (LLMs) in their research projects and provides opportunities to better engage with ethical considerations.

Key points:

Most researchers did not consider it necessary to report LLM usage to Institutional Review Boards (IRBs), which could lead to challenges in replicating studies and understanding methodological decisions.
Researchers acknowledged limited transparency to study participants regarding LLM usage, often rationalizing it as avoiding overwhelming them with technical details.
Researchers expressed a lack of control over the functionality and outputs of LLMs, as they are situated downstream in the "LLM supply chain."
The paper suggests proactively engaging with IRBs, re-examining informed consent processes, developing tools and methods to interrupt the LLM supply chain, creating learning opportunities on ethics of LLM use, and shifting academic incentives to foreground ethical concerns.
It calls for collaborative efforts between researchers, policymakers, and LLM companies to create "living guidelines" for responsible use of LLMs in research.
The paper advocates for a cultural shift within academia to demonstrate a commitment to ethical use of LLMs, potentially through changes in recognition and funding criteria.

Limitations and Future Work

The study explores emerging practices around ethics and large language models (LLMs) among human-computer interaction (HCI) researchers. However, it has certain limitations:

Sample constraints: The sample was limited due to the exploratory nature of the study and the snowball sampling method used. HCI research encompasses diverse traditions, many of which were not included, highlighting the need for more comprehensive and systematic future studies.
Geographical bias: The interview sample primarily consisted of researchers from the USA. Future research should explore ethical challenges and practices related to LLM usage by researchers from other regions.
Potential selection bias: The study may have attracted respondents who are conscious of their LLM usage and open to discussing their experiences, overlooking perspectives from researchers with different levels of awareness or willingness to discuss the topic.

The paper suggests that future research should explore how ethical practices with LLMs vary across different research methodologies, domains, and settings (industry and academia) to gain a broader perspective on the subject.

Conclusion

The paper explores how human-computer interaction (HCI) researchers have integrated large language models (LLMs) into their research practices and the ethical concerns they have encountered. The authors conducted a survey and interviews to gather empirical data.

The results indicate that while HCI researchers are using LLMs across various stages of their research process and are aware of potential ethical issues, they often face challenges in effectively identifying and navigating those concerns within their own projects.

Based on these findings, the paper discusses potential approaches to support the formation of emerging ethical norms for using LLMs in HCI research. The authors encourage HCI researchers to engage with institutional review boards (IRBs), collaborate with policymakers and generative AI companies to create guidelines for responsible LLM use, and re-examine the informed consent process.

The paper also highlights the need for technological support to interrupt the LLM supply chain and the importance of creating learning opportunities for HCI researchers to understand the ethics of LLM use. Additionally, the authors suggest shifting academic incentives to prioritize ethical concerns.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Apprentices to Research Assistants: Advancing Research with Large Language Models

M. Namvarpour, A. Razi

Large Language Models (LLMs) have emerged as powerful tools in various research domains. This article examines their potential through a literature review and firsthand experimentation. While LLMs offer benefits like cost-effectiveness and efficiency, challenges such as prompt tuning, biases, and subjectivity must be addressed. The study presents insights from experiments utilizing LLMs for qualitative analysis, highlighting successes and limitations. Additionally, it discusses strategies for mitigating challenges, such as prompt optimization techniques and leveraging human expertise. This study aligns with the 'LLMs as Research Tools' workshop's focus on integrating LLMs into HCI data work critically and ethically. By addressing both opportunities and challenges, our work contributes to the ongoing dialogue on their responsible application in research.

4/10/2024

cs.HC cs.AI cs.LG

AI Act and Large Language Models (LLMs): When critical issues and privacy impact require human and ethical oversight

Nicola Fabiano

The imposing evolution of artificial intelligence systems and, specifically, of Large Language Models (LLM) makes it necessary to carry out assessments of their level of risk and the impact they may have in the area of privacy, personal data protection and at an ethical level, especially on the weakest and most vulnerable. This contribution addresses human oversight, ethical oversight, and privacy impact assessment.

4/3/2024

cs.CY cs.AI cs.CL

Modeling Emotions and Ethics with Large Language Models

Edward Y. Chang

This paper explores the integration of human-like emotions and ethical considerations into Large Language Models (LLMs). We first model eight fundamental human emotions, presented as opposing pairs, and employ collaborative LLMs to reinterpret and express these emotions across a spectrum of intensity. Our focus extends to embedding a latent ethical dimension within LLMs, guided by a novel self-supervised learning algorithm with human feedback (SSHF). This approach enables LLMs to perform self-evaluations and adjustments concerning ethical guidelines, enhancing their capability to generate content that is not only emotionally resonant but also ethically aligned. The methodologies and case studies presented herein illustrate the potential of LLMs to transcend mere text and image generation, venturing into the realms of empathetic interaction and principled decision-making, thereby setting a new precedent in the development of emotionally aware and ethically conscious AI systems.

4/23/2024

cs.CL cs.AI

Large Language Models for Education: A Survey and Outlook

Shen Wang, Tianlong Xu, Hang Li, Chaoli Zhang, Joleen Liang, Jiliang Tang, Philip S. Yu, Qingsong Wen

The advent of Large Language Models (LLMs) has brought in a new era of possibilities in the realm of education. This survey paper summarizes the various technologies of LLMs in educational settings from multifaceted perspectives, encompassing student and teacher assistance, adaptive learning, and commercial tools. We systematically review the technological advancements in each perspective, organize related datasets and benchmarks, and identify the risks and challenges associated with deploying LLMs in education. Furthermore, we outline future research opportunities, highlighting the potential promising directions. Our survey aims to provide a comprehensive technological picture for educators, researchers, and policymakers to harness the power of LLMs to revolutionize educational practices and foster a more effective personalized learning environment.

4/3/2024

cs.CL cs.AI