On the Creativity of Large Language Models

    Read original: arXiv:2304.00008 - Published 9/19/2024 by Giorgio Franceschelli, Mirco Musolesi

    Overview

    • Large Language Models (LLMs) are transforming various areas of Artificial Intelligence, including creative writing.
    • The paper explores whether LLMs can be considered truly creative, analyzing this through the lens of creativity theories.
    • The discussion focuses on the dimensions of value, novelty, and surprise in LLM-generated outputs.
    • The authors examine different perspectives on creativity, including product, process, press, and person.
    • The paper identifies "easy" and "hard" problems in machine creativity and discusses the societal impact of these technologies.

    Plain English Explanation

    In the world of Artificial Intelligence, Large Language Models (LLMs) are revolutionizing various fields, including creative writing. These models can generate poetry, stories, and other creative content whose quality is often surprising. This raises a fundamental question: can these AI systems truly be considered creative?

    The authors of this paper delve into this intriguing question, exploring the development of LLMs through the lens of creativity theories. They focus their discussion on the key dimensions of value, novelty, and surprise, as proposed by the renowned creativity researcher, Margaret Boden.

    The paper examines creativity from four classic perspectives: product, process, press, and person. This multifaceted approach helps the authors identify both "easy" and "hard" problems in machine creativity, shedding light on the capabilities and limitations of LLMs.

    Furthermore, the paper explores the societal impact of these revolutionary technologies, particularly in the context of the creative industries. It analyzes the opportunities, challenges, and potential risks associated with the use of LLMs in creative domains, considering both legal and ethical perspectives.

    By delving into this fascinating topic, the authors aim to provide a comprehensive understanding of the current state of machine creativity and the broader implications of LLMs in shaping our creative landscape.

    Technical Explanation

    The paper begins by acknowledging the remarkable advancements in Large Language Models (LLMs) and their impact on various areas of Artificial Intelligence, including the field of creative writing. The authors then pose a fundamental question: can these AI systems truly be considered creative?

    To explore this question, the paper analyzes the development of LLMs through the lens of creativity theories. The authors focus their discussion on the dimensions of value, novelty, and surprise, as proposed by the renowned creativity researcher, Margaret Boden.

    The paper then considers four classic perspectives on creativity: product, process, press, and person. This multifaceted approach allows the authors to identify both "easy" and "hard" problems in machine creativity and to present them in relation to LLMs.

    Finally, the paper examines the societal impact of these revolutionary technologies, particularly in the context of the creative industries. It analyzes the opportunities, challenges, and potential risks associated with the use of LLMs in creative domains, considering both legal and ethical perspectives.

    Critical Analysis

    The paper raises important questions about the nature of creativity and the role of LLMs in this domain. While the generated outputs from these AI systems can be of impressive quality, the authors rightly acknowledge that the "creativity" of LLMs remains an open and complex question.

    One potential limitation of the paper is that it does not delve deeply into the specific architectures, training processes, or underlying mechanisms of LLMs. A more technical exploration of these aspects could provide additional insights into the capabilities and limitations of these models in the context of creativity.

    Furthermore, the paper could have benefited from a more extensive discussion of the ongoing debates and controversies surrounding the use of LLMs in creative industries. The potential risks, such as issues of authorship, copyright, and the displacement of human creative workers, deserve a more in-depth analysis.

    Despite these minor caveats, the paper provides a thoughtful and comprehensive examination of the intersection between LLMs and creativity. It encourages readers to think critically about the nature of machine-generated creative outputs and the broader implications of these technologies in shaping our creative landscape.

    Conclusion

    This paper delves into the fascinating question of whether Large Language Models (LLMs) can be considered truly creative, analyzing this through the lens of established creativity theories. By exploring the dimensions of value, novelty, and surprise, the authors provide a multi-faceted perspective on the capabilities and limitations of these AI systems in the realm of creativity.

    The paper's examination of the societal impact of LLMs in creative industries highlights the opportunities, challenges, and potential risks associated with these technologies. As these transformative tools continue to evolve, this research encourages critical thinking and a nuanced understanding of the complex relationship between artificial intelligence and human creativity.



    This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!


    Related Papers

    On the Creativity of Large Language Models

    Giorgio Franceschelli, Mirco Musolesi

    Large Language Models (LLMs) are revolutionizing several areas of Artificial Intelligence. One of the most remarkable applications is creative writing, e.g., poetry or storytelling: the generated outputs are often of astonishing quality. However, a natural question arises: can LLMs be really considered creative? In this article, we first analyze the development of LLMs under the lens of creativity theories, investigating the key open questions and challenges. In particular, we focus our discussion on the dimensions of value, novelty, and surprise as proposed by Margaret Boden in her work. Then, we consider different classic perspectives, namely product, process, press, and person. We discuss a set of "easy" and "hard" problems in machine creativity, presenting them in relation to LLMs. Finally, we examine the societal impact of these technologies with a particular focus on the creative industries, analyzing the opportunities offered, the challenges arising from them, and the potential associated risks, from both legal and ethical points of view.

    9/19/2024

    Divergent Creativity in Humans and Large Language Models

    Antoine Bellemare-Pepin (CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada; Music department, Concordia University, Montreal, QC, Canada), François Lespinasse (Sociology and Anthropology department, Concordia University, Montreal, QC, Canada), Philipp Thölke (CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada), Yann Harel (CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada), Kory Mathewson (Mila), Jay A. Olson (Department of Psychology, University of Toronto Mississauga, Mississauga, ON, Canada), Yoshua Bengio (Mila; Department of Computer Science and Operations Research, Université de Montréal, Montreal, QC, Canada), Karim Jerbi (CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada; UNIQUE Center)

    The recent surge in the capabilities of Large Language Models (LLMs) has led to claims that they are approaching a level of creativity akin to human capabilities. This idea has sparked a blend of excitement and apprehension. However, a critical piece that has been missing in this discourse is a systematic evaluation of LLM creativity, particularly in comparison to human divergent thinking. To bridge this gap, we leverage recent advances in creativity science to build a framework for in-depth analysis of divergent creativity in both state-of-the-art LLMs and a substantial dataset of 100,000 humans. We found evidence suggesting that LLMs can indeed surpass human capabilities in specific creative tasks such as divergent association and creative writing. Our quantitative benchmarking framework opens up new paths for the development of more creative LLMs, but it also encourages more granular inquiries into the distinctive elements that constitute human inventive thought processes, compared to those that can be artificially generated.

    5/24/2024

    Characterising the Creative Process in Humans and Large Language Models

    Surabhi S. Nath, Peter Dayan, Claire Stevenson

    Large language models appear quite creative, often performing on par with the average human on creative tasks. However, research on LLM creativity has focused solely on products, with little attention on the creative process. Process analyses of human creativity often require hand-coded categories or exploit response times, which do not apply to LLMs. We provide an automated method to characterise how humans and LLMs explore semantic spaces on the Alternate Uses Task, and contrast with behaviour in a Verbal Fluency Task. We use sentence embeddings to identify response categories and compute semantic similarities, which we use to generate jump profiles. Our results corroborate earlier work in humans reporting both persistent (deep search in few semantic spaces) and flexible (broad search across multiple semantic spaces) pathways to creativity, where both pathways lead to similar creativity scores. LLMs were found to be biased towards either persistent or flexible paths, that varied across tasks. Though LLMs as a population match human profiles, their relationship with creativity is different, where the more flexible models score higher on creativity. Our dataset and scripts are available on GitHub (https://github.com/surabhisnath/Creative_Process).

    6/7/2024

    Can Large Language Models Unlock Novel Scientific Research Ideas?

    Sandeep Kumar, Tirthankar Ghosal, Vinayak Goyal, Asif Ekbal

    An idea is nothing more nor less than a new combination of old elements (Young, J.W.). The widespread adoption of Large Language Models (LLMs) and publicly available ChatGPT have marked a significant turning point in the integration of Artificial Intelligence (AI) into people's everyday lives. This study explores the capability of LLMs in generating novel research ideas based on information from research papers. We conduct a thorough examination of 4 LLMs in five domains (e.g., Chemistry, Computer, Economics, Medical, and Physics). We found that the future research ideas generated by Claude-2 and GPT-4 are more aligned with the author's perspective than GPT-3.5 and Gemini. We also found that Claude-2 generates more diverse future research ideas than GPT-4, GPT-3.5, and Gemini 1.0. We further performed a human evaluation of the novelty, relevancy, and feasibility of the generated future research ideas. This investigation offers insights into the evolving role of LLMs in idea generation, highlighting both its capability and limitations. Our work contributes to the ongoing efforts in evaluating and utilizing language models for generating future research ideas. We make our datasets and codes publicly available.

    9/11/2024