Tracking Down the Musical Habitus of the Machine

JOHANNES TREẞ
University of Education Freiburg, Germany

June 2025

Published in Action, Criticism, and Theory for Music Education 24 (3): 79–108 [pdf]. https://doi.org/10.22176/act24.3.79


Abstract: AI-based applications and content have become integral to our everyday lives, increasingly permeating the field of music education e.g. through algorithmically sorted listening recommendations, AI-generated lesson plans, or audio content on platforms like TikTok and YouTube. These tools rely on machine learning and deep learning, which, despite their statistical foundation, are deeply entangled with social structures. This paper explores the concept of a “musical habitus of the machine,” examining how AI-based content inherits classifications, dispositions, and structures in music education. Three examples are analyzed: lesson planning with ChatGPT, image generation via Midjourney, and song production using Suno.ai. The findings highlight that the uncritical use of AI-generated content in music classrooms threatens the diversity of music education, potentially leading to a global homogenization of musical practices. The paper also suggests strategies to preserve diversity and agency in the face of these transformative processes.

Keywords: Digital habitus, music education, AI, bias, critical ed-tech


AI-based applications and content are increasingly woven into the fabric of our daily lives (Bearman and Ajjawi 2023) and therefore impact the realm of music education, as well. This influence is evident in the use of algorithmically-curated recommendations, AI-generated lesson materials, and audio content on platforms like TikTok and YouTube (Seaver 2022). The growing influence of software algorithms on our daily social actions, thoughts, and tastes is no longer a surprise. With the advent of more and more diverse AI-generating software solutions and the promises of commercial providers to simplify or even fully automate tasks such as lesson preparation, songwriting, or image generation, the field of music education is poised for significant transformation (Hadi Mogavi et al. 2024).

As this paper will demonstrate, Pierre Bourdieu’s scientific vocabulary offers profound insights related to AI and music education, highlighting the urgent need for empirically grounded knowledge and reflection amid complex socio-technological entanglements of humans and algorithms. Drawing on Bourdieu’s oeuvre, the paper explores the concept of “musical habitus of the machine” as it emerges from AI-generated content, examining how embedded music specific classifications, dispositions, structures, and traces impact music education. Employing an ethnographic approach to algorithms (Seaver 2017), three pertinent examples will be analyzed: music lesson planning with ChatGPT, AI-based image generation via Midjourney, and song production using Suno.ai. The analysis is followed by further discussions of possible ways of tackling these transformation processes.

Habitus, Field and Capital: Theoretical Foundations

In response to the call for this special issue, the following insights focus on the relationship between Pierre Bourdieu’s theoretical framework, music education, and AI technologies.

Habitus as a Fundamental Principle in the Work of Pierre Bourdieu

The concept of habitus is central to the sociological work of Pierre Bourdieu, one of the most influential sociologists of the 20th century. Although the term itself is part of a discourse that goes back much further (Sterne 2003), we owe Bourdieu the sharpening of theoretical and analytical tools that help us to understand and explain the social world and its mechanisms of power (Swartz 1997, 100–03). Bourdieu refined the concept numerous times (Maton 2012). In one of his later works, he defines habitus as follows:

The conditionings associated with a particular class of conditions of existence produce habitus, systems of durable, transposable dispositions, structured structures predisposed to function as structuring structures, that is, as principles which generate and organize practices and representations that can be objectively adapted to their outcomes without presupposing a conscious aiming at ends or an express mastery of the operations necessary in order to attain them. Objectively “regulated” and “regular” without being in any way the product of obedience to rules, they can be collectively orchestrated without being the product of the organizing action of a conductor. (Bourdieu 1990, 53)

According to this definition, habitus refers to the deeply ingrained routines, skills, and schemes that individuals acquire through their life experiences. It is a system of lasting structure that functions at a pre-conscious level, guiding thoughts, perceptions, and actions in a way that is often taken for granted and deeply ingrained in the human body (Bourdieu 1990, 54). At the individual level, a range of actions is made possible within the limits set by one’s habitus, facilitating a type of freedom that is contextually constrained (Rehbein 2016, 89). The concept therefore addresses the interplay between individual agency, collective interaction processes, and social structures, providing a framework for understanding how social practices, norms, and power relations are incorporated and reproduced across generations (Edgerton and Roberts 2014, 198). Bourdieu (1977) also emphasizes that emerging habitus are a product of collective history and objective structures (like language, economy, or educational systems) that individuals internalize through socialization.

It is important to mention that the concept represents two sides of the same coin. By “structured structure,” Bourdieu (1990) refers to habitus as a system that is structured because it is the result of socialization and influences from specific social conditions. In this context, Bourdieu also speaks of the so-called “opus operatum” (52) as the result and objectifiable dimension of habitus. Habitus in the sense of a “modus operandi” (Bourdieu 1990, 52) or as “structuring structure” implies that it influences how individuals act in the present world (and future), thus contributing to the reproduction of social logics. As mentioned earlier Bourdieu’s praxeological approach, emphasizing the interplay between social structures and individual agency, has significantly influenced various academic fields (Maton 2012).

Habitus in Context: The Roles of Field and Capital

In addition to habitus, Bourdieu’s concept of field illustrates how social practices are structured in different spheres of life. A field, as defined in Bourdieu’s theory, is a structured social space in which individuals and groups interact according to a particular set of rules, norms, and power relations (Bourdieu 1984, 226). Fields, such as those of art, science, education or technology, create their own boundaries and are characterized by a unique logic that governs social interactions within them. Each field is somewhat autonomous and is characterized by internal struggles for influence, power, and authority. Importantly, the fields are not isolated; they are interconnected and overlap with others, allowing influences and resources to flow between them. In these fields, capital—resources or assets that individuals and groups accumulate and use—is essential for navigating and influencing social hierarchies (Swartz 1997, 62). Inside the fields, actors position themselves based on relative forms of capital. Bourdieu distinguishes several types of capital: economic capital, which includes material resources such as money or property; cultural capital, which includes intangible assets such as education, knowledge, and aesthetic preferences; social capital, which refers to social connections and networks; and symbolic capital, which represents prestige and recognition that confer authority within a field. The value of each form of capital varies according to the specific logic of the field and influences the possibilities for action and influence of an individual or group (Swartz 1997, 75).

The interplay of habitus, field, and capital offers a comprehensive view of social behavior (Bourdieu 1984, 103). Habitus, the deeply ingrained dispositions and practices that individuals develop through socialization, enables actors to instinctively navigate fields. However, within the boundaries of each field—and through the resources available as capital—habitus finds expression and is either strengthened or challenged.

The Habitus Concept in Music Education

One of the first prominent references to Bourdieu’s concept of habitus in music education is the work of Rimmer (2006, 2010, 2012). He focused on how musical taste and practice are socially conditioned. Drawing heavily on Bourdieu’s work, he elaborated that, as a concept, musical habitus “provides a means of explaining how individuals’ relationships to music and their associated embodiment of cultural capital … enduringly connect to factors associated with their socialization and social locations” (Rimmer 2012, 306). The studies he presented (2006, 2010, 2012) emphasized the role of early life experiences, family and educational backgrounds, and broader socio-historical contexts in shaping musical dispositions. The high relevance of embodied knowledge as a component of habitus, which Bourdieu pointed out several times, proved to be particularly relevant for musical practice and music education related questions. According to Prior (2011), the study of musical habitus also focused on questions “around the enculturation of the musically adept body—a body in which specific musical competences are sedimented through processes of socialization” (130).

While the studies cited so far are more concerned with the enculturation of existing norms, practices, and knowledge, Hall’s (2015) work dealt with the emergence of musical habitus of choir boys. She clearly emphasized the transformative power of habitus for choir boys: “The innovative possibilities of habitus are suggested by the boy’s creative abilities to dip in and out of cultures narratives of masculinity” (55).

The volume Bourdieu and the Sociology of Music Education, edited by Burnard, Trulsson and Söderman (2015), offered a comprehensive and multifaceted insight into the range of music education research drawing on Bourdieu’s concepts and considerations.[1] In the introductory chapter (Söderman et al. 2015), the editors presented major areas through which they categorized the field of music education research with reference to Bourdieu: “Musical Taste and Identity” (5), “Musical Fields” (6–7) and “Musical Rules and Logics” (8–9). The authors emphasized that a music education based on Bourdieu’s theoretical foundations places topics such as “social justice, equality, gender, cultural production, power, taste and transformative practice” (10) at the center. By integrating a wide range of musical cultures and practices—including Western classical music, popular music and world music—Burnard and Trulsson (2015) reflected, for example, the cultural hybridity of modern societies and promoted musical inclusivity. Sagiv and Hall (2015) suggested that musical learning only becomes meaningful when connected to students’ personal interests and experiences, while opportunities for composing and improvising offer ways to foster creativity and self-expression. At the same time, a music education that drew on Bourdieu’s theories suggested a focus on intercultural dialogue and the dismantling of gender stereotypes, with music as a medium for empathy and social justice that at best empowers students to question norms and explore different identities (Hall 2015).

Habitus, Technology, and AI

Even if Bourdieu (1998) only occasionally dealt with technology and aspects of media studies, his work has often been the starting point for sociotechnological questions about the use of technology. Sterne (2003) already emphasized in the early 2000s that the reference to Pierre Bourdieu’s social theory can be of great value for technology research. He argued for a “social praxeology of technology,” whereby he attached great importance to understanding technologies as “particularly visible sets of crystallized subsets of practices, positions and dispositions in the habitus” (Sterne 2003, 386). Consequently, many studies in the field of technology research examined human habitus regarding technology use (Costa, Burke, and Murphy 2019; Czerniewicz and Brown 2013; Papacharissi and Easton 2013). Seaver (2017) conceptualized algorithms as integral to culture, shaped not only by rational procedures but also by institutions, people, and the everyday sensemaking as in every aspect of cultural life. This view contrasts with canonical technology-centered perspectives that understand algorithms as abstract procedures outside of human-centered cultural fields.

AI technologies are often at the center of scientific debate on researching algorithmic systems, as they rely on vast amounts of human-created data for training, encompassing text, images, and other media from diverse sources like books, websites, and databases (Hristova 2023). These datasets, often containing billions of examples, enable models to recognize patterns, context, and relationships. The scale of data is critical for the AI’s ability to generate meaningful and contextually relevant outputs, as it learns by identifying statistical probabilities within these massive corpora. The dependence on immense, curated data highlights both the complexity and human-driven foundation of generative AI development (Airoldi 2022).

The increasing pervasiveness of algorithmic systems in all areas of life is described as “algorithmic culture,” for which, according to Hristova (2023), Bourdieu’s habitus concept is particularly relevant for deeper understanding. Airoldi’s “Machine Habitus” (2022) and Romele’s “Digital Habitus” (2024) consequently highlighted the remarkable similarity between AI systems and Bourdieu’s concept: While the habitus concept originally focused on human actors, both authors concentrated on the theoretical differentiation of the habitus of algorithmic software systems, which are consequently also referred to as “eminently social animals” (Airoldi 2022, 4). Expanding on Bourdieu’s habitus concept, Airoldi (2022) formulates the following definition: “Machine habitus can be defined as the set of cultural dispositions and propensities encoded in a machine learning system through data-driven socialization processes. Since such dispositions and propensities derive from patterns in humangenerated data, they reflect social and cultural regularities crystallized in the machine-readable data contexts used for training and feedback” (113).

Given the expanded concept of machine habitus, it is imperative to consider its implications for music education. This raises the question: What internalized dispositions, schemas, and socio-cultural regularities are encoded in AI-generated data that bear significance for music education? This inquiry examines a phenomenon, which, building on Airoldi’s (2022) definition, can be termed the musical habitus of the machine.

Theoretical Underpinnings: Tracing Habitus—Uncovering Dispositions

The various perspectives on habitus discussed above show that Bourdieu’s oeuvre provides valuable perspectives that are also of great importance for music education. But even if his theoretical tools offer promising scientific potential, Costa et al. (2019) pointed out that “uncovering habitus is not a straightforward task” (21). The authors mentioned challenges in translating the broad concept into concrete analytical tools and the need to establish diverse methods and definitions of dispositions tailored to the respective research questions. Christin (2020) also stated that the scientific analysis of algorithmic systems and their inherent opacity is a major challenge, since they deal with “complex sociotechnical assemblages involving long chains of actors, technologies, and meanings” (898).

The concept of habitus is closely linked to the question of acquiring and maintaining values, norms, and societal structures. In other words, talking about habitus invokes the processes of socialization and thus also individual and collective learning processes (Nash 2003, 53–54). Accordingly, research referring to Bourdieu’s work placed particular attention on the analysis of “durable, transposable dispositions” (Bourdieu 1990, 53), which, according to Bourdieu’s definition quoted above, represent an essential ingredient of habitus.

The word “disposition” is frequently used by Bourdieu in various ways, leaving its precise meaning somewhat open. One definition he provided states that it expresses first the result of an organizing action, with a meaning close to that of words such as structure; it also designates a way of being, a habitual state (especially of the body) and, in particular, a predisposition, tendency, propensity or inclination (Bourdieu 1977).

Since the present article focuses on the analysis and differentiation of generative AI data and its specific features, I deal with non-human actors. A focus on the embodied dispositions of habitus in the original sense is therefore unfeasible (Airoldi 2022).[2] Instead, the specific products generated by AI-based algorithms are regarded as “results of an organizing action” (Bourdieu 1977, 214) that can be interpreted as documents of the habitus of the machine. As already indicated, the inherent generative structures of AI algorithms are abstracted from an extensive data set using comprehensive machine learning processes (Airoldi 2022, 35).

From an analytical perspective, the question now arises:  How to trace such a musical habitus of the machine? Seaver (2017, 1) pursued a research approach described as “ethnography of algorithmic systems.” Airoldi (2022), too, elaborated on several “research directions for the sociology of algorithms” (151) and proposed “follow the algorithm” and the “machine habitus’ dispositions” as one way to analyze “their outputs and predictions to explore how they ‘see’ the entangled users” (153). Christin (2020) drew on Seaver’s (2017) ideas and proposed a strategy that she described as “algorithmic comparison” in order to “shed light not only on the uses of algorithmic systems but also on their inner workings, regardless of how opaque and proprietary they are” (908). By analyzing similarities and differences between algorithms in different contexts, this strategy showed how specific conditions and social factors influenced the functioning and outcomes of algorithmic systems. This approach enabled the identification of patterns and deviations caused by technical and cultural factors and contributed to a deeper understanding of both the general and context-dependent properties of algorithms (Christin 2020).

Following the considerations presented here, various types of AI-generated data were subjected to open coding using qualitative content analysis, followed by a comparative analysis, as described by Kuckartz and Rädiker (2023). This analytical approach facilitated the systematic identification of contrasts and similarities within the data corpus. The combination of open coding and comparative analysis provided a structured framework for comprehending the complexity and diversity of AI-generated content, yielding profound insights into the “predisposition, tendency, propensity or inclination” (Bourdieu 1977, 214) of AI-generated data products with relevance for music education.

Analysis of AI-Generated Content Related to Music Education

The overarching areas of application selected are music lesson planning with ChatGPT, AI-generated images with Midjourney with relevance to music education, and the AI-driven music production platform Suno.ai. The data for this text comes partly from a qualitative interview study on using ChatGPT for music lesson planning (Treß 2024) and from a university seminar with music teacher training students in the summer semester of 2024. The three specific application areas were selected to ensure a high degree of relevance to classroom practice. Both lesson planning and the production of images and visual materials are often described as promising AI applications for the planning and implementation of lessons (Van Den Berg and Du Plessis 2023). The AI song-production platform Suno.ai was included in the sample to ensure a high degree of data diversity.

In the selection of examples, particular attention was given to choosing samples that were generated with very short and preferably open prompts, providing as little contextual information as possible beforehand. This approach is based on the premise that such open prompts leave a wide range of possible meanings to the AI system, which the algorithm then fills in according to its specific dispositions and the respective machine habitus (Airoldi 2022, 33).

Music Lesson Planning with ChatGPT

ChatGPT is an advanced AI-based text generation platform built on large language models and transformer architecture (Kasneci et al. 2023). The platform processes user-inputted text prompts using a sophisticated model trained on extensive text datasets. This model learns to recognize linguistic patterns, styles, and contextual relationships within the text data, enabling it to generate coherent and contextually relevant texts that align with the semantic content of the input prompts. Although ChatGPT was publicly introduced only two years ago, teachers already consider it to have an irreversible impact on all areas of education, such as classroom learning, assessment, administration and lesson planning (Bower et al. 2024; Van Den Berg and Du Plessis 2023).

The following lesson plans[3] serve as an empirical basis for a detailed analysis in this article.

Short Description: A table outlining a music lesson plan, including sections on introduction, theory instruction, practical application, and conclusion, with activities and allocated time for each section. Long Description: The image depicts a table for a music lesson plan divided into four sections: Introduction, Theory Instruction, Practical Application, and Conclusion. Each section lists specific activities along with the time allotted for each. • The "Introduction" section includes greetings, roll call, and a warm-up activity to listen to a short piece of music and make note observations, which lasts 10 minutes. • The "Theory Instruction" section covers basic music theory concepts such as notes and clefs, note values, time signatures, and the introduction of the Circle of Fifths, with a 15-minute duration. • The "Practical Application" section focuses on reading music, practicing rhythms, demonstrating notes on a keyboard, and composing a simple melody in small groups, allocated 20 minutes. • The "Conclusion" section involves groups presenting their melodies, recapping key points, and assigning homework (practice reading a short piece of music), which takes 15 minutes. Figure 1: ChatGPT prompt: “Plan a music lesson in a regular school”

The first example of a music lesson proposed by ChatGPT focusses on “Basic music theory concepts” (see Figure 1), and the didactic structure adheres to a standardized format. It begins with an introduction, followed by theoretical input, then progresses to “Practical application” of the taught theory, and concludes with a presentation, recap, and a homework task. ChatGPT does not define a specific age group, even in the more detailed chat version of the lesson outline. Although the lesson starts with an open listening exercise, it becomes evident in the subsequent sections that the content provided by the AI is exclusively focusing on “note values,” “clefs,” “time signatures,” and the “circle of fifths.” All these contents point to a strong orientation towards Western classical music theory (i.e., conventional forms of notation, time and meter, and functional harmony). It is assumed that the students have acquired basic music theory and practical knowledge in fifteen minutes and are able to sight-read both rhythms and melodies, as is planned in the next phase. Additionally, the use of a piano or keyboard is suggested during this phase for students to compose their own “simple” melodies in groups using the knowledge they have gained. From a didactic perspective, this example predominantly reflects a teacher-centered approach structured according to instructional logic. The exception to this is the planned group work during the application phase. To make matters more complicated, the duration of the lesson phases specified by ChatGPT appear to be far from sufficient for the acquisition of such complex and prerequisite music theory and practical knowledge—especially for learners with little or no prior musical knowledge and skills.

Short Description: A table outlining a music class lesson plan, with time slots, phases, content, methods, and materials for each phase. Long Description: The image presents a detailed lesson plan for a music class divided into different phases, each with its corresponding content, methods, and materials. • 0-5 min (Greeting and Introduction): The teacher welcomes the students and leads a brief class discussion about known musical instruments. The method used is a class discussion, with no specific materials required. • 5-10 min (Introduction): The teacher introduces the four main groups of musical instruments: string, wind, percussion, and keyboard instruments. This phase uses teacher presentation and class discussion, with pictures of instruments as materials. • 10-25 min (Presentation of Groups): The teacher explains and provides examples of instrument groups: string instruments (e.g., violin, viola), wind instruments (e.g., flute, trumpet), percussion instruments (e.g., drums, xylophone), and keyboard instruments (e.g., piano, organ). The teacher presentation is supported by audio examples. Materials include a laptop, projector, speakers, and pictures. • 25-35 min (Activity - Instrument Quiz): The students match sound samples to instruments using a worksheet. This is an individual or pair work activity, with worksheets and audio examples as materials. • 35-40 min (Practical Experience): If available, students view and try out real instruments. This practical activity requires the use of instruments or pictures of instruments. • 40-45 min (Conclusion and Summary): The teacher recaps the key points, holds an open Q&A session, and announces the next lesson. This is a class discussion phase with no specific materials required. Figure 2: ChatGPT prompt: “Plan a music lesson in a secondary school”

The second lesson example proposed by ChatGPT, based on the prompt “Plan a music lesson in a secondary school,” focuses on imparting knowledge about musical instruments. While the first example provided only the sections, activities, and the estimated time in a table, the lesson plan in the second example is more detailed. It includes the methods and materials, along with the times, phases, and content for each part of the lesson. From the user’s point of view, the different levels of detail are not comprehensible.

Like the first example, the lesson begins with a brief welcome and introduction, followed by the teacher-centered explanation of different instrument groups. An application phase follows, including a quiz, and the plan concludes with a summary and recapitulation of the learning content. Regarding the content, this lesson focuses on classical Western instruments. Although the categorization of the “four main groups” (strings, winds, percussion, and keyboard instruments) might allow at least some interpretive flexibility, the specific instruments listed by ChatGPT are almost exclusively from the Western classical music tradition. Additionally, the structure of the lessons, the didactic framing of individual phases, and the predominantly teacher-centered approach reflect a rather traditional understanding of teaching and learning.

The analysis reveals that ChatGPT presumes a significant level of prior knowledge among students, often underestimating the time required for complex, long-term learning processes (e.g. sight-reading) by suggesting disproportionately short learning periods for these phases.

AI-Based Image Generation via Midjourney

Midjourney is an advanced AI-based image generation platform based on generative adversarial networks or diffusion models (Ho et al. 2020). These models learn to recognize patterns, styles, and contextual relationships within the visual data, enabling them to generate images that match the semantic content of a user’s text prompt. The use of image-generating platforms such as Midjourney  plays an increasingly important role in both the teaching of different subjects and the planning of lessons, and it is impacting everyday teaching in schools, similar to AI-generated text data (Chiu 2023).

The image material analyzed comes from an AI-focused seminar on music teacher education, held at the University of Education in Freiburg in collaboration with the Freiburg University of Music. Students used the software Midjourney to generate music education-related images based on prompts they created in small groups. These images were later discussed and evaluated in a plenary session.

Short Description: A collage of children playing various musical instruments, including flutes, violins, pianos, guitars, and trumpets, in a music class or performance setting. Long Description: The image is a collage of two rows featuring children playing different musical instruments: • Top row: Children are playing the flute, violin, and piano. • Bottom row: A mix of children playing violins, guitars, and brass instruments, such as trumpets.
Figure 3: Midjourney prompt: “music education in school”

In the first example, a series[4] of images was generated by students using Midjourney based on the prompt “music education in school.”[5] The musical instruments listed in the second lesson plan above also appear in this visual representation. The examples exclusively show string instruments (predominantly violins), keyboards, woodwind and brass instruments (albeit in oddly modified and varied forms, as seen in the top left image), as well as classical guitars. The students depicted in the images, with their clothing, neat appearance, upright and highly focused postures, and predominantly directed gazes towards sheet music or music stands, suggest musical habitus shaped by a well-educated middle-class background. These images visualize musical practices that adhere to the Western classical ideal of performing and reproducing existing works with traditional instruments.

Short Description: A collage of four paintings depicting dinosaurs playing musical instruments in a jungle setting. Long Description: The image is a collage of four paintings showing dinosaurs in a lush, jungle-like environment, each playing different musical instruments: • First painting: A dinosaur skeleton standing beside a piano with sheet music open in front of it. • Second painting: A dinosaur playing an old-fashioned piano in a dense, green jungle. • Third painting: A dinosaur standing next to a cello, with musical notes scattered around in the foliage. • Fourth painting: A dinosaur holding a violin, sitting next to an open music book in a vibrant, tropical setting.
Figure 4: Midjourney prompt: “A dinosaur in music classroom”

The second example series of images suggests a clear contrast to the first image series. The prompt to imagine “a dinosaur in music classroom” exhibit a significant predominance of pianos, string instruments, and sheet music. In three of the images, a dinosaur is depicted holding a conductor’s baton in its forepaw, prominently highlighting another element of Western classical art music—the conductor as the central guiding figure that provides essential knowledge to a large ensemble of musicians and conveys it in a compelling manner for everyone by the baton. The depicted habitus is also reminiscent of a long-obsolete yet still extremely powerful image of the Eurocentric, romanticized, and inevitably lonely artistic genius.[6] The preference for Western classical music evident in ChatGPT’s lesson plans above is unmistakable in the images generated by Midjourney, highlighting a profound bias in favor of Western classical music. Even though only a limited selection can be shown here, I have created a large number of music education images in various contexts over the past few weeks and months with Midjourney. Almost all these images consistently feature an overwhelming presence of string instruments and piano keyboards in numerous forms, shapes, and colors. Even when explicitly prompted to exclude such instruments, a complete omission of Western musical instruments appears nearly unattainable for the image-generation algorithm of Midjourney.

Song Production Using Suno.ai

The third application discussed here focusses on the AI-based song generation platform Suno.ai. Even though the generative algorithms of Suno.ai differ from text and image generating systems, the fundamental dependency on large amounts of training data and the pattern recognition based on it is similar (Yu et al. 2024). The promised potential of generative music production platforms is already finding its way into scientific articles, albeit largely without critical scrutiny. Moreover, the promise is already being incorporated into the lesson plans and music education strategies of individual associations (NAfME 2024). Since the software environment features the generation of audio data, an analysis presented in text form must remain incomplete. Nevertheless, to get a comprehensive picture of the platform, certain areas of the Suno.ai website are considered below. I also examine the results of a comparative sound and music analysis of some of the system’s song products, which were created together with students in the above-mentioned seminar.

Short Description: A screenshot of the "About" section from the website Suno, describing how it enables anyone to create music. Long Description: The image is a screenshot of the "About" section of the Suno website. The text reads: "Suno is building a future where anyone can make great music. Whether you're a shower singer or a charting artist, we break barriers between you and the song you dream of making. No instrument needed, just imagination. From your mind to music." The background is dark, and there are navigation buttons for "About," "Team," and "Blog" at the top, as well as a "Make a song" button on the far right.
Figure 5: Screenshot of Suno.ai’s About webpage

Even a glance at the about section of the website reveals highly relevant information. The software company’s mission statement shows remarkable parallels to current core concerns in music education. In metaphorically charged language, the company presents itself as a pioneering songwriting platform that enables agency, empowerment, and accessibility for anyone without being dependent on previous musical experience or knowledge (see Figure 5).

Short Description: A screenshot of the Suno platform showcasing trending songs and music information, with the "Give it to me (Suno)" track highlighted. Long Description: The image is a screenshot from the Suno platform, showing the "Suno Showcase" section and "Trending Songs" on the left side. The song "Give it to me (Suno)" by Mr.Tree is featured at the top, showing the song's stats such as play count (468281) and likes (5,103). Next to it, another track titled "Suno" by yolkhead is displayed with a play count of 153556 and 2,10 likes. The right side features additional information about the song "Give it to me (Suno)" with its genre and a glowing "Suno" neon logo. In the "Trending" section beneath are several other songs listed. The interface includes buttons for playing songs, viewing playlists, and navigating to other sections like "Create," "Library," "Explore," and "Search."
Figure 6: Homesite and trending charts of Suno.ai (on November 24, 2024)

The home website of suno.ai initially presents itself to the registered user in a familiar format, as the design and arrangement of the individual content is strongly reminiscent of established music streaming services (e.g., Apple Music and Spotify). Various functions and menus can be selected in the left-side area, while already generated songs and statistical background information (likes, playback count) are displayed in a tile design in the middle area. More detailed context information on a selected song is displayed on the right (style, genre, lyrics, etc.). The song content on the website is organized using different subheadings and categories (trends, playlists, genres, etc.).

Short Description: A screenshot of the Suno platform's "Create" page, where users can generate music lyrics and select styles of music. Long Description: The image is a screenshot of the Suno platform's "Create" page. The left side of the interface displays options to generate lyrics and choose music styles. • The "Lyrics" section has a text box for entering custom lyrics or describing a song. There is a button labeled "Make Random Lyrics" to generate lyrics automatically, along with a character count indicator. • Below the "Lyrics" section, there is an option to upload audio or select an instrumental track. • The "Style of Music" section allows users to enter specific music genres or styles such as "punk," "blues," "80s," "pop rock," and "emotional." • Below that, there is a text box for entering a title for the song. • A "Create" button is located at the bottom to generate the music based on the provided input. The sidebar includes navigation buttons for "Home," "Create," "Library," "Explore," and "Search," and displays the user's credits and subscription status.
Figure 7: User interface of Suno.ai in custom mode

Even if the service initially looks like a music streaming provider, the “Create” menu tab indicates that the platform is not just about the reception, but also about the AI-supported generation of music. The button therefore also leads to a multi-section input interface that initially offers the option of formulating a voice-based “Song Description” prompt. In addition to this text input field, there are also two selector switches on the page. The “Instrumental” selector switch positioned below the input field is used to specify the production of a music file without vocals or lyrics. The second selector switch positioned above the text input field opens the so-called “Custom” mode, which reveals several other input fields. Here the user is offered various options for a more differentiated description of the desired end product. The “Lyrics” field offers the option of entering one’s own lyrics as the basis for the song to be created. A button below the field also offers the option of having lyrics created automatically (“Make Random Lyrics”). In the field below, the user has the option of defining musical genres or “styles” as a guideline for the track to be created. The style examples presented here show a clear orientation towards established popular music genres (“phonk,” “blues,” “pop rock”) but also include chronological references (“80s”) and the effect potentially triggered by the generated music (“emotional”). Finally, a title for the track can be defined in the last free field above the “Create” button.


Figure 8: Suno.ai’s explore mode

Regarding the actual purpose of suno.ai, the generation of songs from a text prompt, the general public’s assessment of the results is remarkably positive. Brian Hiatt (2024) of the rollingstone.com blog described the results as “shockingly convincing songs” (2024, 1). A comprehensive evaluation of the sonic dimension of Suno.ai’s products is beyond the scope of this article due to the vast number of songs available in diverse stylistic combinations (see Figure 8). However, an analysis of selected song products revealed several tendencies regarding inherent dispositions of Suno.ai’s musical habitus. Regardless of the chosen genre, the analyzed songs exhibited a very high degree of “loudness” (Vickers 2010), resulting in a significantly heightened perceived loudness. This compression minimized the differences between the loudest and softest parts of the track, potentially reducing the overall dynamic expressiveness of the music. Vocals usually appeared to be layered, which led to a high sonic density regardless of the genre selected. This sound aesthetic created a rich and textured vocal sound, but also led to a somewhat artificial impression in most examples, which is also described in the literature as the uncanny valley (Männistö-Funk and Sihvonen 2018). In a lot of songs, the vocals were also more or less obviously oriented towards an autotune aesthetic (Phillips 2021). Both vocals and solo parts such as string, brass, synthesizer, or guitar parts often featured digital artifacts and noise elements reminiscent of MP3 compression effects. These artifacts highlight the high amount of digital processing involved in the production.

While AI-based text and image data revealed a clear tendency towards Western classical musical dispositions, concepts, subjects, and lesson content, Suno.ai’s musical habitus consisted of a strong orientation towards contemporary popular music culture through the design of the website and the possible style and genre criteria for the generation of songs. Accordingly, most of the trending songs listed on the home webpage and the so-called new styles that the service offered via it’s explore webpage follow mainstream genre logic (e.g., rock, electro, soul, trap, metal, vapor jazz). Furthermore, the generated songs are presented primarily as personalized and endless abundance of consumer goods across the entire platform.

Comparative Discussion of the Interpretation Results

At first glance, the impressive results achieved by AI software in terms of text, image, and music data are undeniable. However, the positive first impression is put into perspective in view of the analysis presented here. First, it is important to note that the various AI technologies are not characterized by a single habitus of the machine, but by different habitus in the plural, since each technology has its own “cultural dispositions and inclinations” (Airoldi 2022, 113). As for direct applications in the field of music education, ChatGPT and Midjourney exhibited a hierarchical and mechanistic perspective on music teaching and learning. This tendency manifested in the structure of lessons (lesson plans, teacher-centered approaches, and knowledge transfer) as well as in the dispositions embedded in Midjourney’s visual materials regarding music educational practice (homogeneous ensembles and imagery focusing on conducted performances). This interpretation confirms Selwyn’s (2019) statement that “technologies carry implicit assumptions about what education is, and in whose interests education operates” (103), even when, as with the technologies discussed here, they were not primarily designed for educational purposes. In addition, ChatGPT and Modjourney produced a high amount of Western classical content. This disposition became evident in the learning content suggested by ChatGPT as well as in the instruments depicted by Midjourney. Additionally, the musical practices presented largely center on the reproduction of preexisting works. This emphasis is highlighted by the presence of musical artifacts such as batons and music stands as well as by the frequent inclusion of scores, sheet music, and musical notation in both text and imagery.

In contrast, Suno.ai primarily exhibited a concentration on popular music culture, which is evident not only in the design of the online platform but also in the content it produces. At the same time, there is also a clear tendency towards standardization regarding the heavy focus on stereotypical genre labels and the dispositions embedded in the songs’ sonic aesthetics.

Hristova (2023), referencing Bourdieu’s concept of habitus, noted that such a standardization tendency in the algorithmic technologies can lead to habitual adjustment effects concerning listening habits and user preferences. The author implied that the content’s repetitive and narrow focus may shape and limit users’ tastes and preferences, potentially leading to more and more constrained musical experiences: “Here technology continues to condition the habitus through the enforcement of taste: a taste is implied, habituated, and further amplified through the use of filter bubbles that favour more of the same” (Hristova 2023, 111).

In the case of Suno.ai, this standardization process may lead to a reinforcement and consolidation of musical preferences in two significant ways. Firstly, the platform’s design closely resembles established streaming services, which could result in users gravitating towards familiar styles. Secondly, the “Create” page further promotes adherence to existing style and genre conventions through its prompt-based selection of specific, pre-existing styles during the content generation process. Consequently, the platform suppresses idiosyncratic style combinations or creative thinking beyond genres and conventional classifications, both at the interface level and within the algorithm’s software architecture. Romele (2024) also concluded that the repetitive and continuous interaction between individuals and digital algorithms may lead to “flattening individuals into their own sameness” (128). Such a development, which is triggered and continually shaped by musical habitus of machines, would inevitably increase existing inequality and discrimination against non-dominant or marginalized forms of musical practice. Moreover, this dynamic is likely to be exacerbated by the increasing prevalence of algorithmically governed interactions. Eynon (2023) differentiated the concept of “algorithmic bias” and emphasized that the specific opacity of algorithmic systems poses a fundamental threat in the context of teaching and learning processes in formal education (254).

Another significant aspect arising from this analysis is the efficiency-driven emphasis on products over processes, which could profoundly influence the conceptualization of professionalization in music education. The sole focus on product-level outcomes may obscures the complex processes involved in planning and designing music educational formats and classroom materials as well as composing songs. The platform economy emphasizes the rapid reusability and consumption of digital mass products, overshadowing the nuanced engagement required for meaningful educational practice (Selwyn 2019). With the global hype surrounding artificial intelligence, educators and students alike are asking themselves whether they can afford to ignore these technological tools without being left behind (Hadi Mogavi et al. 2024). In an era marked by teacher shortages, shrinking school budgets, and reduced support for the arts, such efficiency-driven educational models pose a serious and realistic threat to music education as a whole. Such a dynamic is culminating in a “hegemony of algorithms” (Bost and Wang 2016, 34), in which the influence of technology is becoming omnipresent and increasingly shaping practices and standards in all educational fields. This criticism is all the more important given that all the services discussed here require at least a personalized user account and, in the medium term, a subscription payment model in order to be used. The dominance of a few global tech giants and their lack of transparency regarding the underlying principles guiding the development and marketing of such systems aggravate the problem. The comprehensive impact of such concentrations of power also means that people across the board feel powerless in the face of such powerful transformation processes. In this context, Romele (2024) states: “Instead of claiming a greater right in terms of participation, social groups and cultures passively accept innovations in and the implementation of AI systems as planned and carried out by experts” (140).

Consequences of Musical Habitus of Machines for Music Education

In music education, addressing the challenge of transforming traditional hegemonic practices into more inclusive approaches remains a formidable task for educators across the globe. The stabilization of musical habitus of machines that are focused on a few musical practices (whether Western classical or mainstream pop) therefore also threatens the diversity and breadth of music education worldwide (Hess 2020; Varkøy 2017).

Against the background of Bourdieu’s conceptual tools at the center of this text, the interpretation results illustrate how the field of music education can be reshaped by the influence of AI tools such as ChatGPT, Midjourney, and Suno.ai. These findings suggest that generative AI systems contribute to the intensification of social stratification by reinforcing the reproduction of “objectified and institutionalized cultural capital” (Swartz 1997, 77).

Such technological transformation processes are also closely linked to capital shifts. The privileging of Western classical music reflects the strengthening of cultural capital, while the polished aesthetics of AI-generated outputs consolidate symbolic capital by associating these tools with professionalism and legitimacy (Sagiv and Hall 2015, 113–14). Economic capital is becoming a decisive factor, as subscription models restrict access to these technologies based on financial means, thus reinforcing inequalities within the field of music education. Social capital is similarly affected, as algorithmic systems encourage interactions that align with standardized outcomes, marginalizing collaborative practices that deviate from these norms (Benedict and O’Leary 2019, O’Leary 2023).

The discussion emphasized how algorithmic systems influence musical taste and change the learners’ habitus. By reinforcing familiar patterns and limiting diversity, these systems contribute to symbolic violence, as they stabilize dominant musical norms and marginalize less established forms of expression. Furthermore, the growing influence of efficiency and market-oriented thinking in the educational context redefines creativity as something that can be measured and optimized. This trend risks undermining the autonomy of music education by aligning it too closely with economic objectives, leaving less room for critical, aesthetic and cultural exploration (Powell et al. 2017).

How to Escape the Musical Habitus of the Machine

In view of the alarming perspectives that have emerged in the discussion, the question arises as to how a small and already marginalized school subject like music could escape this “coming wave” (Suleyman 2023). In the following, I present several strategies and approaches that demonstrate how music education may actively engage with the seemingly unstoppable process of technological transformation through AI technology. The ideas are based on studying McQuillan (2022) and Selwyn (2019), whose general considerations regarding critical approaches to AI I apply to music education. An artistic perspective on the implications of increasing technologization offers unique and promising opportunities for critical discourse and active participation in shaping socio-technological transformation processes far beyond the field of music education. Rather than writing about a technological change that encompasses all areas of life from the perspective of a marginalized school subject, music education could play an exceedingly influential role in interdisciplinary discourses such as critical media education and the (re-)attainment of agency in the post-digital age.

Promoting a Critical Approach to AI in Teacher Training

AI data reflects historically entrenched issues in music education, or as Holly Herndon puts it: “Well, AI is just us” (McDermott 2020, para. 24). However, this “glimpse into the data mirror” provides us with an opportunity to become aware of the dispositions ingrained in AI-generated data—dispositions that reflect our own practices, the dominance of certain educational approaches, instruments, and institutionalized forms in teaching and learning. This offers critical perspectives for students on highly relevant music educational, ethical, and socio-political issues (e.g., racism, inclusion, marginalization of minorities and their musical practices). It also provides an opportunity for a critically constructive engagement with AI technologies, starting with confronting preservice music teachers with technical background knowledge of how AI systems work, including their limitations and socio-political implications, as well as refuting the notion that AI is inherently superior to human judgment.

Promoting the Appreciation of Human Creativity, Artistic Practice and Subversive Ways of Technology Use

The music classroom appears to be an ideal setting for emphasizing the uniqueness of human perception in experiencing emotional depth and originality in music—regardless of whether it was generated with or without AI support. However, it is also essential to address critical questions regarding the authenticity and (legal) authorship of artistic products. The exploration and discussion of AI and technology-critical forms of musical expression and corresponding artists also proves to be a fruitful field for engagement in music education. Borkowski (2023) shows, for example, how different female musicians actively engaged with AI-based voice assistants in their work. He concluded that such “artistic practices therefore problematize the narratives of tectonic paradigm shifts and sensory revolutions” (Borkowski 2023, 149). This could also include encouraging students to hack and repurpose AI tools creatively. Using AI-generated outputs as raw materials for artistic transformation could be a fruitful starting point to developing counter-narratives that critique or subvert dominant AI-driven narratives. Students could even be empowered to use art as a form of resistance.

Fostering Collective and Collaborative Approaches

Encouraging collaborative projects and community building among students can help resist the individualistic and competitive nature of AI-driven metrics (Selwyn 2019, 69). These activities emphasize teamwork and collective creativity, fostering a supportive educational environment. Andrejevic and Selwyn (2023) pointed out that this does not necessarily have to go hand in hand with AI abstinence, instead calling for a discussion about “community‐oriented automation” (229) in the classroom.

Advocacy for Ethical Discussions and Structural Changes

Ethical discussions about AI’s implications, including bias, surveillance, and data privacy, should be integrated into the curriculum. Educators should promote awareness of policy and regulatory issues surrounding AI and advocate for changes that protect artists’ rights and ensure ethical AI development. The importance of fair working conditions and job security for artists and educators should also be emphasized in this context.

Conclusions and Outlook

In this article I examined the dispositions inherent in AI-generated data relevant to music education and music classroom practice and first delineated and elaborated on Bourdieu’s theoretical tool set. Subsequently, through a concretization of the musical habitus of the machine, three distinct types of data were qualitatively analyzed to uncover embedded dispositions and recurring patterns. The analysis revealed that AI-generated data tends to present a relatively narrow conception of music, which perpetuates longstanding inequalities and marginalization, potentially leading to the homogenization of music education content and practices. This critical reflection showed that there is a lot of potential to support teacher development, encourage critical thinking about technology, and improve teacher training and practice in music education.

Even if the first steps towards dealing with the music educational consequences of machine habitus have been taken, further questions need to be asked. These issues range from copyright and property rights, which have already received considerable attention, , to the possible marginalization of non-verbal and embodied aspects of musical practice. Given that AI technologies often focus on text-based inputs and language processing (Yildirim and Paul 2024), there is a risk of oversimplifying musical practice and overlooking the rich nuances of diverse performance traditions and implicit knowledge. To promote more open and inclusive musical habitus in machines, music teachers and music educators should be involved in the development and training of AI systems, as well as in critical reflections on their potentials and challenges in music education. This collaborative approach could ensure that AI tools reflect a diverse array of musical practices.

Furthermore, cultivating critical AI literacy among educators is essential for responsibly engaging with these technologies (Kress 2003; Leander and Burriss 2020). It is also crucial to continually reassess the role of AI in music education, ensuring its use aligns with educational and ethical objectives rather than market pressures. It is now evident that the scientific community should reconsider its role in perpetuating the “modernist technological acceleration” (Bock et al. 2024, 17), by constantly introducing new technologies and developments into practice as the norm. Increasingly, teachers at the grassroots level are expressing a “a desire for deceleration” (Bock et al. 2024, 17) and for reasonable “technoskepticism in schools” (Pleasants, Krutka, and Nichols 2023). Music education must assert its own priorities, resisting the commodification of education and ensuring that technological tools enhance rather than overshadow the learning experience. In this regard music classrooms could serve as spaces that inspire students to thoughtfully explore the dynamic relationship between technology and musical expression. By fostering a critical and constructive attitude towards technologies such as artificial intelligence, music education can empower learners to creatively question and overcome existing technological dependencies, rather than perpetuate them.


About the Author

Johannes Treß is a music educator, researcher and composer/performer specializing in post-digital music education, maker music education, AI in music education and creative musical practices. He is co-leader of a project on post-digital music teacher education. His dissertation from 2022 on group improvisation in music education was graded summa cum laude and his scientific work has also received numerous awards. Since May 2023, he has been junior professor of music at the University of Education, Freiburg. Treß studied school music with a focus on jazz/pop and performs internationally as a saxophonist, live electronics musician, theatre musician and composer.


References

Airoldi, Massimo. 2022. Machine habitus: Toward a sociology of algorithms. Polity Press.

Andrejevic, Marc, and Neil Selwyn. 2023. Education in an era of pervasive automation. Postdigital Science and Education 5 (1): 220–31. https://doi.org/10.1007/s42438-022-00336-x

Bearman, Margaret, and Rola Ajjawi. 2023. Learning to work with the black box: Pedagogy for a world with artificial intelligence. British Journal of Educational Technology 54 (5): 1160–73. https://doi.org/10.1111/bjet.13337

Benedict, Cathy, and Jared O’Leary. 2019. Reconceptualizing “Music Making:” Music technology and freedom in the age of neoliberalism. Action, Criticism, and Theory for Music Education 18 (1): 26–43. https://doi.org/10.22176/act18.1.26

Bock, Annekatrin, Felicitas Macgilchrist, Kerstin Rabenstein, and Nadine Wagener-Böck. 2024. Hoping for community in a technologically decelerated world—A critical utopian approach. Futures: 103434. https://doi.org/10.1016/j.futures.2024.103434.

Borkowski, Alex. 2023. Vocal aesthetics, AI imaginaries: Reconfiguring smart interfaces. Afterimage 50 (2): 129–49. https://doi.org/10.1525/aft.2023.50.2.129

Bost, Marcia, and Xiaobo Wang. 2016. The unboxing of techné: Thinking outside the algorithms. Computers & Writing Proceedings 17: 33–40.

Bourdieu, Pierre. 1977. Outline of a theory of practice. Cambridge University Press.

Bourdieu, Pierre. 1984. Distinction: A social critique of the judgement of taste. Harvard University Press.

Bourdieu, Pierre. 1990. The logic of practice. Stanford University Press.

Bourdieu, Pierre. 1998. On television. New Press.

Bower, Matt, Jodie Torrington, Jennifer W. M. Lai, Peter Petocz, and Mark Alfano. 2024. How should we change teaching and assessment in response to increasingly powerful generative artificial intelligence? Outcomes of the ChatGPT teacher survey. Education and Information Technologies 29: 15403–15439. https://doi.org/10.1007/s10639-023-12405-0

Burnard, Pamela, Ylva Hofvander Trulsson, and Johan Söderman, eds. 2015. Bourdieu and the sociology of music education. Routledge. https://doi.org/10.4324/9781315569819

Chiu, Thomas K. F. 2023. The impact of generative AI (GenAI) on practices, policies and research direction in education: A case of ChatGPT and Midjourney. Interactive Learning Environments September: 1–17. https://doi.org/10.1080/10494820.2023.2253861

Christin, Angèle. 2020. The ethnographer and the algorithm: Beyond the black box. Theory and Society, 49 (5–6): 897–918. https://doi.org/10.1007/s11186-020-09411-3

Costa, Cristina, Ciaran Burke, and Mark Murphy. 2019. Capturing habitus: Theory, method and reflexivity. International Journal of Research & Method in Education 42 (1): 19–32. https://doi.org/10.1080/1743727X.2017.1420771

Czerniewicz, Laura, and Cheryl Brown. 2013. The habitus of digital ‘Strangers’ in higher education. British Journal of Educational Technology 44 (1): 44–53. https://doi.org/10.1111/j.1467-8535.2012.01281.x

Edgerton, Jason D., and Lance W. Roberts. 2014. Cultural capital or habitus? Bourdieu and beyond in the explanation of enduring educational inequality. Theory and Research in Education 12 (2): 193–220. https://doi.org/10.1177/1477878514530231

Eynon, Rebecca. 2023. Algorithmic bias and discrimination through digitalisation in education  A socio-technical view. In World yearbook of education 2024: Digitalisation of education in the era of algorithms, automation and artificial intelligence, edited by Ben Williamson, Janja Komljenovic, and Kalervo Gulson, 245–60. London: Routledge. https://doi.org/10.4324/9781003359722.

Hadi Mogavi, Reza, Chao Deng, Justin Juho Kim, Pengyuan Zhou, Young D. Kwon, Ahmed Hosny Saleh Metwally, Ahmed Tlili, et al. 2024. ChatGPT in education: A blessing or a curse? A qualitative study exploring early adopters’ utilization and perceptions. Computers in Human Behavior: Artificial Humans 2 (1): 100027. https://doi.org/10.1016/j.chbah.2023.100027

Hall, Clare. 2015. Singing gender and class: Understanding choirboys’ musical habitus. In Bourdieu and the sociology of music education, edited by Pamela Burnard, Ylva Hofvander Trulsson, and Johan Söderman, 43–59. Routledge.

Hess, Juliet. 2020. Music education and the colonial project: Stumbling toward anticolonial music education. In Proceedings of the 2020 AERA Annual Meeting. AERA. https://www.aera.net/Publications/Online-Paper-Repository/AERA-Online-Paper-Repository-Viewer/ID/1567504

Hiatt, Brian. 2024, March 17. A ChatGPT for music is here. Inside suno, the startup changing everything. Rolling Stone. https://www.rollingstone.com/music/music-features/suno-ai-chatgpt-for-music-1234982307/

Ho, Jonathan., Ajay Jain, and Pieter Abbeel. 2020. Denoising diffusion probabilistic models. In Proceedings of the 34th International Conference on Neural Information Processing Systems (NIPS ’20). Article 574: 6840–6851. http://arxiv.org/abs/2006.11239

Hristova, Stefka. 2023. Habits and habitus in algorithmic culture. In Handbook of critical studies of artificial intelligence, edited by Simon Lindgren, 108–16. Edward Elgar Publishing. https://doi.org/10.4337/9781803928562.00014

Kasneci, E., Sessler, K., Küchemann, S., Bannert, M., Dementieva, D., Fischer, F., Gasser, U., Groh, G., Günnemann, S., Hüllermeier, E., Krusche, S., Kutyniok, G., Michaeli, T., Nerdel, C., Pfeffer, J., Poquet, O., Sailer, M., Schmidt, A., Seidel, T., … Kasneci, G. 2023. ChatGPT for good? On opportunities and challenges of large language models for education. Learning and Individual Differences 103: 102274. https://doi.org/10.1016/j.lindif.2023.102274

Kim, Jin Hyun, and Uwe Seifert. 2006. Embodiment: The body in algorithmic sound generation. Contemporary Music Review 25 (1–2): 139–49. https://doi.org/10.1080/07494460600647592

Kress, Gunther R. 2003. Literacy in the new media age. Routledge.

Kuckartz, Uwe, and Stefan Rädiker. 2023. Qualitative content analysis: Methods, practice and software. SAGE Publications.

Leander, Kevin M., and Sarah K. Burriss. 2020. Critical literacy for a posthuman world: When people read, and become, with machines. British Journal of Educational Technology 51 (4): 1262–76. https://doi.org/10.1111/bjet.12924

Männistö-Funk, Tiina, and Tanja Sihvonen. 2018. Voices from the Uncanny Valley: Voices from the Uncanny Valley. Digital Culture & Society 4 (1): 45–64. https://doi.org/10.14361/dcs-2018-0105

Maton, Karl. 2012. Habitus. In Pierre Bourdieu: Key concepts, edited by Michael Grenfell, 48–64. Routledge.

McDermott, Emily. 2020, January 7. Holly Herndon on her AI baby, Reanimating tupac, and extracting voices. ARTnews.com (blog). https://www.artnews.com/art-in-america/interviews/holly-herndon-emily-mcdermott-spawn-ai-1202674301/

McQuillan, Dan. 2022. Resisting AI: An anti-fascist approach to artificial intelligence. Policy Press. https://doi.org/10.1332/policypress/9781529213492.001.0001

National Association for Music Education. 2024. AI and music education. https://nafme.org/blog/ai-in-education/

Nash, Roy. 2003. Social explanation and socialization: On Bourdieu and the structure, disposition, practice scheme. The Sociological Review 51 (1): 43–62. https://doi.org/10.1111/1467-954X.00407

O’Leary, Emmet J. 2023. Music Education on youtube and the challenges of platformization. Action, Criticism, and Theory for Music Education 22 (4): 14–43. https://doi.org/10.22176/act22.4.14

Papacharissi, Zizi, and Emily Easton. 2013. In the habitus of the new. In A companion to new media dynamics, 167–84. John Wiley & Sons. https://doi.org/10.1002/9781118321607.ch9

Phillips, Matthew T. 2021. Soundcloud rap and alien creativity. Journal of Popular Music Studies 33 (3): 125–44. https://doi.org/10.1525/jpms.2021.33.3.125

Pleasants, Jacob, Daniel G. Krutka, and T. Philip Nichols. 2023. What relationships do we want with technology? Toward technoskepticism in schools. Harvard Educational Review 93 (4): 486–515. https://doi.org/10.17763/1943-5045-93.4.486

Prior, Nick. 2011. Critique and renewal in the sociology of music: Bourdieu and beyond. Cultural Sociology 5 (1): 121–38. https://doi.org/10.1177/1749975510389723

Powell, Bryan, Gareth Dylan Smith, and Abigail D’Amore. 2017. Challenging symbolic violence and hegemony in music education through contemporary pedagogical approaches. Education 3-13 45 (6): 734–43. https://doi.org/10.1080/03004279.2017.1347129

Rehbein, Boike. 2016. Die Soziologie Pierre Bourdieus (The sociology of Pierre Bourdieu). UTB.

Rimmer, Mark. 2006. Songs In the key of life: The musical habitus and young peoples community music participation. https://core.ac.uk/reader/40072800

Rimmer, Mark. 2010. Listening to the monkey: Class, youth and the formation of a musical habitus. Ethnography 11 (2): 255–83. https://doi.org/10.1177/1466138109339115

Rimmer, Mark. 2012. Beyond omnivores and univores: The promise of a concept of musical habitus. Cultural Sociology 6 (3): 299–318. https://doi.org/10.1177/1749975511401278

Romele, Alberto. 2024. Digital habitus: A critique of the imaginaries of artificial intelligence. Routledge.

Sagiv, Dan, and Clare Hall. 2015. Producing a classical habitus: Reconsidering instrumental music teaching methods. In Bourdieu and the sociology of music education, edited by Pamela Burnard, Ylva Hofvander Trulsson, and Johan Söderman, 113–26. Routledge. https://doi.org/10.4324/9781315569819

Seaver, Nick. 2017. Algorithms as culture: Some tactics for the ethnography of algorithmic systems. Big Data & Society 4 (2): 205395171773810. https://doi.org/10.1177/2053951717738104

Seaver, Nick. 2022. Computing taste: Algorithms and the makers of music recommendation. University of Chicago Press. https://doi.org/10.7208/chicago/9780226822969.001.0001

Selwyn, Neil. 2019. Should robots replace teachers? AI and the future of education. Digital Futures Series. Polity Press.

Söderman, Johan, Pamela Burnard, and Ylva Hofvander-Trulsson. 2015. Contextualising Bourdieu in the field of music and music education. In Bourdieu and the sociology of music education, edited by Pamela Burnard, Ylva Hofvander Trulsson, and Johan Söderman, 1–12. Routledge. https://doi.org/10.4324/9781315569819

Sterne, Jonathan. 2003. Bourdieu, technique and technology. Cultural Studies 17 (3–4): 367–89. https://doi.org/10.1080/0950238032000083863a

Suleyman, Mustafa. 2023. The coming wave. Crown.

Swartz, David. 1997. Culture and power: The sociology of Pierre Bourdieu. University of Chicago Press. https://doi.org/10.7208/chicago/9780226161655.001.0001

Treß, Johannes. 2024. Augmentierte Unterrichtsplanung auf dem Prüfstand. Eine explorative Studie mit Musiklehrkräften zur KI-gestützten Unterrichtsplanung (Augmented lesson planning put to the test: an exploratory study with music teachers on AI-supported lesson planning). In Jahresband des Arbeitskreises Musikpädagogische Forschung / 45th yearbook of the German Association for Research in Music Education, edited by Bernd Clausen, Julia Ehninger, and Malte Sachsse, 117–29. Waxmann. https://doi.org/10.31244/9783830999188.10

Van Den Berg, Geesje, and Elize Du Plessis. 2023. ChatGPT and generative AI: Possibilities for its contribution to lesson planning, critical thinking and openness in teacher education. Education Sciences 13 (10): 998. https://doi.org/10.3390/educsci13100998

Varkøy, Øivind. 2017. Perspectives on uniformity, sameness, and homogeneity and proclamations of pluralism, diversity, and heterogeneity. Philosophy of Music Education Review 25 (1): 4. https://doi.org/10.2979/philmusieducrevi.25.1.02

Vear, Craig. 2022. Embodied AI and musicking robotics. In The language of creative AI, edited by Craig Vear and Fabrizio Poltronieri, 113–35. Springer. https://doi.org/10.1007/978-3-031-10960-7_7

Vickers, Earl. 2010. The loudness war: Background, speculation and recommendations. https://www.sfxmachine.com/docs/loudnesswar/loudness_war.pdf

Yildirim, Ilker, and L.A. Paul. 2024. From task structures to world models: What do LLMs know? Trends in Cognitive Sciences 28 (5): 404–15. https://doi.org/10.1016/j.tics.2024.02.008

Yu, Jiaxing, Songruoyao Wu, Guanting Lu, Zijin Li, Li Zhou, and Kejun Zhang. 2024. Suno: Potential, prospects, and trends. Frontiers of Information Technology & Electronic Engineering 25 (7): 1025–30. https://doi.org/10.1631/FITEE.2400299


Notes

[1] A detailed presentation of the articles published in the volume would go beyond the scope of this article, so only the texts that explicitly refer to the habitus concept are presented in the short summary. For a further list of diverse music education references to the habitus concept, see also Hall (2015, 47).

[2] There is already a discourse on the question of the embodiment of algorithmically mediated human-machine interaction that should not go unmentioned here (Kim and Seifert 2006; Vear 2022). However, this article focuses on the analysis of AI-generated data material.

[3] Although the teachers developed and worked out some very elaborate and detailed lesson plans with ChatGPT during the interview sessions (Treß, 2024), for reasons of better comprehensibility and exemplarity, only tabular lesson plans created by ChatGPT are presented and analyzed here.

[4]   By default, Midjourney outputs four image variants for a single prompt. For the first example, two screen series of two identical prompts were selected, as two groups of students formulated the same prompt as input in parallel.

[5] The prompts in Midjourney typically start with the command ‘/imagine (…)’.

[6] However, the subject conceived by the students can also be understood as an allegorical metaphor, portraying the Western and particularly Eurocentric musical habitus of the classical-romantic genius as a long threatened and endangered species (much like the dinosaurs themselves).