Comments below Dongxuelian's cover of Tiaolouji
Comments below Dongxuelian's cover of Tiaolouji
As AI Dongxuelian gains increasing attention, a debate has emerged within the fan community. Some argue that AI-singing is superior to the original singer, while others believe such comparisons are disrespectful. The debate reached a peak in late March 2025 when both Dongxuelian (the V-Tuber) and Dongyangxuelian (the AI-singing creator) uploaded their versions of the trending song "Tiaolouji" (跳楼机). Dongyangxuelian's AI cover, posted on March 15th, quickly gained 1.95 million views, while Dongxuelian's version, uploaded two days later, garnered only 541k views. Some fans began to mock Dongxuelian, posting comments such as, "Is this AI voice model trained poorly?" and "I think the 'next-door' version is better."
On March 20th, Dongyangxuelian surprisingly announced on Bilibili that he would be taking a break from creating AI-Dongxuelian videos. In a follow-up post, he emphasized that he did not want to cause any discomfort to the original artist. He urged fans not to post related comments under the Dongxuelian’s videos, acknowledging that his success was partly due to the her support for fan creations, and asked for respectful behavior from fans. Dongyangxuelian’s fans speculated that Dongxuelian may have expressed dissatisfaction and "threatened" him over copyright concerns. On April 19th, Dongyangxuelian uploaded a new AI-Singing, which suggested that they have reached certain agreements.
(updated 21st, April, 2025)
AI Dongxuelian Singing "Tiaolouji"
Dongxuelian Singing "Tiaolouji"
While the Tiaolouji incident brought AI and human performers into direct comparison, it’s important to remember that AI-singing is never fully autonomous — it is, at its core, a collaborative process. Every AI-Singing is shaped by the creative labor of a human trainer who collects vocal datasets, fine-tunes voice models, experiments with emotional delivery, and refines the musical output. Human and AI actors co-create these performances, and it is this collaboration that enables AI-singing to thrive as a participatory, remix-based phenomenon on platforms like Bilibili.
Yet once these co-produced voices enter the public sphere, the framing often shifts from collaboration to competition. In the Tiaolouji case, public discourse focused less on the creative process and more on who “performed better” — the original V-Tuber or her AI-generated double. This shift exposes underlying tensions around authorship, recognition, and economic value, particularly in the context of digital creators whose voices are central to their marketability and professional identity. For performers like Dongxuelian, vocal performance is not just artistic expression — it is integral to their visibility, branding, and income in a platform-based economy.
This contrasts with other uses of AI-generated voice, such as in navigation systems. For example, Gaode Map ( a Chinese software) has invited celebrities to "lend" their voices to AI navigators — and many embrace the opportunity. These collaborations are seen not as threats, but as brand extensions that enhance their visibility.
As Sovits and similar technologies become increasingly accessible online, the line between fan and the creator will continue to blur. Moving forward, the challenge lies not only in technical refinement, but in building shared/ legal norms around attribution, emotional respect, and platform ethics. Can AI-generated voices coexist with their human counterparts without displacing them? Can fan creators and figures like Dongxuelian co-produce music?
As a researcher and a fan, I see the potential for AI-singing to become a posthuman music form—not one that replaces human agency, but one that reconfigures how we imagine creativity, labor, and voice itself. But I also see substantial challenges. Current legal frameworks have not kept pace with these developments, leaving questions of ownership, authorship, and voice rights unresolved. Content creators often have to navigate a grey zone where innovation and ethical ambiguity coexist. Without clearer guidance from platforms or policy, community shared norms remain the primary mode of governance — and they’re somehow fragile.
Ultimately, the future of AI-singing depends not just on better models or technologies, but on better conversations between creators, human voice "owners", algorithms, fans, and the human and more-than-human voices we’re still learning how to hear.