Although it seems likely that Crunchyroll uses an LLM for translation in some way, I wouldn’t call that “confirmed” since that might be the result of an individual translator using it.
The actions of an employee, when reviewed and released by a company, are the actions of that company. A company is just the sum of its employees’ actions.
Also, LLM have been there for a while. So there are a few possible situations
- LLM used is authorized or even encouraged. In this case it’s the company
- LLM use is controlled, and this falls into one of the authorized cases. Same thing really. Also their authorized use cases need review
- LLM use is forbidden, or restricted and this is not an authorized use. In this case it falls on the company to review what’s being done. It’s their responsibility.
So yeah, whatever the situation, it’s on Crunchyroll.
How are subtitles created usually? Are they provided by the source material team, some professional third party that manually transcribes the video, or just fans doing it for free?
See that’s the kicker, for the longest time it was basically all fan translated subtitles, and only recently have payed for translation become the norm.
So it’s really quite pathetic for them to try and save a few bucks by replacing a proper translator with a LLM, given that there are still plenty of passionate fans who would have done it for free. Especially given that translating between Japanese and English in a cultural context heavy situation is something these LLMs are really bad at.
given that there are still plenty of passionate fans who would have done it for free
I’d imagine this is a non-starter from a corporate standpoint. I know if I were in charge I’d be terrified of the idea of just trusting community-submitted subtitles to not have random slurs or something inserted. That said I still think it would be super cool if they’d let people source and use their own subtitle files; I now it’s possible because I have a tampermonkey script that lets me do just that.
That’s the core of the issue, crunchy roll has sat its self as a corporate middleman, buying the rights to distribute shows and then charging consumers a subscription for access.
But they can’t be bothered to do the only actual damn work their position would realistically demand, beyond renting server space; providing translations for the foreign media they’re distributing.
That’s without even discussing the fact that not a single penny users give them will end up in the hands of any of the exploited artists who actually made the shows, since the industry doesn’t work on residuals or any other kind of profit sharing, the licensing fees crunchy roll pays essentially going straight to financiers.
That’s without even discussing the fact that not a single penny users give them will end up in the hands of any of the exploited artists who actually made the shows
That’s quite the assertion. How exactly do you suggest they’re buying the rights to distribute the shows then?
They’re buying them from production committees and other such organizations. Most anime is made on essentially “commission” basis, where a studio is payed a fixed upfront amount by a group of financiers and other interests, who then distribute the show, sell the merch, and license it internationally. Essentially studios and those who work there are payed no residuals or other profit sharing scheme like is common in the American film and television industry.
There is actually a bit of a cartel in that regard, with the third parties that purchase shows from studios having collaborated to suppress the cost of seasons for nearly 2 decades, leading to stagnant wages and rampant overworking of artists as the quality and quantity of work expected increases while the budget stays the same. Increasingly artists at the companies have had to fall back on gig work beyond their standard hours to make ends meet, getting payed by frame in their off hours to make a little extra money, effectively working 16 hour days through this additional work. There is some movement to change this as of late, but, this is still essentially the norm.
Yes, but do you think they’d buy the shows from those production committees and other organizations if people weren’t interested in paying subscriptions to watch them? That’s like saying Bandai doesn’t get money when I buy gunpla from a store like usagunplastore just because usagunplastore already bought the gunpla from Bandai months ago and Bandai isn’t getting more money from that particular purchase.
Animators being horribly underpaid is a different topic entirely.
The people who actually made the show, animators, voice actors, and writers do not get money based on your crunchy rolls subscription, and those production committees that do get money, didn’t make the shows, they just initially financed them.
Assuming the show is based on a manga or light novel, the original artist/writer might if they were lucky enough to negotiate shares in the production committee, but most are not in a position to do so.
For me, what matters, is that the people who made the art get compensated fairly, that they are able to live a good life. That people are encouraged to make art by my consumption of it, and the current system doesn’t do that. It’s a horrific exploitative machine where purchase reward further exploitation of the people who actually put work and effort in to make the art.
You left out the part of the sentence where they actually answer your question.
So pirate the shit and use whatever subtitles you want.
In terms of anime fansubs, it’s normally just great folks in the community. Some got hired by studios. But the studio is meant to provide the subs.
It seems that they have, or at least had in 2023, internal teams that handled the translations. https://www.crunchyroll.com/news/interviews/2023/9/30/international-translation-day-2023
I maintain my own media library and I ensure every file has English and German subtitles. There are a variety of ways to source srt files but when all else fails a machine with enough compute can transcribe video files using open source whisper. After I generate an English srt file from the video I send it to OpenAI to create the German translation.
Is there something similar for manga? Something that can overlay Japanese text on images, similar to what we have on smartphones but for the PC?
If your video file is Japanese language use a whisper model optimized for Japanese. Once it produces the Japanese srt you can get translations from open ai. Use handbrake to add the srt to the file and you’re done. Good luck!
Sorry. By manga I’ve meant image data, like pngs or JPEGs.
As someone who is able to speak Japanese, I’d notice the drop in quality of translation almost instantly.
I never turn on subs anyway when I watch my anime though.
I have to since my partner doesn’t speak Japanese, but half the time I end up having to correct lines for them once or twice, to make things make sense. The non-egregious stuff I don’t even bother with. It’s crazy how amateurish some of the mistakes are, or even what are clearly choices to omit entire sentences, for no reason.
おい、ゆうじ君、海行こうぜ
“Hi Yuji!”
君
As someone who learns japanese. Is that a kanji for a honorific? probably kun? ゆうじ is the name, although weird that it is written in hiragana I guess… But I fail at this one 海行こうぜ
The first Kanji has the one for mother as part of it I think… And the second one is pronounced it ‘i’ so …iikouze ? Let’s go somewhere?
Yes, 君 is ‘kun’ when used as an honorific.
海 is ‘umi’, or sea/ocean. You are correct that the second half of the kanji (母) is the same as the standalone character for mother, but it’s base radical is ⽏, which also just means mother. The first radical, ⺡, means water/ liquid, so you can sort of infer that “water mother” = ocean. Not all kanji work out this nicely with their radical structure, though.
Last part is spot on, ikou (行こう) is the (conjugation?) of iku or ‘to go’ that expresses a suggestion to do, i.e. “let’s (go)”.
I feel like this is a reasonable use of chat gpt.
For YouTube tutorial videos I have no issue with relying on GPT, but I think it’s important to recognize that the translation of art is art. I don’t feel good about the idea of something without a soul or perspective interpolating a work of art from one culture and language into another that might be wildly different from where it started.
That all said, I think Crunchyroll and anyone else using AI art without disclosing it absolutely should be honest about it.
I feel like what makes the most sense and is likely what’s happening is that ChatGPT is being used to do the initial translation, and then a human is auditing that translation and making adjustments. So just a faster way to get the scaffolding and grunt-work out of the way.
they appear to be copying direct translations from chat GPT in to the subtitles, judging by the fact that one of the subtitles said “Chat GPT says:” and then the line in German. People who speak German also noticed that the grammar and sentence structure for many of these shows has been awful and nonsensical at times.
If anyone is doing any sort of oversight, they don’t appear to speak German them selves and are just betting that the output will be accurate and pasting it in.
Someone who spoke German and Japanese fluently enough to do competent oversight could probably translate faster than they could edit and rephrase the work of an LLM, which are notoriously bad at translating languages in a high context situation like dialog in a animated show. LLMs are also generally very bad with high context languages like Japanese, and even worse at translating between them and low context languages like German.
Both translation and subtitles have highly efficient tooling when in the hands of a professional. Translators nowadays use a mix and will build up a dynamic database as they go through a corpus that needs coherence. What’s bad in this instance is not the usage of some AI, but of a badly adapted AI and ultimately of mediocre results which gives an amateurish impression.