• iopq@lemmy.world
    link
    fedilink
    English
    arrow-up
    16
    arrow-down
    1
    ·
    22 hours ago

    Honestly, translating the good articles from other languages would improve Wikipedia immensely.

    For example, the Nanjing dialect article is pretty bare in English and very detailed in Mandarin

    • Echo Dot@feddit.uk
      link
      fedilink
      English
      arrow-up
      16
      ·
      edit-2
      15 hours ago

      You can do that, that’s fine. As long as you can verify it is an accurate translation, so you need to know the subject matter and the target language.

      But you could probably also have used Google translate and then just fine tune the output yourself. Anyone could have done that at any point in the last 10 years.

      • lunarul@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        8 hours ago

        As long as you can verify it is an accurate translation

        Unless the process has changed in the last decade, article translations are a multi-step process, which includes translators and proof-readers. It’s easier to get volunteer proof-readers than volunteer translators. Adding AI for the translation step, but keeping the proof-reading step should be a great help.

        But you could probably also have used Google translate and then just fine tune the output yourself. Anyone could have done that at any point in the last 10 years.

        Have you ever used Google translate? Putting an entire Wikipedia article through it and then “fine tuning” it would be more work than translating it from scratch. Absolutely no comparison between Google translate and AI translations.

        • Echo Dot@feddit.uk
          link
          fedilink
          English
          arrow-up
          1
          ·
          6 hours ago

          Putting an entire Wikipedia article through it and then “fine tuning” it would be more work than translating it from scratch.

          That depends on if you are capable of translating the language if you don’t know the language then the translator will give you a good start.

      • iopq@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1
        ·
        13 hours ago

        Google translate is horrendously bad at Korean, especially with slang and accidental typos. Like nonsense bad.

        • kazerniel@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          1
          ·
          11 hours ago

          Same in Hungarian, machine translation still often gives hilariously bad results. It’s especially prone to mixing up formal and informal ‘you’ within the same paragraph, something which humans never do. At least it’s easy to tell when a website is one of those ‘auto-translated to 30 languages’ content mill.

    • SkunkWorkz@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      1
      ·
      16 hours ago

      I recently have edited a small wiki page that was obviously written by someone that wasn’t proficient in English. I used AI to just reword what was already written and then I edited the output myself. It did a pretty good job. It was a page about some B-list Indonesian actress that I just stumbled upon and I didn’t want to put time and effort into it but the page really needed work done.

    • graphene@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      1
      ·
      12 hours ago

      Wikipedia’s translation tool for porting articles between languages currently uses google translate so I could see an LLM being an improvement but LLMs are also way way costlier than normal translation models like google translate. Would it be worth it? And also would the better LLM translations make editors less likely to reword the translation to make it’s tone better?

      • iopq@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        9 hours ago

        You can use an LLM to reword the translation to make the tone better. It’s literally what LLMs are designed to do