Google Translate adds 110 new languages using AI, breaking down communication barriers for millions around the world.

The new languages are:

Abkhaz
Acehnese
Acholi
Afar
Alur
Avar
Awadhi
Balinese
Baluchi
Baoulé
Bashkir
Batak Karo
Batak Simalungun
Batak Toba
Bemba
Betawi
Bikol
Breton
Buryat
Cantonese
Chamorro
Chechen
Chuukese
Chuvash
Crimean Tatar
Dari
Dinka
Dombe
Dyula
Dzongkha
Faroese
Fijian
Fon
Friulian
Fulani
Ga
Hakha Chin
Hiligaynon
Hunsrik
Iban
Jamaican Patois
Jingpo
Kalaallisut
Kanuri
Kapampangan
Khasi
Kiga
Kikongo
Kituba
Kokborok
Komi
Latgalian
Ligurian
Limburgish
Lombard
Luo
Madurese
Makassar
Malay (Jawi)
Mam
Manx
Marshallese
Marwadi
Mauritian Creole
Meadow Mari
Minang
Nahuatl (Eastern Huasteca)
Ndau
Ndebele (South)
Nepalbhasa (Newari)
NKo
Nuer
Occitan
Ossetian
Pangasinan
Papiamento
Portuguese (Portugal)
Punjabi (Shahmukhi)
Q’eqchi’
Romani
Rundi
Sami (North)
Sango
Santali
Seychellois Creole
Shan
Sicilian
Silesian
Susu
Swati
Tahitian
Tamazight
Tamazight (Tifinagh)
Tetum
Tibetan
Tiv
Tok Pisin
Tongan
Tswana
Tulu
Tumbuka
Tuvan
Udmurt
Venda
Venetian
Waray
Wolof
Yakut
Yucatec Maya
Zapotec

  • RBG@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    12
    ·
    5 months ago

    Is the accuracy as high as with other aspect of AI, e.g. LLMs answering with 90% correctness?

  • kolorafa@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    7
    ·
    5 months ago

    If only some of the newly added translations was not garbage, adding local dialects that differs by location is useless, it’s more like a gimmick. It just makes it harder pick language as you need to scroll over all of those useless ones…

    It’s probabably just a PR stunt.

    • Fitik@fedia.ioOP
      link
      fedilink
      arrow-up
      8
      arrow-down
      1
      ·
      edit-2
      5 months ago

      @kolorafa@lemmy.world Eh, probably still better than siting with dictionary tring to translate word by word, no? Especially if you don’t know the language For some of these languages there was no machine translation before that at all

      all of those useless ones

      No language is useless, please don’t say that, some are just smaller

      • kolorafa@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        5 months ago

        No language is useless, please don’t say that, some are just smaller

        Yes, sorry, I didn’t check other languages, as in the list there most likely are languages that are still heavily used and/or official country languages.

        Initially I was under the impression that they didn’t get added before because all of them are not official ones and/or used anymore like in the case of “Silesian” that I did check and confirmed that should be avoided as much as possible. It’s a disservice to new generation of people that still (partial) learn it in their own while growing in that region. Then those people go to different region inside their own country and (because they or the region “use that variant”) then have troubles with communicating.

        You have clear example with Poland and Germany. Example, imagine you live in France, you want to go to Poland (or Germany), you go to school (or get some lessons) to Learn Polish (or Germany), you drive to some random region of Poland (and Germany) and … surprise, only in that “small” region they talk not with Polish (or Germany) but their own (one of dozens) variant that they call “Silesian” (or Bavarian). And even if you go and learn “Silesian” (or Bavarian) it most likely will only help maybe partially…

      • kolorafa@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        5 months ago

        I call them useless because this language is a subset of old language that no one uses - or more precise no one should use it any more. It is a regional language from era where every region had their own culture and was occupied by multiple countries so the language is a mix of multiple molded languages that differs by region.

        Why it’s useless (in example of “Silesian”):

        • you can’t use it inside the country as it’s not official language that you can use when conversing inside goverment (yes, in some local regional governmental offices they understand it, but it’s not official)
        • in Poland people use the language “Sląski” (or like google call it “Silesian”) for many different set’s of words and dialects, and they differ to a point that one “Silesian” speaking guy sometimes have issues talking to another “Silesian” speaking guy
        • the translation is not helpful to people using that language as they know the official country language too
        • the translation probably will never be useful to any other person as they can always going to prefer to use official country language (or English)
        • the translation is bad on it’s own (as it’s only one variant of dozens if not more)

        So I’m going to call that language “useless” as other than historical value it should disappear as it this “language” was never fully defined/described and will never due to many “variants”.

        I don’t have an issue that they craeted it, I have an issue that they mixed the this language with other official (useful because still used) languages.

        I hope that in the future they will finally cleanup the bullshit language selector in google translate grouping them somehow to make it easier to quickly select the actually useful languages that people actually use on day-to-day basic.

        better than siting with dictionary for historian? maybe for me as a “native person speaking (one of the variants of) that language”? no, as I already pointed out it’s only one random variant from dozens if not more.