this post was submitted on 16 Oct 2023
24 points (62.5% liked)

Technology

34388 readers
236 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] narwhal@lemmy.ml 2 points 11 months ago (2 children)

What about preserving languages that are close to extinct, but still have language data available? Can LLMs help in this case?

[–] ImpossibilityBox@lemmy.world 5 points 11 months ago (1 children)

Preservation only but not likely any better than a linguistic historian.

But it gets tricky because LLMs only function on HUGE sets of data. LLMs are nothing more than complicated probability engines. Give it the question "What color is the sky?" and the math extracted from the massive databases that it has says the highest probability answer is "Blue". It doesn't actually KNOW the answer it just knows the probabilities of different words.

Without large amounts of data on the dying language current gen LLM's won't be accurate or able to generate reliable answers. Shoot... LLMs can barely generate reliable answers with the massive datasets they currently have.

I strongly recommend anyone even remotely interested in LLMs to read this interactive article:

https://ig.ft.com/generative-ai/

[–] Veraticus@lib.lgbt 1 points 11 months ago

This is a great article, thanks for linking it!

[–] Veraticus@lib.lgbt 1 points 11 months ago

Yeah, that would be a good usage of an LLM!