this post was submitted on 07 Feb 2024
Technology
I don't know enough to know whether or not that's true. My understanding was that researchers at Google introduced the transformer architecture in their 2017 paper "Attention Is All You Need." A lot of LLMs, if not most, use a transformer architecture, though you're probably right that a lot of them build on the open source models OpenAI made available. The "generative" part just describes the model generating outputs (as opposed to classification and the like), and "pre-trained" just refers to the training process.
But again, I'm a dummy, so you may very well be right.