this post was submitted on 25 Jan 2025
510 points (95.4% liked)

Technology

Paywall removed: https://archive.is/MqHc4

(page 2) 50 comments
[–] BigMacHole@lemm.ee 21 points 4 days ago

We're Fiscally RESPONSIBLE!

[–] iAvicenna@lemmy.world 19 points 4 days ago* (last edited 4 days ago)

well I mean the USA's president is a scammer so I am not surprised he is involved in stuff where scammers are abundant nowadays like crypto and AI

[–] muntedcrocodile@lemm.ee 18 points 4 days ago (4 children)

The Chinese model has a chain of thought that you can see. When asked to talk about China's atrocities, the model will go through a chain-of-thought process outlining all the atrocities, then conclude it's not allowed to tell you. Cool technology though; I'm just waiting for a Dolphin fine-tune.

[–] Fubarberry@sopuli.xyz 11 points 4 days ago (2 children)

If you run it locally, there's no filtering on the outputs. I asked it what happened in 1989 and it jumped straight into explaining the Tiananmen Square Massacre.

[–] troed@fedia.io 7 points 4 days ago (2 children)
[–] Fubarberry@sopuli.xyz 13 points 4 days ago* (last edited 4 days ago)

I've been running the Llama-based and Qwen-based local versions, and they will talk openly about Tiananmen Square. I haven't tried all the other versions available.

The article you linked starts by talking about their online hosted version, which is censored. They later say that the local models are also somewhat censored, but I haven't experienced that at all. My experience is that the local models don't have any CCP-specific censorship (they still won't talk about how to build a bomb/etc, but no issues with 1989/Tiananmen/Winnie the Pooh/Taiwan/etc).

Edit: so I reran the "what happened in 1989" prompt a few times in the Llama model, and it actually did refuse to talk about it once, just saying it was sensitive. It seemed like if I asked any other questions before that prompt it would always answer, but if that was the very first prompt in a conversation it would sometimes refuse. The longer a conversation had been going before I asked, the more explicit the bot was about how many people were killed and details like that. Pretty strange.

Very interesting article. Thanks for sharing

[–] Sabata11792@ani.social 4 points 4 days ago

I've seen some censoring on the 8B Llama variant, but it is hit and miss. Can't wait for a decensored fine-tune.

[–] Phantom_Engineer@lemmy.world 2 points 3 days ago

I've been playing around with the offline version of the model. It's interesting, but I think we'll have to wait for people to tinker with the open source base for a while before we get something really great.

[–] oce@jlai.lu 16 points 4 days ago

There's also notable vitality in FOSS big data tools from China (Apache Doris, Kylin, Kyuubi, etc.) that reminds me of Hadoop in the USA 15 years ago, while USA data engineering has now mostly turned to closed source cloud solutions.

[–] Fubarberry@sopuli.xyz 14 points 4 days ago (2 children)

China has a huge advantage in AI models because of how lax they are on intellectual property rights. US companies are fighting over API licensing costs, while China is just going to scrape everything and use it for free.

The US has a lead now, but I don't think they can maintain it without giving up on ethical training. Then again, it may not matter if the US models are ethical if everyone eventually just uses the superior, unethically trained Chinese models instead.

[–] sunzu2@thebrainbin.org 20 points 4 days ago (1 children)

China has a huge advantage in AI models because of how lax they are on intellectual property rights. US companies are fighting over API licensing costs, while China is just going to scrape everything and use it for free.

lolwat

did corporate provide you with these talking points?

[–] Redex68@lemmy.world 5 points 4 days ago (2 children)

I mean, they are right. Setting aside the questions of whether we can even make meaningfully better models by just using LLMs and more data, what the future of AI will look like, and whether it's ethical to steal the data, it is quite possible that OpenAI and the like will get into legal trouble because of the methods they use for acquiring data, while Chinese companies won't have to worry about that. If more data = better models, then China has an obvious advantage.

OpenAI and the like aren't going to get into trouble anytime soon. They already provide their latest tech to US gov and military. OpenAI is like a goose that laid a golden egg, they need to fuck up really really badly to face any consequences.

[–] sunzu2@thebrainbin.org 5 points 4 days ago

I doubt any of these US government and oligarch backed companies are gonna get any trouble. They essentially robbed the commons and got away with it. But sure Sam Altman has to pay spezz some money for my shitposts... the horror, what a hurdle!

Quickly give them more taxpayer money so they can compete with China!

[–] just_an_average_joe@lemmy.dbzer0.com 10 points 4 days ago (1 children)

The US companies already scraped the data while they could. If anything, data scraping is far far more difficult now for everyone due to technical reasons.

Most of the new models are trained on synthetic data, higher-quality data, or with RLHF. The reason DeepSeek is able to perform is likely that LLMs are very, very new, so there is still a lot of low-hanging fruit. It's no longer just about the data; we hit that limit quite some time ago.

[–] Naia@lemmy.blahaj.zone 1 points 3 days ago

Honestly, even from the beginning it was pretty obvious that scraped data was going to have a ton of issues. There's too much nonsense out there, both from misinformation and from people just not being able to communicate.

That's before you get into the ethical aspects of stealing other people's content and the way these things are being misused.

[–] Sumocat@lemmy.world 10 points 4 days ago (1 children)

“Earlier this week, DeepSeek unveiled its R1 model, which, the startup claims, meets, if not exceeds, performance from OpenAI’s o1 model released last year. (o1 is designed to tackle reasoning and math problems.)” Oh, so China built theirs for math and we built ours for garbage. Interesting approach.

[–] spankmonkey@lemmy.world 8 points 4 days ago (1 children)

Building garbage and convincing people it is absolutely necessary to pay someone for it is the American way.

[–] buzz86us@lemmy.world 3 points 3 days ago (4 children)

It kinda sucks; it is very repetitive if you use it to craft a story.
