this post was submitted on 22 Sep 2024
177 points (100.0% liked)

Free and Open Source Software

17799 readers
15 users here now

If it's free and open source and it's also software, it can be discussed here. Subcommunity of Technology.


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] off_brand_@beehaw.org 1 points 6 days ago* (last edited 6 days ago) (2 children)

What's the traffic on invidious? Like, while I don't necessarily agree with the ad-block-block, the profit motive makes sense given their ubiquity. But are there really enough users of alternate YouTube frontends that Google is capturing any meaningful profit? Especially when developer hours are expensive and could be used elsewhere on more valuable projects?

[–] helenslunch@feddit.nl 4 points 6 days ago

Honestly, dev hours are probably a pittance compared to the potential revenue of more ads watched and/or additional YT prem subscriptions.

[–] Moonrise2473@feddit.it 1 points 4 days ago* (last edited 4 days ago) (1 children)

I feel it's just a side effect of them trying to block ai companies stealing large amounts of videos for training models. They see too many downloads from a datacenter IP address and require user login to continue

Openai's whisper often recognizes mangled words as "please like and subscribe" so they're actively stealing videos and their subs (the manually created ones by companies like "caption+ by js", which creators paid hundreds of dollars to make, not the free ones made by Google automatic transcriber or whisper itself) to improve their models so they can make profit

[–] FozzyOsbourne@lemm.ee 1 points 4 days ago (1 children)
[–] Moonrise2473@feddit.it 2 points 4 days ago* (last edited 4 days ago) (1 children)

Stealing, without the quotation marks. If you copy something and profit off it without crediting, compensating or asking permission to who paid for it, it's stealing. We can't downplay it as "but they just downloaded 700k hours of videos and 200k pirated books for training a simple model that they're charging users $20 a month, what's the issue"

If you copy something for personal enjoyment without profiting from it, then it's not stealing.

[–] FozzyOsbourne@lemm.ee 1 points 4 days ago (1 children)

I get your point, it's just hard to give a shit when one amoral megacorp takes some profit away from another. Google owns and profits from YouTube videos and occasionally throws a few pennies to the creators if they haven't broken this week's selection of ever-changing arbitrary rules.

[–] Moonrise2473@feddit.it 1 points 3 days ago

Probably Google just wants to block them not because they care about the creators but because it's costing them bandwidth money.

From the new agreement that they had with warner bros for creating closed captions, it looks like Google is also stealing the subs for training, they had direct access

It just sucks that someone pays hundreds of dollars to have a human create subs for a show, then that is used without credit or permission for training a model (actually whisper accidentally credits moments of silence with the name of the subbing groups used for training)