this post was submitted on 17 Sep 2023
663 points (92.2% liked)
Compression and transmission of data cause loss of parity. We lose or flip some 1s and 0s. Over time the effects become very noticeable. The best visual example I can think of is the experiments where YouTubers downloaded and reuploaded their own video 100 times; it degrades very quickly. In a more reasonable scenario, near-lossless file types and compression would degrade much more slowly.
That just means YouTube's software uses lossy compression. That is a YouTube problem, not a digital media problem. Are you familiar with the concept of file hashing? A short string can be derived from a file, such that if any bit of the file is altered, it will produce a different hash. This can be used in combination with other methods to ensure perfect data consistency. For example, a torrent that remains well seeded won't degrade, because the hash is checked by the software, so if anyone's copy changes at all due to physical degradation of a hard drive or whatever other reason, the error will be recognized and routed around. If you don't want to rely on other people to preserve something, there is always RAID, a decades-old technology that also avoids data changing or being lost, assuming that you maintain your hardware and replace disks as they break.
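To make the hashing point concrete, here is a minimal sketch in Python using SHA-256 from the standard library. The payload and the helper names (`sha256_hex`, `flip_bit`) are made up for illustration; torrent clients use their own piece-hashing schemes, so treat this as a demonstration of the principle only.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of data as a hex string."""
    return hashlib.sha256(data).hexdigest()


def flip_bit(data: bytes, bit_index: int) -> bytes:
    """Return a copy of data with exactly one bit inverted."""
    buf = bytearray(data)
    buf[bit_index // 8] ^= 1 << (bit_index % 8)
    return bytes(buf)


# Hypothetical stand-in for a real file's contents (40,000 bytes).
original = b"pretend this is a game ROM or video file" * 1000
damaged = flip_bit(original, 12345)

print(sha256_hex(original))
print(sha256_hex(damaged))
print("hashes match:", sha256_hex(original) == sha256_hex(damaged))
```

The two inputs are the same length and differ in a single bit, yet the digests do not match, which is what lets software notice a damaged copy without comparing against the original byte by byte.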
Here's the fundamental reason you're wrong about this: computers are capable of accounting for every bit, conclusively determining if even one of them has changed, and restoring from redundant backup. If someone wants to perfectly preserve a digital file and has the necessary resources and knowledge, they can easily do so. No offense but what you are saying is ignorant of a basic property of how computers work and what they are capable of.
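A rough sketch of the "detect a change, then restore from a redundant copy" idea, in Python. It assumes you recorded a trusted SHA-256 hash when the file was first archived; the `repair` function and the in-memory "replicas" are hypothetical stand-ins for real disks or peers, not how any particular RAID or torrent implementation works.

```python
import hashlib


def digest(data: bytes) -> str:
    """SHA-256 digest of data as a hex string."""
    return hashlib.sha256(data).hexdigest()


def repair(replicas: list[bytes], known_good_hash: str) -> list[bytes]:
    """Replace every replica that fails the hash check with one that passes."""
    good = next((r for r in replicas if digest(r) == known_good_hash), None)
    if good is None:
        raise RuntimeError("every replica is damaged; nothing to restore from")
    return [r if digest(r) == known_good_hash else good for r in replicas]


# Hypothetical archived file and the hash recorded at archive time.
original = bytes(range(256)) * 64
reference = digest(original)

# Simulate one bit of silent corruption on the second replica.
corrupted = bytearray(original)
corrupted[100] ^= 0x01
replicas = [original, bytes(corrupted), original]

repaired = repair(replicas, reference)
print("all replicas verified:", all(digest(r) == reference for r in repaired))
```

Data is only lost here if every replica fails the check at the same time, which is the case the `RuntimeError` branch covers.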
It's the most obvious example of a digital media problem. Computers might be able to account for every bit with the use of parity files and backups with frequent parity checks, but the fact is most people aren't running a server with 4 separately powered and monitored drives as their home computer, and even the most complex system of data storage can fail or degrade eventually.
We live in a world of problems, like the YouTube problem, compression problems, encoding problems, etc. We do because we chose efficiency and ease of use over permanency.
Yes, and this can be done through mostly automatic or distributed processes.
I wouldn't describe it as complex, just the bare minimum of what is required to actually preserve data with no loss. All physical mediums may degrade through physical processes, but redundant systems can do better.
It isn't hard to seed a torrent. If a group of people want to preserve a file, they can do it this way, perfectly, forever, so long as there remain people willing to devote space and bandwidth.
All of these problems boil down to intent. Do people intend to preserve a file, do they not care, do they actively favor degradation? In the case of the OP game, it seems that the last must be the case. Same with YouTube, same with all those media companies removing shows and movies entirely from all public availability, same with a lot of companies. If someone wants to preserve something, they choose the correct algorithms, simple as that. There isn't necessarily much of a tradeoff for efficiency and ease of use in doing so: disk space is cheap, bandwidth is cheap, the technology is mature and not complicated to use. Long term physical storage can be a part of that, but it isn't a replacement for intent or process.
I think you didn't read correctly on the statement about the most complex system failing. I'm not saying that is the most complex system, I am saying the most complex system will fail.
LMAO at the idea of comparing every bit of every portion of every seeder's copy with each other simultaneously and then cross referencing every parity file to be doubly safe, and then failing to see the chance of loss of parity during transmission of said files even after that. I will admit it would take a lot longer for a torrented file to degrade than some other forms of file distribution, but it's not going to last for a thousand years, mate.
And I am saying complexity has little to do with it and also that a system can exist that will not fail.
Specifically why not? What is unrealistic about this scenario, assuming enough people care to continue with the preservation effort? All nodes must fail simultaneously for any data to be lost. Any given node failing at any given time is an independent event with some finite probability P. The probability of all N nodes failing simultaneously is P^N, which shrinks exponentially as N grows. Very quickly you reach astronomically low probabilities; 1000 years is nothing and could be safely accomplished with a relatively low number of peers. Maybe there are external factors that would make that less realistic, like whether new generations will even care about preserving the data, but considering only the system itself it is entirely realistic.
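For a feel of the numbers, a back-of-the-envelope sketch in Python. Everything here is an assumption for illustration: failures are treated as independent, `p` is the chance that one node is lost within a single repair window (say, one day before anyone notices and re-seeds), and the values 0.01, 10 nodes and 1000 years of daily windows are invented rather than measured. It also assumes 0 <= p < 1.

```python
import math


def loss_probability(p: float, n: int) -> float:
    """Chance that all n independent nodes fail in the same window: p ** n."""
    return p ** n


def loss_over_time(p: float, n: int, windows: int) -> float:
    """Chance of at least one total loss across many independent windows.

    Computed as 1 - (1 - p**n) ** windows, using log1p/expm1 so that tiny
    probabilities are not rounded away by floating point.
    """
    return -math.expm1(windows * math.log1p(-loss_probability(p, n)))


# Hypothetical figures: 1% chance a node dies in a given day, checked daily.
p = 0.01
days_in_1000_years = 365 * 1000

for n in (1, 2, 5, 10):
    print(n, loss_probability(p, n), loss_over_time(p, n, days_in_1000_years))
```

With one node, loss over that period is essentially certain; each added node multiplies the per-window risk by another factor of `p`. The independence assumption is the weak point: a shared software bug or everyone losing interest at once would break it.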
Read the above conversation to find out.
This has nothing to do with copying a file. YouTube re-encodes videos whenever they are uploaded.
A file DOES NOT DEGRADE when it is copied. That is something that happened to VHS and cassette tapes. It does not happen to digital files. You can even verify this by generating a hash of a file, copying it 10,000 times, and generating a new hash of the final copy: the two would be 100% identical.
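Anyone who wants to try that experiment can do it with a few lines of Python. This is a sketch under some assumptions: it uses a 1 MiB file of random bytes in a temporary directory as the test file, copies a copy of a copy (so any error would accumulate), and defaults to 100 generations to keep it quick. The function names are made up; raise `generations` to 10,000 if you have the patience.

```python
import hashlib
import os
import shutil
import tempfile
from pathlib import Path


def file_sha256(path: Path) -> str:
    """Hash a file in chunks so large files do not need to fit in memory."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()


def copy_generations(source: Path, workdir: Path, generations: int) -> Path:
    """Copy the file, then copy the copy, and so on; return the last copy."""
    current = source
    for i in range(generations):
        # Alternate between two names so each copy is made from the previous one.
        nxt = workdir / f"copy_{i % 2}.bin"
        shutil.copyfile(current, nxt)
        current = nxt
    return current


with tempfile.TemporaryDirectory() as tmp:
    workdir = Path(tmp)
    source = workdir / "original.bin"
    source.write_bytes(os.urandom(1 << 20))  # hypothetical 1 MiB test file

    before = file_sha256(source)
    last = copy_generations(source, workdir, generations=100)
    after = file_sha256(last)

print("hash before:", before)
print("hash after: ", after)
print("identical:", before == after)
```

Unlike the YouTube reupload loop, nothing here re-encodes the data, so the hundredth-generation copy hashes the same as the original unless the hardware itself misbehaves.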
You should perform that exact experiment with a sufficient number of bits, you'll be surprised.
No I won't be, because I've done this before for various reasons, and not a single bit was changed.
Let me put it this way. A computer stores programs and instructions it needs to run in files on a drive. These files contain exact and precise instructions for various components to operate. If even a SINGLE bit is off in just a couple of the OS files, your computer will start throwing constant errors if not just crashing entirely.
And this isn't just theory. It's provable. Cosmic rays have been known to sometimes hit a drive and cause a bit-flip. Another issue is a drive not being powered on for a long time, causing bit-rot.
At this point I'm starting to think you're a troll. There's no way someone believes what you're saying.
Edit: autocorrect
I'm going to stop responding to you few left in this thread because I don't think you're trolls, I know you are.
Then you're not a troll, just completely deluded and frankly stupid. You've been getting so many genuine responses trying to help you learn, but you keep digging in your heels and doubling down on being confidently wrong.
Believe whatever you want, just keep it to yourself.
They want to "help me learn" that a form of media storage invented and refined within a couple of decades will outlast all other forms, because they've deluded themselves that the things they rely on are perfect and that failure is impossible.
What you're talking about is 100% unrelated to what the discussion is about. The media the files are stored on is irrelevant. USB flash drives are known to die within just a couple of years in some cases. But when the storage medium itself fails, the data on it is more or less lost. And that includes degradation of the medium itself. That's why both spinning hard drives and solid state drives need to be powered on and "refreshed" every so often (about a year for solid state and roughly a few years for magnetic). And degradation in this context means beyond the point where each bit can be reliably and accurately read from the medium. Once you go past that point you end up with corrupted data. And that includes pictures and videos. A raw picture probably won't be affected too much by a single bit flipping, but a jpg, for example, will visibly look corrupted. This is what a corrupted jpg looks like, and it can occur with just a single bit or byte being incorrectly changed/saved: jpg1 jpg2
And here's an example of corrupted video: https://www.youtube.com/watch?v=i-bz21deEeY
All it takes is a single corrupted byte in either the b-frame or i-frame in a video and it will cause that momentary glitch. That's what happens when data "degrades". Digitally copying a file absolutely does not "degrade" data each time it is copied. The idea is just laughable. We aren't talking about copying a VHS tape.
You're referring to a video codec degrading as it keeps rendering the video again, not just copying and pasting the bits. There is no degradation from copying and pasting a file as-is.
No, I am not referring to that. YouTubers have the option to download their own videos. Not steal it with a video downloading tool.
That's YouTube's processed video not the original.
And when you download the processed video and reupload it, it's a 1 to 1 conversion of the same video codec, and every generation it gets worse. That example is a low hanging fruit, but the concept applies to everything.
That 1:1 conversion through the same codec is very likely lossy. However that's not a straight file copy which is what you originally said causes degradation.
You really jumped in here to tell me exactly the contents of a comment I made just below it in the thread, as if I didn't already know it.
I jumped in to point out the flaw in the YouTube experiment you're referring to.
Can you think of a better visual example that a simple person could see and understand?
IMO, an easy way to remove YouTube's postprocessing from the equation would be to copy a video file to and from a NAS or other computer several times and compare it with the untouched file.
No, this is because YouTube compresses every file before distributing it. This happens even when downloading on the creator side.
Literally every file distribution method compresses the media first. A better argument would be that YouTube re-encodes the video during the re-upload with a particularly lossy method to save on bandwidth and server space.