Relic5646

joined 2 years ago
[–] Relic5646@lemmy.world 2 points 1 year ago

I've been suuuuper lazy troubleshooting this, so it's been a few weeks, but I talked to WD support. They said to run a full extended S.M.A.R.T. test on the drive, and it passed with no issues.

Reconnected it to my server using a different SATA cable on a different port on the motherboard, with a different power connector. It resilvered with no problems, and a zpool scrub returned no errors this time so hopefully I'm in the clear!

I have a script that runs once a week, does a scrub, then sends the output of zpool status to a Discord channel. When this first started, it showed read errors (as mentioned in the post), then checksum errors two weeks later. Since there were a couple of different errors before troubleshooting and none after the latest scrub, I'm hoping this means everything's fine now.
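
A weekly job like the one described above could be sketched roughly as follows. This is not the actual script from this comment, just a minimal Python sketch under some assumptions: the pool name is taken from the post, `WEBHOOK_URL` is a placeholder, the helper names are made up for illustration, and `zpool scrub -w` (wait for completion) is assumed to be available, which needs a reasonably recent OpenZFS.

```python
import json
import subprocess
import urllib.request

POOL = "blackhole"  # pool name from the post
WEBHOOK_URL = "https://discord.com/api/webhooks/<id>/<token>"  # placeholder (assumption)
DISCORD_LIMIT = 2000  # Discord caps message content at 2000 characters


def build_payload(status_text):
    """Wrap zpool status output in a code block, trimmed to Discord's limit."""
    fence = "`" * 3
    room = DISCORD_LIMIT - 2 * len(fence) - 2  # two fences plus two newlines
    body = status_text.rstrip()[:room]
    content = f"{fence}\n{body}\n{fence}"
    return json.dumps({"content": content}).encode("utf-8")


def run_scrub_and_report(pool=POOL, webhook_url=WEBHOOK_URL):
    """Scrub the pool, then post `zpool status` to a Discord webhook."""
    # Assumption: -w blocks until the scrub finishes (OpenZFS 2.0 or newer).
    subprocess.run(["zpool", "scrub", "-w", pool], check=True)
    status = subprocess.run(
        ["zpool", "status", pool],
        check=True, capture_output=True, text=True,
    ).stdout
    request = urllib.request.Request(
        webhook_url,
        data=build_payload(status),
        # A custom User-Agent is set because the default urllib one may be rejected.
        headers={"Content-Type": "application/json", "User-Agent": "zpool-report"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status
```

Calling `run_scrub_and_report()` from a weekly cron job or systemd timer (as root, since scrubbing needs privileges) would cover the schedule. Very long status output is simply cut off at the limit in this sketch.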

[–] Relic5646@lemmy.world 29 points 1 year ago (3 children)

I always thought it was nuts that people don't remember, then I turned 30 and got married to someone who has a birthday less than a year before mine. Now I have two ages to remember (that are sometimes the same) and it takes a second for me to remember which is mine when asked.

[–] Relic5646@lemmy.world 2 points 1 year ago

Wow, that's pretty substantial, thanks for the tips! And yeah, Backblaze does seem pretty affordable.

[–] Relic5646@lemmy.world 2 points 1 year ago (2 children)

Thanks, I'll try some of those things out and see if a second scrub says the same.

"That being said I have pretty good backups"

Out of curiosity, what do you do for backups? The initial cost of 3x12TB drives was enough to make me not want to spend a bunch more money on backup stuff at the time, but now that I'm seeing errors I'm willing to spend a bit of money again and should look into my options.

[–] Relic5646@lemmy.world 2 points 1 year ago

Thanks, I will start a backup now. I don't have any extra automated backups so I guess this is my wake-up call to figure something out.

My weekly zpool scrub came back with this:

  pool: blackhole
 state: DEGRADED
status: One or more devices are faulted in response to persistent errors.
    Sufficient replicas exist for the pool to continue functioning in a
    degraded state.
action: Replace the faulted device, or use 'zpool clear' to mark the device
    repaired.
  scan: scrub repaired 0B in 02:01:59 with 0 errors on Tue Jul 11 04:02:09 2023
config:

    NAME                                    STATE     READ WRITE CKSUM
    blackhole                               DEGRADED     0     0     0
      raidz1-0                              DEGRADED     0     0     0
        ata-WDC_WD120EDAZ-11F3RA0_5PG8DYKC  ONLINE       0     0     0
        ata-WDC_WD120EFBX-68B0EN0_5QKJ6M8B  ONLINE       0     0     0
        ata-WDC_WD120EFBX-68B0EN0_5QKJTT8B  FAULTED     51     0     0  too many errors

errors: No known data errors

I only got the drive 6 months ago, well within WD's 3-year warranty, so I opened a support case. But do errors like this basically always mean the drive is on its way out, or is it possible to have false positives?
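
For picking out which device is accumulating errors from output like the above, the config table can be parsed into per-device counters. The following is a rough Python sketch, not an official tool: `DeviceStatus`, `parse_devices`, and `devices_with_errors` are hypothetical names, and the sample text is a shortened copy of the status output from this post.

```python
from typing import NamedTuple


class DeviceStatus(NamedTuple):
    name: str
    state: str
    read: int
    write: int
    cksum: int


def parse_devices(status_text):
    """Return one DeviceStatus per row of the config table in `zpool status`."""
    devices = []
    in_config = False
    for line in status_text.splitlines():
        fields = line.split()
        if fields[:5] == ["NAME", "STATE", "READ", "WRITE", "CKSUM"]:
            in_config = True  # rows after this header are pool/vdev/device entries
            continue
        if not in_config:
            continue
        if line.startswith("errors:"):
            break  # end of the config table
        # Rows with abbreviated counts such as 1.5K are skipped in this sketch.
        if len(fields) >= 5 and all(f.isdigit() for f in fields[2:5]):
            devices.append(DeviceStatus(fields[0], fields[1], *map(int, fields[2:5])))
    return devices


def devices_with_errors(status_text):
    """Keep only the rows where any of READ, WRITE, or CKSUM is non-zero."""
    return [d for d in parse_devices(status_text) if d.read or d.write or d.cksum]


SAMPLE = """\
  pool: blackhole
 state: DEGRADED
config:

    NAME                                    STATE     READ WRITE CKSUM
    blackhole                               DEGRADED     0     0     0
      raidz1-0                              DEGRADED     0     0     0
        ata-WDC_WD120EDAZ-11F3RA0_5PG8DYKC  ONLINE       0     0     0
        ata-WDC_WD120EFBX-68B0EN0_5QKJ6M8B  ONLINE       0     0     0
        ata-WDC_WD120EFBX-68B0EN0_5QKJTT8B  FAULTED     51     0     0  too many errors

errors: No known data errors
"""

for dev in devices_with_errors(SAMPLE):
    print(dev.name, dev.state, f"read={dev.read} write={dev.write} cksum={dev.cksum}")
```

On the sample, only the FAULTED drive with 51 read errors is reported; the pool and raidz1-0 rows show DEGRADED but have zero counters, so they are filtered out.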

[–] Relic5646@lemmy.world 2 points 2 years ago

Uncomfortable pooping somewhere with no bidet available?