Quote History Quoted:
I'm with this -- too many odd software problems generally mean a hardware problem. Conflicting IP address is a weird one but hey, weird things happen when computers start to die.
View Quote
There are really only a fairly specific set of "software problems" that are caused by hardware problems, and they're *usually* easy to spot if you know what you're doing. In general, 99% of all software is utter garbage under the hood, and riddled with bugs. Just because you don't see the bugs every day doesn't mean they aren't there.
I don't even know just precisely how many machines I keep running 24x7, it constantly grows, and I haven't checked in a while. But I do know that there are five figures in the number. For every hardware issue I deal with, I deal with 99+ software issues. It's not even uncommon for me to deal with issues that *seem* like they could be hardware, there's still a significant number of times that it ends up being software.
Latest case? Software occasionally, repeatedly stopping, and saying that the disk was full, read-only, or otherwise inaccessible. True culprit? When the software performed a DNS request *and* got an IPV6 result (had to have IPV6), it would fail to close the handle to the socket, causing a descriptor leak. Eventually, under the right conditions, it would exhaust the number of descriptors, and would no longer be able to open a new one to write to disk. Once we showed that to the vendor and they fixed the bug, it never happened again.
Another case, a large amount of our hardware was occasionally reporting memory issues - as in, the BIOS was logging ECC or other issues with DIMMs, and marking that DIMM as failed. Guess what, changing the DIMMs didn't help. Worked with the vendor, and whoopsie... it was a bug in the firmware which would sometimes incorrectly flag a DIMM as failed. With updated firmware, the problem was fixed.
Now I have seen weird hardware issues, like a bad disk controller cause permanent hardware issues with any motherboard it was hooked to, and I've seen one disk controller that would damage any hard drive connected to it. But these are really, extremely fringe cases. Software is almost universally buggy. Hardware probably is, too, but IME, software is much more commonly the culprit of people's problems.