Troubles with CPU and Motherboard: Updated Details on “2 Petabytes of Data Written on a 2-Year-Old SSD”

Understanding and Troubleshooting Hardware Errors: A Case Study of Persistent CPU and Motherboard Issues

In today’s article, we explore a real-world example of hardware troubleshooting involving persistent errors that impact system stability and performance. This case study centers on a user experiencing critical SSD health warnings alongside recurring CPU and motherboard errors, highlighting the diagnostic process, attempted solutions, and possible next steps.

System Background

  • Processor: Intel Core i7 9700K
  • Motherboard: Gigabyte Z390 Aorus Elite
  • Storage: SSD with 2 petabytes of writes, indicating it has reached critical health levels

Issue Overview

The primary concern involves a distinctive hardware error known as a WHEA Event 19. This error relates to the CPU’s cache hierarchy and indicates underlying hardware issues, often requiring in-depth troubleshooting. The user reports that Windows logs this error continually, approximately every second, which correlates with extremely high CPU core usage—particularly on Core 4, which appears to be running at 150% utilization due to error logging activities.

Error Details

  • Windows Event Viewer logs show repeated WHEA 19 errors
  • Task Manager and performance monitors indicate Core 4 is under abnormal stress
  • Error logs are accessible for review (see example report here)
  • CPU Core 4 consistently exhibits high activity, likely due to error correction and logging processes

Troubleshooting Steps Taken

To isolate the root cause, the user undertook several hardware and software adjustments:

  • Disabling CPU-related BIOS settings such as Speedshift, Turbo, and C-states to rule out BIOS misconfiguration
  • Testing RAM modules individually to identify potential memory channel issues
  • Removing the graphics card to eliminate PCIe-related problems
  • Observing the behavior in Safe Mode, where the error does not manifest, suggesting lower CPU stress or differing system conditions
  • Attempting to disable Windows error logging to reduce system stress, though this raises caution about monitoring the problematic core

Current Status and Potential Solutions

While disabling the problematic CPU core temporarily alleviates the issue—effectively running the system with three cores instead of eight—this is not a viable long-term solution. The user plans to replace the CPU and motherboard components to definitively determine whether hardware failure is the root cause.

Recommendations for Software-Side Troubleshooting

If you encounter similar issues, consider the following steps:

  1. Monitor Event Logs Carefully: Keep track of recurring hardware-related errors such

Share this content:

Leave a Reply

Your email address will not be published. Required fields are marked *