[2024-feb-29] Sad news: Eric Layton aka Nocturnal Slacker aka vtel57 passed away on Feb 26th, shortly after hospitalization. He was one of our Wiki's most prominent admins. He will be missed.

Welcome to the Slackware Documentation Project

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
howtos:hardware:hardware_diagnostics [2014/01/21 19:54 (UTC)] – Added power button section metaschimahowtos:hardware:hardware_diagnostics [2014/06/21 17:17 (UTC)] – [Common Symptoms] metaschima
Line 38: Line 38:
   * Reboot loop where the system shuts down and powers up on its own continuously.   * Reboot loop where the system shuts down and powers up on its own continuously.
   * The system may not boot after pressing the power button, and you may need to press it more than once.   * The system may not boot after pressing the power button, and you may need to press it more than once.
-  * You may get a "boot failure" message when the system POSTs.+  * You may get a "boot failure" message during system POST. 
 +  * You may hear beep codes.
 === CMOS battery === === CMOS battery ===
   * The time is reset on each boot.   * The time is reset on each boot.
Line 59: Line 60:
 === CPU === === CPU ===
   * May or may not POST.   * May or may not POST.
-  * Kernel panics are possible with multi-core machines, when only one core is affected. 
   * There may be beep codes.   * There may be beep codes.
   * The fans are typically running at 100%.   * The fans are typically running at 100%.
-  * If it is overheating it may trigger a MCE (Machine-check exception) which will cause a kernel panic and forced shutdown, or it may throttle itself down and the system will seem slower.+  * Kernel panics are seen with multi-core machines, when only one core is affected. These are clearly listed as ''Hardware Error'' MCE (Machine-check exception). 
 +  * If it is overheating it may also trigger a MCE which will cause a kernel panic and forced shutdown, or it may throttle itself down and the system will seem slower.
 === Motherboard === === Motherboard ===
   * Check for swollen capacitors.   * Check for swollen capacitors.
Line 73: Line 74:
   * May not open when you press the button, but instead make some clunking sounds and open only after many button presses.   * May not open when you press the button, but instead make some clunking sounds and open only after many button presses.
 === Mouse === === Mouse ===
-  * Problems with drag-and-drop, drag-and-select, double-click.+  * Problems with click-and-drag, drag-and-drop, drag-to-select, double-click.
   * Mouse pointer jumps.   * Mouse pointer jumps.
   * Acceleration issues.   * Acceleration issues.
Line 95: Line 96:
 You should **ALWAYS** use a surge protector on **ALL** electronic devices **ALL** the time. This prevents damage to the PSU, to the motherboard, and it saves you lots of time and money wasted on replacing electronics damaged by power surges. Many surge protectors come with a warranty that will refund a certain amount of money if your equipment is damaged while using the surge protector properly. The surge protector is cheap, the equipment is expensive, and the refund usually more than covers it. You should **ALWAYS** use a surge protector on **ALL** electronic devices **ALL** the time. This prevents damage to the PSU, to the motherboard, and it saves you lots of time and money wasted on replacing electronics damaged by power surges. Many surge protectors come with a warranty that will refund a certain amount of money if your equipment is damaged while using the surge protector properly. The surge protector is cheap, the equipment is expensive, and the refund usually more than covers it.
 </note> </note>
 +=== Power Button ===
 +  * A bad power button is surprisingly difficult to diagnose, and there are no software tests that can help. Looking for characteristic symptoms such as a reboot loop and swapping out hardware may be the only way to diagnose it.
 === CMOS battery === === CMOS battery ===
   * You could take the battery out, making sure to use the special tab to remove it rather than trying to pry it off, and measure its voltage. Or you could just throw it away in the proper battery disposal container and replace it anyway just to be sure, and also because you may have had to remove a graphics card or PCI/PCIE card to reach the battery and you may not have a battery tester for these types of batteries.   * You could take the battery out, making sure to use the special tab to remove it rather than trying to pry it off, and measure its voltage. Or you could just throw it away in the proper battery disposal container and replace it anyway just to be sure, and also because you may have had to remove a graphics card or PCI/PCIE card to reach the battery and you may not have a battery tester for these types of batteries.
Line 199: Line 202:
 === PSU === === PSU ===
 Replace the PSU with a new PSU, if the symptoms disappear, you can be sure it was the PSU. However, note that a bad PSU may damage the motherboard or other components, or maybe it was a power surge that damaged everything. Replace the PSU with a new PSU, if the symptoms disappear, you can be sure it was the PSU. However, note that a bad PSU may damage the motherboard or other components, or maybe it was a power surge that damaged everything.
 +=== Power Button ===
 +Try to clean the power button using compressed air. If that doesn't work then replace the power button if possible, otherwise replace the case along with the button or take it to a repair shop and see what they can do.
 === CMOS battery === === CMOS battery ===
 Replace the battery carefully using the special tab. Do **NOT** pry it off using a screwdriver because then it will break and it won't go back in. Replace the battery carefully using the special tab. Do **NOT** pry it off using a screwdriver because then it will break and it won't go back in.
Line 210: Line 215:
 === CPU === === CPU ===
 If you suspect the CPU is overheating, then you can try to remove the heatsink-fan block according to your CPU or heatsink manual, [[http://www.howtocleananything.com/home-interior-tips/computer/how-to-clean-thermal-paste-off-cpu/|clean off the old thermal paste]], [[http://www.maximumpc.com/article/howtos/howto_install_cpu_and_apply_thermal_paste|apply new thermal paste]], or let an expert do it. Otherwise, replace the CPU. Make sure it is actually the CPU and not the RAM that is the problem, as a CPU is very expensive compared to RAM. If you suspect the CPU is overheating, then you can try to remove the heatsink-fan block according to your CPU or heatsink manual, [[http://www.howtocleananything.com/home-interior-tips/computer/how-to-clean-thermal-paste-off-cpu/|clean off the old thermal paste]], [[http://www.maximumpc.com/article/howtos/howto_install_cpu_and_apply_thermal_paste|apply new thermal paste]], or let an expert do it. Otherwise, replace the CPU. Make sure it is actually the CPU and not the RAM that is the problem, as a CPU is very expensive compared to RAM.
 +<note>
 +If you cannot replace the CPU and it is a multi-core CPU with only one core that has failed, [[http://www.linuxquestions.org/questions/showthread.php?p=5162136|for example the L1 cache of CPU core #2]], then you can try to disable the failed core. One way to do this is to use the ''maxcpus='' kernel boot option. In the example you would use:
 +<code>
 +maxcpus=2
 +</code>
 +this is because CPU cores are numbered starting with 0, so 4 cores would be numbered 0, 1, 2, 3. Note that if CPU #0 is failing, you may be out of luck. Another complementary way is to first make sure you have ''CONFIG_HOTPLUG_CPU'' built-in to the kernel, and running:
 +<code>
 +echo 1 > /sys/devices/system/cpu/cpu3/online
 +</code>
 +on every boot. In this example this turns ON CPU #3. So, basically you would have the 3 good cores running i.e. 0, 1, 3.
 +</note>
 === Motherboard === === Motherboard ===
 Replace the motherboard, and make sure the PSU is not damaged, as it can damage your new motherboard. Replace the motherboard, and make sure the PSU is not damaged, as it can damage your new motherboard.
 howtos:hardware:hardware_diagnostics ()