
Linux kernel:NMI watchdog: BUG: soft lockup - CPU

Posted on 11/22/2021 2:28:52 PM
Problem: The kernel keeps reporting a soft lockup alarm, but the server does not go down. Cause analysis: as the name suggests, a soft lockup is a bug that does not freeze the system completely. Instead, some processes (or kernel threads) are stuck in a certain state, usually in kernel space, and in many cases this is caused by the way kernel locks are used.

Message from syslogd@centos7-itsvse at Nov 22 13:52:22 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [java:6062]
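
When many of these messages pile up in the log, it can help to extract the CPU number, the stuck time and the process. Below is a minimal sketch that parses the line shown above. The pattern and the function name `parse_soft_lockup` are assumptions made for illustration, and other kernel versions may word the message differently.

```python
import re

# Pattern written against the single log line quoted above (an assumption,
# not an official format description).
SOFT_LOCKUP_RE = re.compile(
    r"BUG: soft lockup - CPU#(?P<cpu>\d+) stuck for (?P<seconds>\d+)s! "
    r"\[(?P<comm>.+):(?P<pid>\d+)\]"
)


def parse_soft_lockup(line):
    """Return cpu, seconds, comm and pid as a dict, or None if the line does not match."""
    match = SOFT_LOCKUP_RE.search(line)
    if match is None:
        return None
    return {
        "cpu": int(match["cpu"]),
        "seconds": int(match["seconds"]),
        "comm": match["comm"],  # process name; may itself contain ':'
        "pid": int(match["pid"]),
    }


line = "kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [java:6062]"
print(parse_soft_lockup(line))
# {'cpu': 3, 'seconds': 22, 'comm': 'java', 'pid': 6062}
```

Here the stuck task is the java process with PID 6062, which had been occupying CPU 3 for 22 seconds when the message was printed.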

Solution (not tested)

(End)





Original poster | Posted on 11/22/2021 2:30:08 PM
watchdog
In short, a watchdog is a mechanism that keeps the system running normally, or gets it out of an infinite loop, a deadlock and similar states.

Watchdogs are divided into hardware watchdogs and software watchdogs. A hardware watchdog uses a timer circuit whose output is connected to the reset pin of the system. The program clears the timer at regular intervals (commonly known as "feeding the dog"), so while the program is working normally the timer never overflows and no reset signal is generated. If the program fails and does not feed the watchdog within the timer period, the watchdog timer overflows, generates a reset signal and restarts the system. A software watchdog works on the same principle, but the timer on the hardware circuit is replaced with the processor's internal timer. This simplifies the hardware design, but it is less reliable than a hardware timer: for example, a failure of the system's internal timer itself cannot be detected.
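
The feed-or-reset principle described above can be sketched as a small simulation. This is a toy model only: the class `WatchdogTimer`, the helper `run` and the timeout of 5 ticks are hypothetical names and values chosen for illustration, not a real driver API.

```python
class WatchdogTimer:
    """Toy watchdog: raises a reset flag if it is not fed within `timeout` ticks."""

    def __init__(self, timeout):
        self.timeout = timeout
        self.counter = 0
        self.reset_fired = False

    def feed(self):
        """Clear the counter ("feeding the dog")."""
        self.counter = 0

    def tick(self):
        """Advance the timer by one tick; reaching the timeout fires the reset."""
        self.counter += 1
        if self.counter >= self.timeout:
            self.reset_fired = True


def run(healthy_ticks, total_ticks, timeout=5):
    """Simulate a program that feeds the dog on every tick until it hangs."""
    dog = WatchdogTimer(timeout)
    for t in range(total_ticks):
        if t < healthy_ticks:
            dog.feed()  # the program is still alive and clears the timer
        dog.tick()
    return dog.reset_fired


print(run(healthy_ticks=20, total_ticks=20))  # program never hangs -> False
print(run(healthy_ticks=10, total_ticks=20))  # program hangs at tick 10 -> True
```

As long as the program keeps feeding, the counter never reaches the timeout. Once feeding stops, the reset fires a few ticks later.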

There are two types of software watchdogs: the ordinary soft watchdog, which detects soft lockups (based on clock interrupts), and the NMI watchdog, which detects hard lockups (based on NMI interrupts).

Note 1: The clock interrupt has a lower priority than the NMI.
Note 2: A lockup means that a certain section of kernel code keeps occupying the CPU. A lockup can make the entire system unresponsive.
The only difference between a soft lockup and a hard lockup is that a hard lockup occurs while interrupts are masked on the CPU.
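
To make the difference concrete, here is a toy model of one CPU watched by both detectors. It is a sketch under stated assumptions, not the kernel's implementation: the class `ToyCpu`, the helper `simulate`, the 20-second soft threshold and the 10-second NMI period are all hypothetical. The model assumes the clock interrupt checks whether a watchdog task has been able to run recently, and the NMI checks whether clock interrupts are still arriving.

```python
from dataclasses import dataclass, field


@dataclass
class ToyCpu:
    """Toy model of one CPU with a soft and a hard lockup detector."""
    soft_threshold: int = 20   # assumed: seconds the watchdog task may go without running
    nmi_period: int = 10       # assumed: seconds between NMI checks
    now: int = 0
    last_touch: int = 0        # last time the watchdog task got to run
    timer_irqs: int = 0        # number of clock interrupts handled so far
    irqs_seen_by_nmi: int = 0  # value of timer_irqs at the previous NMI
    events: list = field(default_factory=list)

    def tick(self, scheduling_blocked, interrupts_masked):
        """Advance simulated time by one second."""
        self.now += 1
        if not scheduling_blocked and not interrupts_masked:
            self.last_touch = self.now  # watchdog task ran normally
        if not interrupts_masked:
            # Clock interrupt: still fires while kernel code hogs the CPU.
            self.timer_irqs += 1
            if self.now - self.last_touch > self.soft_threshold:
                self.events.append(("soft lockup", self.now))
                self.last_touch = self.now  # do not report on every tick
        if self.now % self.nmi_period == 0:
            # NMI: delivered even when ordinary interrupts are masked.
            if self.timer_irqs == self.irqs_seen_by_nmi:
                self.events.append(("hard lockup", self.now))
            self.irqs_seen_by_nmi = self.timer_irqs


def simulate(healthy, stuck, interrupts_masked):
    """Run `healthy` normal seconds, then `stuck` seconds of kernel code hogging the CPU."""
    cpu = ToyCpu()
    for _ in range(healthy):
        cpu.tick(scheduling_blocked=False, interrupts_masked=False)
    for _ in range(stuck):
        cpu.tick(scheduling_blocked=True, interrupts_masked=interrupts_masked)
    return cpu.events


print(simulate(5, 25, interrupts_masked=False))  # only a soft lockup is reported
print(simulate(5, 25, interrupts_masked=True))   # only hard lockups are reported
```

With interrupts enabled, the clock interrupt still runs and notices that the watchdog task is starved, which is reported as a soft lockup. With interrupts masked, the clock interrupt never runs, so only the NMI can notice the problem, which is reported as a hard lockup.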
————————————————
Copyright notice: This is an original article by CSDN blogger "Hober_yao", released under the CC 4.0 BY-SA license. When reprinting, please include the original source link and this statement.
Original link: https://blog.csdn.net/yhb1047818384/article/details/70833825