[lvs-users] ksoftirqd/0 runs at 100% and looks like it could be a kernel issue.
brentgclarklist at gmail.com
Thu May 31 11:21:18 BST 2012
I'm writing to you because we have a two-node cluster running just apache2, heartbeat and LVS, and every now and then we find that ksoftirqd/0 runs at 100% CPU. The only fix we have found is to reboot the server.
For a while now we thought it was an IRQ issue: if you google "ksoftirqd cpu", people suggest looking at irqbalance and setting IRQ affinity. Some have suggested making sure that your NIC supports MSI-X (ours do).
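Before chasing IRQ affinity, it may help to see which softirq type is actually keeping CPU0 busy. The following is only a rough sketch: it assumes the kernel exposes /proc/softirqs in the usual layout (a header row of CPU names, then one "NAME: count count ..." row per softirq type), and the helper names parse_softirqs and busiest_on_cpu are made up for this example.

```python
import time


def parse_softirqs(text):
    """Parse /proc/softirqs content into {softirq_name: [count_per_cpu, ...]}."""
    lines = text.strip().splitlines()
    counts = {}
    for line in lines[1:]:  # the first line is the CPU header row
        name, _, rest = line.partition(":")
        counts[name.strip()] = [int(n) for n in rest.split()]
    return counts


def busiest_on_cpu(before, after, cpu=0):
    """Return (name, delta) for the softirq type that grew most on one CPU."""
    deltas = {
        name: after[name][cpu] - before[name][cpu]
        for name in after
        if name in before
    }
    return max(deltas.items(), key=lambda kv: kv[1])


def read_softirqs(path="/proc/softirqs"):
    # Assumption: the file exists and is readable on the box being checked.
    with open(path) as f:
        return parse_softirqs(f.read())


if __name__ == "__main__":
    first = read_softirqs()
    time.sleep(5)  # sample interval, chosen arbitrarily
    second = read_softirqs()
    name, delta = busiest_on_cpu(first, second, cpu=0)
    print(f"CPU0: {name} grew by {delta} in 5 s")
```

If NET_RX or NET_TX dominates on CPU0 while ksoftirqd/0 is spinning, that would at least point at the network path rather than at timers or block I/O.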
Googling "lvs 126.96.36.199" led me down a winding path to a lengthy thread on the LVS mailing list from 2010, which appears to have spanned two months and deals with this ksoftirqd issue.
Long story short, and if I understood correctly, the ksoftirqd issue could actually be a kernel version issue: it appears to have been introduced in 2.6.28 and only fixed in 2.6.37-rc1.
We run Debian Squeeze, which is on 2.6.32+29.
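To check a set of machines against that range, something like the sketch below could be used. It takes my reading of the thread at face value (introduced in 2.6.28, fixed as of 2.6.37) and does not account for distribution kernels that may have backported the fix; the constant and function names are made up for this example.

```python
import platform
import re

# Assumption: range as I understood it from the 2010 thread, not verified.
FIRST_AFFECTED = (2, 6, 28)
FIRST_FIXED = (2, 6, 37)


def kernel_tuple(release):
    """Turn a release string such as '2.6.32-5-amd64' into (2, 6, 32)."""
    m = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", release)
    if not m:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    return tuple(int(part or 0) for part in m.groups())


def in_suspect_range(release):
    """True if the upstream version falls in [2.6.28, 2.6.37)."""
    return FIRST_AFFECTED <= kernel_tuple(release) < FIRST_FIXED


if __name__ == "__main__":
    release = platform.release()  # same string that `uname -r` prints
    verdict = "inside" if in_suspect_range(release) else "outside"
    print(f"{release} is {verdict} the suspect range")
```

By this check the Squeeze 2.6.32 kernel falls inside the range, which is why I am asking about it.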
For your own reading:
Start by reading all 'Kernel 2.6.35 and 100% S.I. CPU Time' in http://lists.graemef.net/pipermail/lvs-users/2010-September/subject.html#start
Then move on to http://lists.graemef.net/pipermail/lvs-users/2010-October/subject.html#start
I would like to ask: could it be a kernel issue?
And has anyone else experienced problems with the current Debian kernel?
If you did experience problems, did using the backports kernel address this issue?
Let me know what you think. If someone could assist, it would be appreciated.