My environment is as follows:
– Supermicro H11SSL-NC with EPYC 7232P CPU
– ESXi-7.0b-16324942-standard
– Western Digital SN750 1T disk
– Debian 10 guest
I’m testing WD disk with this configuration but I experience crash every time when testing file I/O with sysbench. I’m using the following command:
sysbench fileio –file-test-mode=seqwr –file-total-size=10G –file-block-size=16K –threads=8 –time=60 run
Please take a look at the following kernel log:
2020-09-07T09:10:19.407Z cpu8:1049379)WARNING: NVMEIO:2223 Controller 256 receiv – Pastebin.com
You can see that there is a critical warning 0x2 in the beginning and after that everything goes down. APD starts and All Paths Down is finally reached. Disk never recovers and I have to power off the guest. It is in invalid state after that and I need to reboot the host to recover.
It appears that increasing –file-block-size will make the system crash earlier. With 4K block size it seems to work.
Any ideas what is the issue and/or how to workaround this? What is that critical warning 0x2?