Run the `capstart.sh` command to begin rotating parallel tcpdumps on all nodes with external interfaces.
- The scripts `check-nfs.sh` and `check-nfs-datacapture.sh` must be loaded onto the node and stored in `/root` so that they can be run via cron.
- Configure an ssh key on the node that can reach a specific node on the storage cluster as root (required to terminate the server tcpdump process by calling `capstop.sh`).
- Make sure the file `/tmp/nfsissuecaptured` does not exist.
- Run `echo 0 > /tmp/nfsmonitor.semaphore` to reset the event notification from previous captures.
- Configure `check-nfs.sh` to run every 10 minutes via cron.
- Configure `check-nfs-datacapture.sh` to run every 2 minutes so that it catches events quickly.
- Launch a new tcpdump capture with:
tcpdump -w /tmp/pcaps/$(date '+%Y-%m-%dT%H.%M.%S').$(hostname).em1.pcap -i em1 -s 640 -C 250 -W 100 -Z root
- It is recommended to launch this in a tmux session on the individual node, with additional panes running:
tail -F /var/log/messages | grep CAPTURE
tail -F /var/log/cron -n 40
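
The preparation steps above could be collected into a few helper functions. This is only a minimal sketch: the function names and the `STATE_DIR` variable are hypothetical, removing `nfsissuecaptured` with `rm -f` is one assumed way to make sure it does not exist, and the crontab lines assume both scripts are executable in `/root` and are installed in root's user crontab. The size estimate assumes tcpdump's default `-C` unit of 1,000,000 bytes.

```shell
#!/bin/sh
# Hypothetical helpers for the capture preparation steps (a sketch, not the original scripts).

# Directory holding the marker and semaphore files; /tmp per the steps above.
STATE_DIR="${STATE_DIR:-/tmp}"

# Clear state left over from a previous capture.
reset_capture_state() {
    # Assumption: deleting the marker is an acceptable way to ensure it is absent.
    rm -f "$STATE_DIR/nfsissuecaptured"
    # Reset the event notification semaphore.
    echo 0 > "$STATE_DIR/nfsmonitor.semaphore"
}

# Print crontab lines for the two monitoring scripts
# (user-crontab format: minute hour day-of-month month day-of-week command).
cron_entries() {
    cat <<'EOF'
*/10 * * * * /root/check-nfs.sh
*/2 * * * * /root/check-nfs-datacapture.sh
EOF
}

# Upper bound, in MB, on disk used by a tcpdump ring buffer:
# -C <size> (millions of bytes per file) times -W <count> (number of files).
capture_budget_mb() {
    echo $(( $1 * $2 ))
}
```

For the tcpdump command above (`-C 250 -W 100`), `capture_budget_mb 250 100` gives 25000, so `/tmp/pcaps` should have roughly 25 GB free. One common way to install the cron lines is `(crontab -l 2>/dev/null; cron_entries) | crontab -`, run as root.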