Troubleshooting Network Issues¶
Step 1: Check Network Addresses (on tsuru)¶
After ssh’ing to tsuru, run:
ip address
Interfaces 2, 4, and 5 should be UP.
- 2 - fabric switch
- 4 - local network
- 5 - lab network
If DOWN, run Ron’s setup scripts to configure the ip addresses for the network interfaces:
bash ~rp/setup.sh
ip address
Step 2: Initializing Camera¶
We want to make sure the camera is initialize when testing the network connections.
First, log in to tsuru (vnc is helpful here to run Xi-CAM).
If the Controls GUI does not show in Xi-CAM Acquire after selecting fastccd
device,
check that you have $PASSWD_MONGO
and $USER_MONGO
set.
If initalizing the camera (via the Initialize
button) fails, check the service logs:
journalctl -f -u fastccd_support_ioc
This may show something like socket.timeout=timed out
.
Re-check the network interface addresses (Step 1).
Retry the initialization of the camera in Xi-CAM.
Step 3: Check connection between tsuru and CIN¶
We will then want to test the connection between tsuru and the camera interface node.
CIN IP: 10.0.5.207
On tsuru, run:
sudo ping -f 10.0.5.207
After a few seconds, CTRL-C and check packet loss (20% is usual).
Next, we will want to look at the connection from the fastccdATCA1 server to the fabric switch.
Step 4: Check connection between fastccdATCA1 and CIN¶
ssh to fastccdatca1.dchp.lbl.gov and repeat the ping flood from step 3.
This may also show packet loss of around 20%.
Step 5: Check fastccdATCA1 and tsuru network configuration¶
is this relevant??? – recording for note purposes, modify later
On fastcameraatca1 and tsuru, check the net.core.rmem_default
and net.core.rmem_max
values:
systemctl net.core.rmem_default
systemctl net.core.rmem_max
Max should match default. If not, modify net.core.rmem_default
to match the net.core.rmem_max
in the following files:
/etc/sysctl.conf
/etc/rc.d/rc.local
Then, reboot:
sudo shutdown -r 0
After rebooting, reconnect and repeat steps 3 and 4.