Communication error on send on Baobab's /srv/beegfs/dpnc

Primary informations

Username: boutinh
Cluster: Baobab
Beegfs server: /srv/beegfs/dpnc/ on specific files

Description

Commands trying to access files: cat, vim, md5sum, rsync or zip locally or trying to move files to another server will either freeze (ctrl+c not working) or return “Communication error on send” and then freeze. htop shows the command with status S. ctrl+c changes that status to D.
This appears only on the beegfs server mentioned above and for certain files (I tried other directories and files and the commands execute). It appears the files I have issues with were all moved there/generated in the past month.
This issue appeared mid-afternoon yesterday (Sat 6th).

Steps to Reproduce

from /srv/beegfs/dpnc/groups/dampe/users/hugo/libeb_flux/MC_select/:
md5sum Filtered_Etruth_Ereco_Weight_PSDCharge_PSDCut_PSDRawCharge_PSDChargeOld_PSDChargeBoron_STKHits_noSTKFilter_Proton.npy ~

Expected Result

For the checksum to print

Actual Result

md5sum Filtered_Ereco_PSDCharge_PSDCut_PSDChargeOld_PSDChargeBoron_STKHits_20*
md5sum: Filtered_Ereco_PSDCharge_PSDCut_PSDChargeOld_PSDChargeBoron_STKHits_2015.npy: Communication error on send

1 Like

Hello @Hugo.Boutin

Do you still have the issue ?

On my side:

(baobab)-[root@login1 MC_select]$  md5sum Filtered_Etruth_Ereco_Weight_PSDCharge_PSDCut_PSDRawCharge_PSDChargeOld_PSDChargeBoron_STKHits_noSTKFilter_Proton.npy
9e1ae3ebf62ba3d3a319a714b7f2ff18  Filtered_Etruth_Ereco_Weight_PSDCharge_PSDCut_PSDRawCharge_PSDChargeOld_PSDChargeBoron_STKHits_noSTKFilter_Proton.npy

Hi Adrien,

Indeed it is working now, thanks! What was the issue?

I have too few information to investigate more :confused: