Hi,
unfortunately the combination you suggest isn’t valid. The nearest would be
two times 3g.20g.
Anyway as you seems to be able to “saturate” the full A100 there isn’t indeed any good reason ton split it for your use case.
The issue we’ll face is other GPU jobs with less resources needs will use a full A100 for no good reason. Slurm is missing a way to exclude those kind of GPUs if the user is only asking for a generic GPU for example.
As you seems to have an application that can do some “real world” benchmarking on GPUs, if you have some spare time I would be very interested to have a comparison with the other GPUs we provide : RTX, V100 for example.
I’ll revert the A100 to full in the next few days.