Troubleshooting

Intended audience: Anyone who is administering Data Transfer Monitoring.

Known Issues

Issue

Description

Workaround

File Counts are not accurate

The Summit will resend files if USDF Ceph S3 is not responsive causing duplicate file notifications for the same file.

See Calculate File Counts

Negative Time File transfers

End Readout Messages Timestamps for DARK LSSTCamcheckout images are after the file transfers

No known work around. It could be investigated to exclude these from file transfer calculations.

Monitoring

See Overview for links to the dashboard.

Metrics not Displaying

Symptoms: Metrics are not getting displayed in Grafana.

Cause: The scrape settings or port settings could have been modified or removed.

Solution: Validate the following is set on the pod. If not update the deployment manifest.

metadata:
  annotations:
    prometheus.io/port: "8000"
    prometheus.io/scrape: "true"