University of Cape Town
UCT High Performance Computing Cluster


Last 72 hours completed jobs for a100 queue

2716 Jobs: Total CPU time: 1 hours (0 days)       Total Wall time: 586 hours (24 days)

2026-03-05
Completion time User Job ID and name Account CPU time Wall time MaxRSS MaxVMem Tasks Cores Nodes AveRead AveWrite Exit code: status
2026-03-05 05:15:53 dplmor006 540510 LSTM_lossfunc_WaDi_focal_WS48_H2048x1024x512x256x128x64_LR0.0001 acsl 12-11:57:24 3-02:59:21 9.79 GB 0 GB 1 4(1) 1 0.01M 0.00M TIMEOUT
2026-03-05 08:51:30 01476962 581025 MyJob nlpgroup80 7-12:49:16 1-21:12:19 26.47 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-05 16:07:56 grddan017 586669 yolo11m a100free 00:00:28 00:00:07 0.51 GB 0 GB 1 4(1) 1 0.01M 0.00M CANCELLED BY grddan017
2026-03-05 16:18:22 grddan017 586671 yolo11-VSF-l a100free 00:16:28 00:04:07 16.57 GB 0 GB 1 4(1) 1 0.01M 0.00M CANCELLED BY grddan017
2026-03-05 16:23:16 bhduna001 586666 nextshift-train-a100 vaccine 01:52:24 00:28:06 11.3 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-03-05 16:23:29 bhduna001 586667 nextshift-train-a100 vaccine 00:00:52 00:00:13 0 GB 0 GB 1 4(1) 1 0.01M 0.00M CANCELLED BY bhduna001
2026-03-05 16:36:00 bhduna001 586674 nextshift-train-a100 vaccine 00:46:52 00:11:43 6.78 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-03-05 17:03:25 bhduna001 586675 nextshift-train-a100 vaccine 01:49:40 00:27:25 7.43 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-03-05 17:07:11 bhduna001 586683 nextshift-train-a100 vaccine 00:15:04 00:03:46 6.14 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-03-05 17:27:34 bhduna001 586684 nextshift-train-a100 vaccine 01:21:32 00:20:23 6.7 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-03-05 17:44:01 bhduna001 586685 nextshift-train-a100 vaccine 01:05:48 00:16:27 6.62 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-03-05 17:55:02 bhduna001 586686 nextshift-train-a100 vaccine 00:44:04 00:11:01 6.54 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-03-05 18:04:28 01476962 581122 MyJob nlpgroup80 1-12:51:52 09:12:58 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-05 18:32:01 grddan017 586672 yolo11m-lut a100free 08:50:36 02:12:39 16.28 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-05 20:36:35 01476962 581024 MyJob nlpgroup80 9-11:50:20 2-08:57:35 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-05 23:58:03 01476962 581021 MyJob nlpgroup80 10-01:23:32 2-12:20:53 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06
Completion time User Job ID and name Account CPU time Wall time MaxRSS MaxVMem Tasks Cores Nodes AveRead AveWrite Exit code: status
2026-03-06 01:32:57 grddan017 586670 yolo11-TSM-l-lut a100free 1-13:15:04 09:18:46 25.71 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06 04:45:57 01476962 581123 MyJob nlpgroup80 1-18:45:56 10:41:29 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06 04:45:57 mwrsim003 584044 custom_pretraining.sh nlpgroup80 00:00:00 00:00:00 0 GB 0 GB 1 10(1) 1 0.01M 0.00M FAILED
2026-03-06 05:49:04 01476962 581125 MyJob nlpgroup80 1-12:49:56 09:12:29 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06 09:52:36 skscla001 587020 MMS-DAT-Model-HPO nlpgroup80 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY skscla001
2026-03-06 10:20:23 01476962 581126 MyJob nlpgroup80 1-17:29:20 10:22:20 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06 10:48:40 dplmor006 584802 TCN_lossfunc_PumpDataset_weighted_WS6_H512x1024x2048_LR1e-06 acsl 6-06:38:44 1-13:39:41 1.87 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06 11:39:06 01476962 581023 MyJob nlpgroup80 12-00:00:36 3-00:00:09 37.39 GB 0 GB 1 4(1) 1 0.01M 0.00M TIMEOUT
2026-03-06 14:13:23 grddan017 587731 yolo11m a100free 07:05:12 01:46:18 17.89 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06 16:46:09 01476962 581127 MyJob nlpgroup80 2-00:00:48 12:00:12 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06 17:28:06 rchnic009 580816 r10_Sf1b_6RU vaccine 6-13:30:10 3-06:45:05 5.11 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-03-06 17:40:34 bhduna001 587966 nextshift-train-a100 vaccine 00:07:08 00:01:47 5.84 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-03-06 17:42:01 bhduna001 587967 nextshift-train-a100 vaccine 00:05:48 00:01:27 3.99 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-03-06 17:54:03 rchnic009 581120 r8_Sf2a2_6RU vaccine 6-12:18:40 3-06:09:20 5.11 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-03-06 19:06:32 bhduna001 587970 nextshift-train-a100 vaccine 01:17:41 01:17:41 7.53 GB 0 GB 1 1(1) 1 0.01M 0.00M COMPLETED
2026-03-06 19:09:21 bhduna001 587972 nextshift-train-a100 vaccine 05:01:12 01:15:18 7.43 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06 20:22:09 bhduna001 587973 nextshift-train-a100 vaccine 05:02:28 01:15:37 7.27 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-06 22:58:32 bhduna001 587974 nextshift-train-a100 vaccine 15:16:44 03:49:11 7.54 GB 0 GB 4 4(1) 1 1755.62M 1379.73M FAILED
2026-03-06 23:02:34 bhduna001 587975 nextshift-train-a100 vaccine 10:41:40 02:40:25 7.63 GB 0 GB 4 4(1) 1 1635.36M 962.72M FAILED
2026-03-07
Completion time User Job ID and name Account CPU time Wall time MaxRSS MaxVMem Tasks Cores Nodes AveRead AveWrite Exit code: status
2026-03-07 00:16:53 bhduna001 587971 nextshift-train-a100 vaccine 01:18:21 01:18:21 7.19 GB 0 GB 1 1(1) 1 0.01M 0.00M COMPLETED
2026-03-07 01:08:44 skscla001 588018 MMS_Model_DAT_AT nlpgroup80 01:57:20 00:29:20 33 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-07 01:22:29 skscla001 588019 MMS_Model_DAT_AT nlpgroup80 02:48:28 00:42:07 30.1 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-07 02:01:56 skscla001 588020 MMS_Model_DAT_AT nlpgroup80 03:32:48 00:53:12 33.3 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-07 02:14:33 skscla001 588021 MMS_Model_DAT_AT nlpgroup80 03:28:16 00:52:04 30.04 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-07 02:45:32 skscla001 588023 MMS_Model_DAT_AT nlpgroup80 02:03:56 00:30:59 29.91 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-07 02:46:05 skscla001 588025 MMS_Model_DAT_AT nlpgroup80 00:02:12 00:00:33 0.47 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-03-07 02:53:46 skscla001 588022 MMS_Model_DAT_AT nlpgroup80 03:27:20 00:51:50 30.03 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
Completion time User Job ID and name Account CPU time Wall time MaxRSS MaxVMem Tasks Cores Nodes AveRead AveWrite Exit code: status