University of Cape Town
UCT High Performance Computing Cluster


Last 72 hours completed jobs for a100 queue

7896 Jobs: Total CPU time: 668 hours (27 days); Total Wall time: 651 hours (27 days)
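
The CPU time and Wall time columns in the listing appear to use a [D-]HH:MM:SS layout, where an optional leading day count is separated by a hyphen (for example 1-11:41:18). The sketch below is a minimal, hypothetical helper for reading those values; the function names parse_duration and cpu_to_wall_ratio are illustrative and not part of any cluster tooling. It assumes every duration follows that layout. For the two sample rows used here (jobs 788957 and 787949), the CPU time works out to the wall time multiplied by the leading number in the Cores column, which suggests, but does not prove, that CPU time in this report is an allocation-based figure.

```python
def parse_duration(text: str) -> int:
    """Convert a '[D-]HH:MM:SS' string into a number of seconds.

    Assumption: the optional day count is separated from the clock
    part by a single hyphen, as in '1-11:41:18'.
    """
    days = 0
    if "-" in text:
        day_part, text = text.split("-", 1)
        days = int(day_part)
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


def cpu_to_wall_ratio(cpu_time: str, wall_time: str) -> float:
    """Return CPU seconds per wall-clock second (0.0 for zero-length jobs)."""
    wall = parse_duration(wall_time)
    if wall == 0:
        return 0.0
    return parse_duration(cpu_time) / wall


# Values copied from two rows of the 2026-04-29 listing.
print(cpu_to_wall_ratio("23:55:20", "02:59:25"))    # job 788957, Cores 8(1) -> 8.0
print(cpu_to_wall_ratio("1-11:41:18", "17:50:39"))  # job 787949, Cores 2(1) -> 2.0

# The summary line's day figures match whole days of the hour totals.
print(668 // 24, 651 // 24)  # -> 27 27
```

A ratio well below the allocated core count would be the more interesting signal, but this report cannot show it if CPU time is derived from the allocation rather than measured.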

2026-04-29
Completion time User Job ID and name Account CPU time Wall time MaxRSS MaxVMem Tasks Cores Nodes AveRead AveWrite Exit code: status
2026-04-29 00:39:59 lmbanr001 788957 ft-ner_zul_long nlpgroup 23:55:20 02:59:25 3.2 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 03:11:46 ctzcar020 787949 Pn33B_3RU_Na_run7 vaccine 1-11:41:18 17:50:39 5.11 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-04-29 03:17:41 01476962 779645 MyJob nlpgroup80 8-21:38:32 2-05:24:38 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-29 04:18:02 01476962 779643 MyJob nlpgroup80 9-01:40:40 2-06:25:10 37.43 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-29 04:47:59 01476962 779647 MyJob nlpgroup80 4-17:01:20 1-04:15:20 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-29 06:01:39 lmbanr001 788810 eval-run_mamba_masakhapos_all nlpgroup 3-16:23:52 11:02:59 1.76 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 09:45:27 lmbanr001 789359 eval-masakhaner_tsn nlpgroup 00:04:40 00:00:35 0.62 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 09:45:27 lmbanr001 789360 eval-masakhaner_xho nlpgroup 00:04:40 00:00:35 0.66 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 09:45:27 lmbanr001 789361 eval-masakhaner_zul nlpgroup 00:04:40 00:00:35 0.63 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:09:31 lmbanr001 789401 eval-masakhaner_tsn nlpgroup 00:02:32 00:00:19 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:09:32 lmbanr001 789402 eval-masakhaner_xho nlpgroup 00:02:32 00:00:19 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:09:46 lmbanr001 789403 eval-masakhaner_zul nlpgroup 00:02:00 00:00:15 0.22 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 10:09:52 lmbanr001 789404 ft-pos_all_tagseq nlpgroup 00:02:40 00:00:20 0.25 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 10:10:39 bxxjin001 789399 dpb_p1_6models_1epoch a100free 00:06:28 00:01:37 2.96 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-29 10:10:46 lmbanr001 789406 eval-masakhaner_tsn nlpgroup 00:02:32 00:00:19 0.36 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:10:47 lmbanr001 789407 eval-masakhaner_xho nlpgroup 00:02:32 00:00:19 0.36 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:10:55 lmbanr001 789408 eval-masakhaner_zul nlpgroup 00:02:08 00:00:16 0.24 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:11:10 lmbanr001 789409 ft-pos_all_tagseq nlpgroup 00:03:12 00:00:24 0.98 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 10:11:40 lmbanr001 789411 eval-run_mamba_masakhaner_tsn nlpgroup 00:02:16 00:00:17 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:11:40 lmbanr001 789412 eval-run_mamba_masakhaner_xho nlpgroup 00:02:08 00:00:16 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:11:40 lmbanr001 789413 eval-run_mamba_masakhaner_zul nlpgroup 00:02:08 00:00:16 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:11:57 bxxjin001 789400 dpb_p2_6models_1epoch a100free 00:11:40 00:02:55 3.5 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-29 10:12:40 lmbanr001 789414 eval-run_mamba_masakhaner_tsn nlpgroup 00:02:24 00:00:18 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:12:40 lmbanr001 789415 eval-run_mamba_masakhaner_xho nlpgroup 00:02:24 00:00:18 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:12:41 lmbanr001 789416 eval-run_mamba_masakhaner_zul nlpgroup 00:02:32 00:00:19 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 10:20:19 bxxjin001 789432 dpb_p1_6models_1epoch a100free 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bxxjin001
2026-04-29 10:20:19 bxxjin001 789433 dpb_p2_6models_1epoch a100free 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bxxjin001
2026-04-29 10:48:53 lmbanr001 789422 eval-run_mamba_masakhaner_xho_long_stepfix nlpgroup 04:39:28 00:34:56 1.74 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 10:59:50 bxxjin001 789465 dpb_p4_12runs_1epoch a100free 00:43:04 00:10:46 43.5 GB 0 GB 1 4(1) 1 0.01M 0.00M CANCELLED BY bxxjin001
2026-04-29 12:30:25 bhduna001 789612 nextshift-train-a100 vaccine 00:14:00 00:14:00 3.74 GB 0 GB 1 1(1) 1 1494.48M 112.94M FAILED
2026-04-29 12:30:28 bhduna001 789613 nextshift-train-a100 vaccine 00:14:02 00:14:02 3.75 GB 0 GB 1 1(1) 1 1497.47M 115.61M FAILED
2026-04-29 12:44:35 bhduna001 789615 nextshift-train-a100 vaccine 00:14:07 00:14:07 3.81 GB 0 GB 1 1(1) 1 1501.31M 114.22M FAILED
2026-04-29 12:44:39 bhduna001 789614 nextshift-train-a100 vaccine 00:14:14 00:14:14 3.68 GB 0 GB 1 1(1) 1 1501.57M 114.52M FAILED
2026-04-29 12:51:23 bhduna001 789616 nextshift-train-a100 vaccine 00:06:47 00:06:47 3.54 GB 0 GB 1 1(1) 1 1440.86M 62.14M CANCELLED BY bhduna001
2026-04-29 12:51:23 bhduna001 789617 nextshift-train-a100 vaccine 00:06:44 00:06:44 3.53 GB 0 GB 1 1(1) 1 1440.32M 61.68M CANCELLED BY bhduna001
2026-04-29 13:01:52 bxxjin001 789491 dpb_p4_12runs_1epoch a100free 08:01:56 02:00:29 44.82 GB 0 GB 1 4(1) 1 0.01M 0.00M TIMEOUT
2026-04-29 13:02:14 lmbanr001 789660 eval-run_mamba_masakhapos_tagseq_decode_probe_ckpt500 nlpgroup 00:02:48 00:00:21 0.34 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 13:05:59 bhduna001 789667 nextshift-train-a100 vaccine 00:13:58 00:13:58 3.76 GB 0 GB 1 1(1) 1 1494.52M 112.91M FAILED
2026-04-29 13:06:08 bhduna001 789668 nextshift-train-a100 vaccine 00:14:06 00:14:06 3.68 GB 0 GB 1 1(1) 1 1497.52M 115.59M FAILED
2026-04-29 13:19:46 bhduna001 789669 nextshift-train-a100 vaccine 00:13:47 00:13:47 3.72 GB 0 GB 1 1(1) 1 1502.71M 115.43M FAILED
2026-04-29 13:19:51 bhduna001 789670 nextshift-train-a100 vaccine 00:13:43 00:13:43 3.68 GB 0 GB 1 1(1) 1 1501.01M 113.89M FAILED
2026-04-29 13:29:51 lmbanr001 789421 eval-run_mamba_masakhaner_tsn_long_stepfix nlpgroup 1-02:07:12 03:15:54 1.78 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 13:30:09 bxxjin001 789684 dpb_p4_continue a100free 00:00:00 00:00:00 0 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-04-29 13:30:09 lmbanr001 789685 eval-run_mamba_masakhapos_tagseq_decode_probe_ckpt500 nlpgroup 00:02:24 00:00:18 0.53 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 13:33:45 bhduna001 789671 nextshift-train-a100 vaccine 00:13:59 00:13:59 3.78 GB 0 GB 1 1(1) 1 1501.45M 114.56M FAILED
2026-04-29 13:33:58 bhduna001 789672 nextshift-train-a100 vaccine 00:14:07 00:14:07 3.69 GB 0 GB 1 1(1) 1 1501.14M 114.29M FAILED
2026-04-29 13:38:14 bxxjin001 789590 nm_p4_compare a100free 01:12:00 00:36:00 42.8 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-04-29 13:43:30 bxxjin001 789709 mmd_bench a100free 00:07:16 00:01:49 0.51 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-04-29 13:45:30 bxxjin001 789714 mmd_verify a100free 00:00:16 00:00:04 0 GB 0 GB 1 4(1) 1 0.00M 0.00M COMPLETED
2026-04-29 13:53:36 lmbanr001 789417 ft-pos_all_tagseq nlpgroup 1-05:29:52 03:41:14 3.27 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 14:12:59 lmbanr001 789798 eval-run_mamba_masakhapos_tagseq_decode_probe_ckpt500 nlpgroup 01:57:12 00:14:39 1.82 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 14:19:52 lmbanr001 789824 probe-pos-tagseq nlpgroup 00:01:24 00:00:21 0.3 GB 0 GB 1 4(1) 1 0.01M 0.00M FAILED
2026-04-29 14:26:12 lmbanr001 789826 probe-pos-tagseq nlpgroup 00:06:04 00:01:31 1.35 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-29 14:41:42 lmbanr001 789833 probe-pos-tagseq nlpgroup 00:19:52 00:04:58 2.41 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-29 15:05:01 bxxjin001 789873 dpb_xpert_smoke a100free 00:02:04 00:01:02 34.35 GB 0 GB 1 2(1) 1 0.01M 0.00M FAILED
2026-04-29 15:06:18 lmbanr001 789859 eval-run_mamba_afrihg_xho nlpgroup 01:32:32 00:11:34 1.64 GB 0 GB 1 8(2) 1 0.01M 0.00M COMPLETED
2026-04-29 15:08:28 bxxjin001 789885 mmd_dryrun a100free 00:02:56 00:00:22 0.76 GB 0 GB 1 8(1) 1 0.00M 0.00M COMPLETED
2026-04-29 15:18:07 bhduna001 789875 nextshift-train-a100 vaccine 00:13:54 00:13:54 3.82 GB 0 GB 1 1(1) 1 1496.58M 114.60M FAILED
2026-04-29 15:18:20 bhduna001 789876 nextshift-train-a100 vaccine 00:14:07 00:14:07 3.67 GB 0 GB 1 1(1) 1 1497.15M 115.18M FAILED
2026-04-29 15:19:04 bhduna001 789877 nextshift-train-a100 vaccine 00:00:57 00:00:57 3.89 GB 0 GB 1 1(1) 1 1371.33M 2.67M CANCELLED BY bhduna001
2026-04-29 15:19:04 bhduna001 789878 nextshift-train-a100 vaccine 00:00:44 00:00:44 2.49 GB 0 GB 1 1(1) 1 1372.12M 3.50M CANCELLED BY bhduna001
2026-04-29 15:19:04 bhduna001 789879 nextshift-train-a100 vaccine 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bhduna001
2026-04-29 15:19:04 bhduna001 789880 nextshift-train-a100 vaccine 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bhduna001
2026-04-29 15:19:42 bxxjin001 789886 dpb_xpert_smoke a100free 00:20:46 00:10:23 35.64 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-04-29 15:21:55 bhduna001 789894 nextshift-train-a100 vaccine 00:01:52 00:01:52 2.7 GB 0 GB 1 1(1) 1 432.88M 16.34M CANCELLED BY bhduna001
2026-04-29 15:21:55 bhduna001 789895 nextshift-train-a100 vaccine 00:01:52 00:01:52 2.69 GB 0 GB 1 1(1) 1 432.83M 16.30M CANCELLED BY bhduna001
2026-04-29 15:21:55 bhduna001 789896 nextshift-train-a100 vaccine 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bhduna001
2026-04-29 15:21:55 bhduna001 789897 nextshift-train-a100 vaccine 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bhduna001
2026-04-29 15:21:55 bhduna001 789898 nextshift-train-a100 vaccine 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bhduna001
2026-04-29 15:21:55 bhduna001 789899 nextshift-train-a100 vaccine 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bhduna001
2026-04-29 15:36:20 bxxjin001 789903 mmd_backfill a100free 01:54:02 00:10:22 41.76 GB 0 GB 1 11(1) 1 0.01M 0.00M CANCELLED BY bxxjin001
2026-04-29 15:37:39 lmbanr001 789423 eval-run_mamba_masakhaner_zul_long_stepfix nlpgroup 1-19:09:36 05:23:42 1.85 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 15:39:16 bxxjin001 789921 mmd_backfill a100free 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bxxjin001
2026-04-29 15:40:21 bhduna001 789915 nextshift-train-a100 vaccine 00:05:00 00:05:00 2.87 GB 0 GB 1 1(1) 1 441.46M 134.40M FAILED
2026-04-29 15:40:21 bhduna001 789916 nextshift-train-a100 vaccine 00:04:59 00:04:59 2.89 GB 0 GB 1 1(1) 1 441.46M 134.24M FAILED
2026-04-29 15:44:39 bhduna001 789918 nextshift-train-a100 vaccine 00:04:18 00:04:18 2.81 GB 0 GB 1 1(1) 1 456.25M 37.40M FAILED
2026-04-29 15:44:43 bhduna001 789917 nextshift-train-a100 vaccine 00:04:22 00:04:22 2.82 GB 0 GB 1 1(1) 1 456.07M 36.48M FAILED
2026-04-29 15:46:32 bxxjin001 789922 mmd_backfill a100free 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bxxjin001
2026-04-29 15:48:57 bhduna001 789919 nextshift-train-a100 vaccine 00:04:18 00:04:18 2.93 GB 0 GB 1 1(1) 1 453.66M 34.83M FAILED
2026-04-29 15:49:00 bhduna001 789920 nextshift-train-a100 vaccine 00:04:17 00:04:17 2.58 GB 0 GB 1 1(1) 1 453.86M 35.00M FAILED
2026-04-29 16:10:20 bhduna001 790008 nextshift-train-a100 vaccine 00:04:48 00:04:48 2.81 GB 0 GB 1 1(1) 1 441.63M 134.25M FAILED
2026-04-29 16:10:21 bhduna001 790009 nextshift-train-a100 vaccine 00:04:48 00:04:48 2.84 GB 0 GB 1 1(1) 1 441.47M 134.21M FAILED
2026-04-29 16:14:37 bhduna001 790011 nextshift-train-a100 vaccine 00:04:16 00:04:16 2.01 GB 0 GB 1 1(1) 1 452.44M 34.42M FAILED
2026-04-29 16:14:39 bhduna001 790010 nextshift-train-a100 vaccine 00:04:19 00:04:19 2.04 GB 0 GB 1 1(1) 1 452.69M 34.66M FAILED
2026-04-29 16:18:50 bhduna001 790013 nextshift-train-a100 vaccine 00:04:11 00:04:11 1.96 GB 0 GB 1 1(1) 1 453.26M 35.60M FAILED
2026-04-29 16:18:51 bhduna001 790012 nextshift-train-a100 vaccine 00:04:14 00:04:14 1.97 GB 0 GB 1 1(1) 1 453.15M 35.50M FAILED
2026-04-29 16:27:22 bhduna001 790040 nextshift-train-a100 vaccine 00:04:43 00:04:43 2.1 GB 0 GB 1 1(1) 1 440.79M 132.98M FAILED
2026-04-29 16:27:23 bhduna001 790039 nextshift-train-a100 vaccine 00:04:45 00:04:45 2.1 GB 0 GB 1 1(1) 1 440.44M 133.81M FAILED
2026-04-29 16:34:29 bhduna001 790088 nextshift-train-a100 vaccine 00:01:32 00:01:32 2.11 GB 0 GB 1 1(1) 1 414.20M 113.20M FAILED
2026-04-29 16:34:30 bhduna001 790089 nextshift-train-a100 vaccine 00:01:33 00:01:33 2.09 GB 0 GB 1 1(1) 1 414.16M 113.21M FAILED
2026-04-29 16:41:52 bhduna001 790097 nextshift-train-a100 vaccine 00:01:50 00:01:50 2.01 GB 0 GB 1 1(1) 1 433.91M 21.84M CANCELLED BY bhduna001
2026-04-29 16:41:52 bhduna001 790098 nextshift-train-a100 vaccine 00:01:49 00:01:49 2.01 GB 0 GB 1 1(1) 1 433.70M 21.63M CANCELLED BY bhduna001
2026-04-29 16:49:16 bhduna001 790126 nextshift-train-a100 vaccine 00:00:08 00:00:08 0.21 GB 0 GB 1 1(1) 1 0 0.00M FAILED
2026-04-29 16:49:16 bhduna001 790127 nextshift-train-a100 vaccine 00:00:08 00:00:08 0.2 GB 0 GB 1 1(1) 1 0 0.00M FAILED
2026-04-29 16:52:32 bhduna001 790129 nextshift-train-a100 vaccine 00:00:03 00:00:03 0 GB 0 GB 1 1(1) 1 0 0.00M FAILED
2026-04-29 16:52:33 bhduna001 790130 nextshift-train-a100 vaccine 00:00:03 00:00:03 0 GB 0 GB 1 1(1) 1 0 0.00M FAILED
2026-04-29 16:56:12 bhduna001 790134 nextshift-train-a100 vaccine 00:00:04 00:00:04 0 GB 0 GB 1 1(1) 1 0 0.00M FAILED
2026-04-29 16:56:14 bhduna001 790135 nextshift-train-a100 vaccine 00:00:03 00:00:03 0 GB 0 GB 1 1(1) 1 0 0.00M FAILED
2026-04-29 17:48:57 bxxjin001 789959 dpb_p4_continue a100free 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY bxxjin001
2026-04-29 18:00:58 bhduna001 790205 nextshift-train-a100 vaccine 00:22:06 00:22:06 4.41 GB 0 GB 1 1(1) 1 1698.44M 287.30M COMPLETED
2026-04-29 18:01:13 bhduna001 790206 nextshift-train-a100 vaccine 00:22:20 00:22:20 4.45 GB 0 GB 1 1(1) 1 1627.97M 286.22M COMPLETED
2026-04-29 18:15:55 bhduna001 790237 nextshift-train-a100 vaccine 00:01:02 00:01:02 2.1 GB 0 GB 1 1(1) 1 427.41M 13.78M FAILED
2026-04-29 18:15:56 bhduna001 790238 nextshift-train-a100 vaccine 00:01:02 00:01:02 2.13 GB 0 GB 1 1(1) 1 427.35M 13.72M FAILED
2026-04-29 18:21:53 01476962 779644 MyJob nlpgroup80 11-09:55:44 2-20:28:56 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-29 18:24:54 bhduna001 790240 nextshift-train-a100 vaccine 00:01:02 00:01:02 1.95 GB 0 GB 1 1(1) 1 428.00M 15.09M FAILED
2026-04-29 18:24:56 bhduna001 790241 nextshift-train-a100 vaccine 00:01:02 00:01:02 1.96 GB 0 GB 1 1(1) 1 428.00M 15.09M FAILED
2026-04-29 18:28:50 bhduna001 790281 nextshift-train-a100 vaccine 00:01:38 00:01:38 2.13 GB 0 GB 1 1(1) 1 413.60M 111.25M FAILED
2026-04-29 18:28:50 bhduna001 790282 nextshift-train-a100 vaccine 00:01:38 00:01:38 2.14 GB 0 GB 1 1(1) 1 413.48M 111.09M FAILED
2026-04-29 18:35:31 bhduna001 790284 nextshift-train-a100 vaccine 00:00:33 00:00:33 1.73 GB 0 GB 1 1(1) 1 438.06M 6.50M COMPLETED
2026-04-29 18:35:31 bhduna001 790285 nextshift-train-a100 vaccine 00:00:33 00:00:33 1.7 GB 0 GB 1 1(1) 1 438.12M 6.31M COMPLETED
2026-04-29 18:38:29 bhduna001 790287 nextshift-train-a100 vaccine 00:01:02 00:01:02 1.95 GB 0 GB 1 1(1) 1 427.42M 13.79M FAILED
2026-04-29 18:38:30 bhduna001 790286 nextshift-train-a100 vaccine 00:01:03 00:01:03 1.97 GB 0 GB 1 1(1) 1 427.42M 15.08M FAILED
2026-04-29 18:42:30 bhduna001 790288 nextshift-train-a100 vaccine 00:00:02 00:00:02 0 GB 0 GB 1 1(1) 1 0 0.00M FAILED
2026-04-29 18:42:30 bhduna001 790289 nextshift-train-a100 vaccine 00:00:02 00:00:02 0 GB 0 GB 1 1(1) 1 0 0.00M FAILED
2026-04-29 18:44:57 lmbanr001 789860 eval-run_mamba_masakhapos_tsn nlpgroup 1-05:09:12 03:38:39 1.84 GB 0 GB 1 8(2) 1 0.01M 0.00M COMPLETED
2026-04-29 18:45:44 bhduna001 790290 nextshift-train-a100 vaccine 00:02:21 00:02:21 1.96 GB 0 GB 1 1(1) 1 444.31M 34.95M CANCELLED BY bhduna001
2026-04-29 18:45:44 bhduna001 790291 nextshift-train-a100 vaccine 00:02:20 00:02:20 1.98 GB 0 GB 1 1(1) 1 443.90M 34.53M CANCELLED BY bhduna001
2026-04-29 18:48:58 lmbanr001 789861 eval-run_mamba_masakhapos_xho nlpgroup 1-01:30:32 03:11:19 1.83 GB 0 GB 1 8(2) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-29 18:48:58 lmbanr001 789862 eval-run_mamba_masakhapos_zul nlpgroup 00:32:08 00:04:01 1.77 GB 0 GB 1 8(2) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-29 18:50:43 bhduna001 790312 nextshift-train-a100 vaccine 00:00:59 00:00:59 1.96 GB 0 GB 1 1(1) 1 428.00M 15.01M FAILED
2026-04-29 18:50:44 bhduna001 790313 nextshift-train-a100 vaccine 00:01:00 00:01:00 1.96 GB 0 GB 1 1(1) 1 428.00M 15.01M FAILED
2026-04-29 19:00:45 bhduna001 790322 nextshift-train-a100 vaccine 00:07:10 00:07:10 2 GB 0 GB 1 1(1) 1 538.52M 104.33M COMPLETED
2026-04-29 19:00:53 bhduna001 790323 nextshift-train-a100 vaccine 00:07:17 00:07:17 1.99 GB 0 GB 1 1(1) 1 498.58M 102.10M COMPLETED
2026-04-29 19:04:37 lmbanr001 790310 eval-run_mamba_masakhapos_xho nlpgroup 02:04:56 00:15:37 1.76 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-29 19:04:37 lmbanr001 790311 eval-run_mamba_masakhapos_zul nlpgroup 02:04:48 00:15:36 1.68 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-29 19:17:25 skscla001 789964 xlsr_model_20hrs nlpgroup 01:53:40 00:28:25 16.54 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-29 19:20:56 lmbanr001 790338 eval-run_mamba_masakhapos_xho nlpgroup 02:10:16 00:16:17 1.66 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 19:21:34 lmbanr001 790339 eval-run_mamba_masakhapos_zul nlpgroup 02:15:12 00:16:54 1.62 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 19:29:28 bhduna001 790366 nextshift-train-a100 vaccine 00:07:15 00:07:15 2.01 GB 0 GB 1 1(1) 1 497.32M 100.78M COMPLETED
2026-04-29 19:29:30 bhduna001 790367 nextshift-train-a100 vaccine 00:07:16 00:07:16 1.97 GB 0 GB 1 1(1) 1 497.10M 100.55M COMPLETED
2026-04-29 19:36:41 bhduna001 790368 nextshift-train-a100 vaccine 00:07:13 00:07:13 2.13 GB 0 GB 1 1(1) 1 497.59M 101.16M COMPLETED
2026-04-29 19:36:45 bhduna001 790369 nextshift-train-a100 vaccine 00:07:15 00:07:15 2.14 GB 0 GB 1 1(1) 1 500.04M 103.98M COMPLETED
2026-04-29 19:43:56 bhduna001 790370 nextshift-train-a100 vaccine 00:07:15 00:07:15 2.28 GB 0 GB 1 1(1) 1 496.75M 100.81M COMPLETED
2026-04-29 19:43:59 bhduna001 790371 nextshift-train-a100 vaccine 00:07:14 00:07:14 1.99 GB 0 GB 1 1(1) 1 496.96M 101.03M COMPLETED
2026-04-29 19:51:09 bhduna001 790372 nextshift-train-a100 vaccine 00:07:13 00:07:13 2.43 GB 0 GB 1 1(1) 1 503.10M 104.58M COMPLETED
2026-04-29 19:51:15 bhduna001 790373 nextshift-train-a100 vaccine 00:07:16 00:07:16 2.11 GB 0 GB 1 1(1) 1 499.23M 101.88M COMPLETED
2026-04-29 19:58:25 bhduna001 790374 nextshift-train-a100 vaccine 00:07:16 00:07:16 2.46 GB 0 GB 1 1(1) 1 499.81M 100.76M COMPLETED
2026-04-29 19:58:31 bhduna001 790375 nextshift-train-a100 vaccine 00:07:16 00:07:16 2.14 GB 0 GB 1 1(1) 1 499.92M 100.86M COMPLETED
2026-04-29 20:05:41 bhduna001 790376 nextshift-train-a100 vaccine 00:07:16 00:07:16 2.43 GB 0 GB 1 1(1) 1 498.71M 100.86M COMPLETED
2026-04-29 20:05:50 bhduna001 790377 nextshift-train-a100 vaccine 00:07:19 00:07:19 2.12 GB 0 GB 1 1(1) 1 500.34M 102.56M COMPLETED
2026-04-29 20:09:39 lmbanr001 790433 eval-run_mamba_sa_general_missing nlpgroup 00:02:16 00:00:17 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 20:12:20 lmbanr001 790434 eval-run_mamba_base_masakhapos_all nlpgroup 00:23:44 00:02:58 1.29 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-29 20:12:57 bhduna001 790378 nextshift-train-a100 vaccine 00:07:16 00:07:16 2.42 GB 0 GB 1 1(1) 1 499.78M 101.08M COMPLETED
2026-04-29 20:13:06 bhduna001 790379 nextshift-train-a100 vaccine 00:07:16 00:07:16 2.14 GB 0 GB 1 1(1) 1 499.35M 100.65M COMPLETED
2026-04-29 20:20:12 bhduna001 790380 nextshift-train-a100 vaccine 00:07:14 00:07:14 2.43 GB 0 GB 1 1(1) 1 501.18M 102.69M COMPLETED
2026-04-29 20:20:25 bhduna001 790381 nextshift-train-a100 vaccine 00:07:19 00:07:19 2.11 GB 0 GB 1 1(1) 1 499.02M 100.43M COMPLETED
2026-04-29 20:27:31 bhduna001 790382 nextshift-train-a100 vaccine 00:07:19 00:07:19 2.43 GB 0 GB 1 1(1) 1 500.13M 100.75M COMPLETED
2026-04-29 20:27:43 bhduna001 790383 nextshift-train-a100 vaccine 00:07:18 00:07:18 1.98 GB 0 GB 1 1(1) 1 502.40M 103.13M COMPLETED
2026-04-29 20:31:59 lmbanr001 790437 eval-run_mamba_base_masakhapos_xho nlpgroup 02:13:36 00:16:42 1.28 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 20:34:01 lmbanr001 790435 eval-run_mamba_sa_general_missing nlpgroup 03:02:16 00:22:47 2.6 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 20:34:47 bhduna001 790384 nextshift-train-a100 vaccine 00:07:16 00:07:16 2.42 GB 0 GB 1 1(1) 1 501.94M 102.70M COMPLETED
2026-04-29 20:35:02 bhduna001 790385 nextshift-train-a100 vaccine 00:07:19 00:07:19 2.12 GB 0 GB 1 1(1) 1 499.62M 100.29M COMPLETED
2026-04-29 20:48:20 lmbanr001 790438 eval-run_mamba_base_masakhapos_zul nlpgroup 02:10:48 00:16:21 1.36 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 20:48:20 lmbanr001 790441 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 20:48:20 lmbanr001 790442 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 20:48:20 lmbanr001 790443 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 20:48:21 lmbanr001 790444 launch_evaluation.sh nlpgroup 00:00:08 00:00:01 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 20:48:21 lmbanr001 790445 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 20:48:21 lmbanr001 790446 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 20:48:21 lmbanr001 790447 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 20:48:21 lmbanr001 790448 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 20:48:21 lmbanr001 790449 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 20:48:22 lmbanr001 790450 launch_evaluation.sh nlpgroup 00:00:08 00:00:01 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 20:50:47 lmbanr001 790439 eval-run_mamba_base_masakhapos_tsn nlpgroup 02:14:08 00:16:46 1.28 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 20:52:39 lmbanr001 790454 eval-run_mamba_decode_sweep_t2x_xho_val nlpgroup 00:02:48 00:00:21 0.01 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 20:52:47 lmbanr001 790455 eval-run_mamba_decode_sweep_afrihg_all_val nlpgroup 00:03:52 00:00:29 0.89 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 20:57:31 lmbanr001 790456 eval-run_mamba_decode_sweep_ner_all_val_greedy nlpgroup 00:38:56 00:04:52 1.58 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-29 20:57:31 lmbanr001 790457 eval-run_mamba_decode_sweep_ner_all_val_greedy_rp12_nr3 nlpgroup 00:37:52 00:04:44 0 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-29 20:57:31 lmbanr001 790458 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-04-29 20:57:31 lmbanr001 790459 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-04-29 20:57:31 lmbanr001 790460 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-04-29 20:57:31 lmbanr001 790461 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-04-29 20:57:31 lmbanr001 790462 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-04-29 20:57:31 lmbanr001 790463 launch_evaluation.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-04-29 21:00:19 lmbanr001 790468 eval-run_mamba_decode_sweep_afrihg_all_val nlpgroup 00:20:08 00:02:31 1.62 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:01:55 lmbanr001 790467 eval-run_mamba_decode_sweep_t2x_xho_val nlpgroup 00:32:56 00:04:07 0 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:23:00 lmbanr001 790469 eval-run_mamba_decode_sweep_ner_all_val_greedy nlpgroup 03:01:28 00:22:41 1.63 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:29:36 lmbanr001 790470 eval-run_mamba_decode_sweep_ner_all_val_greedy_rp12_nr3 nlpgroup 03:41:28 00:27:41 1.73 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:30:40 bxxjin001 790006 mmd_backfill a100free 1-05:38:20 02:41:40 50.23 GB 0 GB 1 11(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:32:27 lmbanr001 790471 eval-run_mamba_decode_sweep_ner_all_val_beam5 nlpgroup 01:15:36 00:09:27 1.79 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:34:48 lmbanr001 790473 eval-run_mamba_decode_sweep_pos_all_val_greedy nlpgroup 00:33:04 00:04:08 1.57 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:38:21 lmbanr001 790474 eval-run_mamba_decode_sweep_pos_all_val_greedy_rp12_nr3 nlpgroup 00:47:12 00:05:54 1.64 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:41:07 lmbanr001 790475 eval-run_mamba_decode_sweep_pos_all_val_beam5 nlpgroup 00:50:32 00:06:19 1.78 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:41:26 lmbanr001 790472 eval-run_mamba_decode_sweep_ner_all_val_beam5_rp12_nr3 nlpgroup 01:34:40 00:11:50 1.83 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 21:53:36 lmbanr001 790476 eval-run_mamba_decode_sweep_pos_all_val_beam5_rp12_nr3 nlpgroup 02:02:00 00:15:15 1.83 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 22:09:07 lmbanr001 790451 sallm-mamba2-hybrid nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-04-29 22:09:42 lmbanr001 790577 eval-run_mamba_sa_general_classification_missing nlpgroup 00:02:24 00:00:18 0.52 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 22:09:42 lmbanr001 790581 eval-run_mamba_sa_general_structured_missing nlpgroup 00:02:24 00:00:18 0.52 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 22:28:14 lmbanr001 790603 launch_hpo.sh nlpgroup 00:00:08 00:00:01 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 22:28:14 lmbanr001 790604 launch_hpo.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-29 22:28:14 lmbanr001 790605 launch_hpo.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 22:28:14 lmbanr001 790606 launch_hpo.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 1 8(1) 1 0.00M 0.00M FAILED
2026-04-29 22:56:27 ctzcar020 787964 Pn33G_3RU_Na_run3 vaccine 3-02:03:00 1-13:01:30 5.11 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-04-29 22:58:19 lmbanr001 790589 eval-run_mamba_sa_general_classification_missing nlpgroup 06:21:36 00:47:42 6.24 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-29 23:23:55 lmbanr001 790590 eval-run_mamba_sa_general_structured_missing nlpgroup 09:46:24 01:13:18 1.79 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30
Completion time User Job ID and name Account CPU time Wall time MaxRSS MaxVMem Tasks Cores Nodes AveRead AveWrite Exit code: status
2026-04-30 02:50:15 lmbanr001 790777 hpo-opt_news_xho_hb nlpgroup 1-10:43:52 04:20:29 2.63 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 03:37:52 bxxjin001 790431 dpb_all6_1ep a100free 16:00:48 08:00:24 67.1 GB 0 GB 1 2(1) 1 0.01M 0.00M TIMEOUT
2026-04-30 04:47:37 01476962 779649 MyJob nlpgroup80 4-01:58:20 1-00:29:35 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-30 05:05:29 01476962 779648 MyJob nlpgroup80 4-07:11:12 1-01:47:48 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-30 06:06:14 ctzcar020 789453 Pn33B_3RU_Na_run8 vaccine 1-14:55:52 19:27:56 5.11 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-04-30 06:56:15 01476962 779650 MyJob nlpgroup80 4-08:33:04 1-02:08:16 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-30 07:59:36 lmbanr001 790779 hpo-opt_t2x_xho_hb nlpgroup 2-20:45:28 08:35:41 2.58 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 12:40:07 bhduna001 792408 nextshift-train-a100 vaccine 00:11:05 00:11:05 2.1 GB 0 GB 1 1(1) 1 512.50M 88.70M CANCELLED BY bhduna001
2026-04-30 12:40:07 bhduna001 792409 nextshift-train-a100 vaccine 00:11:05 00:11:05 1.97 GB 0 GB 1 1(1) 1 512.51M 88.72M CANCELLED BY bhduna001
2026-04-30 13:06:45 lmbanr001 792481 eval-run_mamba_sa_general_belebele_tiefix nlpgroup 01:19:28 00:09:56 2.05 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 13:26:53 lmbanr001 792559 eval-run_mamba_sa_general_belebele_probe nlpgroup 00:08:00 00:01:00 1.49 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 13:34:55 bhduna001 792413 nextshift-train-a100 vaccine 00:54:04 00:54:04 2.21 GB 0 GB 1 1(1) 1 884.96M 418.65M COMPLETED
2026-04-30 13:35:25 bhduna001 792414 nextshift-train-a100 vaccine 00:54:31 00:54:31 2.23 GB 0 GB 1 1(1) 1 918.74M 418.75M COMPLETED
2026-04-30 14:54:48 bhduna001 792612 nextshift-train-a100 vaccine 00:55:32 00:55:32 2.21 GB 0 GB 1 1(1) 1 905.86M 517.75M COMPLETED
2026-04-30 14:54:49 bhduna001 792611 nextshift-train-a100 vaccine 00:55:34 00:55:34 2.26 GB 0 GB 1 1(1) 1 905.83M 517.83M COMPLETED
2026-04-30 15:51:40 bhduna001 792614 nextshift-train-a100 vaccine 00:56:51 00:56:51 2.31 GB 0 GB 1 1(1) 1 877.68M 415.25M COMPLETED
2026-04-30 15:51:48 bhduna001 792613 nextshift-train-a100 vaccine 00:57:00 00:57:00 2.35 GB 0 GB 1 1(1) 1 919.22M 418.75M COMPLETED
2026-04-30 16:15:26 bxxjin001 792791 xpert_ckpt_retest a100free 01:27:40 00:43:50 44.45 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-04-30 16:57:22 bxxjin001 792786 p7_continue_results a100free 03:02:06 01:31:03 43.99 GB 0 GB 1 2(1) 1 0.01M 0.00M FAILED
2026-04-30 17:20:36 lmbanr001 790778 hpo-opt_afrihg_all_hb nlpgroup 6-02:58:16 18:22:17 3.94 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 18:27:22 bxxjin001 792969 p6_new_smoke a100free 06:00:00 01:30:00 44.89 GB 0 GB 1 4(1) 1 0.01M 0.00M TIMEOUT
2026-04-30 20:47:44 lmbanr001 796207 ft-pos_all_canary_hf_targets nlpgroup 00:04:32 00:00:34 1.33 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-04-30 20:48:11 lmbanr001 796209 launch_finetune.sh nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-04-30 20:59:24 lmbanr001 796208 ft-ner_all_canary_current nlpgroup 01:33:20 00:11:40 2.76 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:00:08 lmbanr001 796210 diag-796210 nlpgroup 00:05:52 00:00:44 1.21 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:00:59 lmbanr001 796211 diag-796211 nlpgroup 00:06:48 00:00:51 1.11 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:12:52 lmbanr001 796206 ft-pos_all_canary_current nlpgroup 03:25:36 00:25:42 2.78 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:30:50 lmbanr001 796212 ft-t2x_xho_hpo_selected nlpgroup 03:58:48 00:29:51 3.21 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:39:18 bxxjin001 792907 xp_p5_f1 a100free 19:44:12 04:56:03 36.4 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:40:11 lmbanr001 796216 diag-796216 nlpgroup 00:07:04 00:00:53 1.35 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:40:55 lmbanr001 796215 diag-796215 nlpgroup 00:05:52 00:00:44 1.33 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:45:51 lmbanr001 796219 eval-run_mamba_t2x_xho_hpo_selected nlpgroup 00:39:28 00:04:56 1.73 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:46:05 lmbanr001 796214 ft-ner_all_canary_hf_targets nlpgroup 02:02:00 00:15:15 2.91 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:46:57 lmbanr001 796218 diag-796218 nlpgroup 00:06:56 00:00:52 1.27 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:51:08 lmbanr001 796213 ft-pos_all_canary_hf_targets nlpgroup 05:06:08 00:38:16 2.95 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 21:51:51 lmbanr001 796217 diag-796217 nlpgroup 00:05:44 00:00:43 1.36 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 22:47:13 lmbanr001 796226 diag-796226 nlpgroup 01:16:48 00:09:36 1.41 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-30 22:54:36 lmbanr001 796230 diag-796230 nlpgroup 00:02:08 00:00:16 0 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 22:56:33 lmbanr001 796227 diag-796227 nlpgroup 01:06:08 00:08:16 1.33 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-30 22:57:58 lmbanr001 796232 diag-796232 nlpgroup 00:02:16 00:00:17 0 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-04-30 23:03:47 lmbanr001 796234 ft-news_xho_canary_current nlpgroup 00:12:32 00:01:34 2.6 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-04-30 23:03:47 lmbanr001 796235 mamba_news_canary_saveemb nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-04-30 23:17:45 bxxjin001 796225 p6_smoke_h3 a100free 04:56:32 01:14:08 37.59 GB 0 GB 4 4(1) 1 CANCELLED BY bxxjin001
2026-04-30 23:20:46 bxxjin001 796239 p6_smoke_tr a100free 00:06:00 00:01:30 3.36 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-04-30 23:26:04 lmbanr001 796231 diag-796231 nlpgroup 03:47:04 00:28:23 1.37 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01
Completion time User Job ID and name Account CPU time Wall time MaxRSS MaxVMem Tasks Cores Nodes AveRead AveWrite Exit code: status
2026-05-01 00:22:22 bxxjin001 796240 p6_pdit_dx a100free 04:01:28 01:00:22 36.99 GB 0 GB 1 4(1) 1 0.01M 0.00M TIMEOUT
2026-05-01 04:44:36 ctzcar020 791767 Pn33B_3RU_Na_run9 vaccine 1-14:24:28 19:12:14 5.11 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-05-01 05:25:15 01476962 779651 MyJob nlpgroup80 4-01:19:04 1-00:19:46 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-05-01 05:41:00 lmbanr001 790780 hpo-opt_ner_all_hb nlpgroup 8-22:46:00 1-02:50:45 2.85 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 06:15:01 01476962 789304 MyJob nlpgroup80 3-21:15:04 23:18:46 37.44 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-05-01 11:15:24 ctzcar020 790124 Pn33G_3RU_Na_run4 vaccine 3-00:37:54 1-12:18:57 5.11 GB 0 GB 1 2(1) 1 0.01M 0.00M COMPLETED
2026-05-01 11:38:59 skscla001 796350 MMS_Model_GDRO_AT nlpgroup 00:33:52 00:08:28 11.95 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-05-01 11:39:30 skscla001 796351 MMS_Model_GDRO_AT nlpgroup 00:29:36 00:07:24 6.01 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-05-01 11:39:48 skscla001 796352 MMS_Model_GDRO_AT nlpgroup 00:29:44 00:07:26 5.33 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-05-01 11:59:55 skscla001 796358 MMS_Model_GDRO_AT nlpgroup 00:29:56 00:07:29 4.47 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-05-01 12:06:29 skscla001 796362 MMS_Model_GDRO_AT nlpgroup 00:29:48 00:07:27 4.41 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-05-01 12:06:44 skscla001 796363 MMS_Model_GDRO_AT nlpgroup 00:29:12 00:07:18 5.38 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-05-01 14:57:45 skscla001 796426 MMS_Model_GDRO_AT nlpgroup 09:42:04 02:25:31 31.48 GB 0 GB 1 4(1) 1 0.01M 0.00M COMPLETED
2026-05-01 15:31:18 lmbanr001 796803 eval-run_mamba_masakhanews_eng nlpgroup 00:02:08 00:00:16 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-05-01 15:31:26 lmbanr001 796802 diag-796802 nlpgroup 00:03:20 00:00:25 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-05-01 15:31:35 lmbanr001 796804 eval-run_mamba_masakhanews_xho nlpgroup 00:02:16 00:00:17 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-05-01 15:31:42 lmbanr001 796805 eval-run_mamba_masakhanews_all nlpgroup 00:02:08 00:00:16 0 GB 0 GB 1 8(1) 1 0.01M 0.00M FAILED
2026-05-01 15:31:49 lmbanr001 796806 eval-run_mamba_sib_afr nlpgroup 00:01:52 00:00:14 0.52 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796807 eval-run_mamba_sib_eng nlpgroup 00:00:48 00:00:06 0.44 GB 0 GB 1 8(1) 1 0.01M 0.00M CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796808 tf-sib_nso nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796809 tf-sib_sot nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796810 tf-sib_xho nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796811 tf-sib_zul nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796812 tf-sib_all nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796813 tf-injongo_eng nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796814 tf-injongo_sot nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796815 tf-injongo_xho nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796816 tf-injongo_zul nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796817 tf-injongo_all nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796818 tf-pos_tsn nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796819 tf-pos_xho nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796820 tf-pos_zul nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796821 tf-pos_all nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796822 tf-ner_tsn nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796823 tf-ner_xho nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796824 tf-ner_zul nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796825 tf-ner_all nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796826 tf-afrihg_xho nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796827 tf-afrihg_zul nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796828 tf-afrihg_all nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:31:49 lmbanr001 796829 tf-t2x_xho nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:34:38 lmbanr001 796861 tf-sib_afr nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:34:38 lmbanr001 796862 tf-sib_eng nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:34:38 lmbanr001 796863 tf-sib_nso nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:34:38 lmbanr001 796864 tf-sib_sot nlpgroup 00:00:00 00:00:00 0 GB 0 GB 0 1 CANCELLED BY lmbanr001
2026-05-01 15:35:28 lmbanr001 796858 eval-run_mamba_masakhanews_eng nlpgroup 00:20:16 00:02:32 1.64 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 15:36:58 lmbanr001 796859 eval-run_mamba_masakhanews_xho nlpgroup 00:12:00 00:01:30 1.53 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 15:40:29 lmbanr001 796860 eval-run_mamba_masakhanews_all nlpgroup 00:28:08 00:03:31 1.72 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 15:41:57 lmbanr001 796865 eval-run_mamba_sib_xho nlpgroup 00:11:44 00:01:28 1.51 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 15:43:24 lmbanr001 796866 eval-run_mamba_sib_zul nlpgroup 00:11:36 00:01:27 1.52 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 15:48:16 lmbanr001 796867 eval-run_mamba_sib_all nlpgroup 00:38:56 00:04:52 1.73 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 15:59:46 lmbanr001 796857 diag-796857 nlpgroup 03:34:40 00:26:50 2.29 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 16:01:22 lmbanr001 796868 eval-run_mamba_injongointent_eng nlpgroup 01:44:48 00:13:06 2.56 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 16:12:23 lmbanr001 796869 eval-run_mamba_injongointent_sot nlpgroup 01:40:56 00:12:37 2.59 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 16:13:30 lmbanr001 796870 eval-run_mamba_injongointent_xho nlpgroup 01:36:56 00:12:07 2.57 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 16:24:33 lmbanr001 796871 eval-run_mamba_injongointent_zul nlpgroup 01:37:20 00:12:10 2.6 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 16:41:17 lmbanr001 796873 eval-run_mamba_masakhapos_tsn nlpgroup 02:13:52 00:16:44 1.59 GB 0 GB 1 8(1) 1 0.01M 0.00M COMPLETED
2026-05-01 16:44:39 mwrsim003 783915 y.sh nlpgroup80 19-07:47:40 1-22:22:46 37.43 GB 0 GB 1 10(1) 1 15.15M 0.74M FAILED
2026-05-01 16:45:09 mwrsim003 789715 y.sh nlpgroup80 00:00:00 00:00:00 0 GB 0 GB 1 10(1) 1 0.01M 0.00M FAILED