kuleshov-group/caduceus-ph_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 • Fill-Mask • 471k • Updated Oct 20, 2025 • 48 • 1
kuleshov-group/caduceus-ph_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 • Fill-Mask • 1.93M • Updated Oct 20, 2025 • 276 • 1
kuleshov-group/caduceus-ph_seqlen-131k_d_model-256_n_layer-16 • Fill-Mask • 7.73M • Updated Oct 20, 2025 • 2.69k • 6
kuleshov-group/caduceus-ps_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 • Fill-Mask • 471k • Updated Oct 20, 2025 • 30 • 1
kuleshov-group/caduceus-ps_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 • Fill-Mask • 1.93M • Updated Oct 20, 2025 • 174 • 2
kuleshov-group/caduceus-ps_seqlen-131k_d_model-256_n_layer-16 • Fill-Mask • 7.73M • Updated Oct 20, 2025 • 1.81k • 14
kuleshov-group/bd3lm-owt-block_size1024-pretrain • Text Generation • 0.2B • Updated Mar 18, 2025 • 88 • 1