M2RNN Collection Note that the 7B models are MoE with 1.1B active parameters and 400M models are dense models • 14 items • Updated 4 days ago • 2
M2RNN Collection Note that the 7B models are MoE with 1.1B active parameters and 400M models are dense models • 14 items • Updated 4 days ago • 2