stanpony/gptnano_5M_vanilla_1e_lexinv_1e_immediate_20250910_002149 Text Generation • 17.5M • Updated Sep 10 • 4
stanpony/gptnano_5M_vanilla_transition_to_lexinvariant_linear_20250910_001519 Text Generation • 17.5M • Updated Sep 10 • 5
stanpony/gptnano_5M_vanilla_transition_to_lexinvariant_linear_20250910_001519_training Updated Sep 10
stanpony/gptnano_5M_vanilla_tke_untrained_20250906_003157 Text Generation • 17.5M • Updated Sep 6 • 5
stanpony/gptnano_5M_lexinvariant_1e_vanilla_1e_immediate_20250906_000904 Text Generation • 17.5M • Updated Sep 6 • 6
stanpony/gptnano_5M_lexinvariant_transition_linear_20250905_132506 Text Generation • 17.5M • Updated Sep 5 • 6
stanpony/gptnano_5M_lexinvariant_transition_linear_20250905_092019 Text Generation • 17.5M • Updated Sep 5 • 6
stanpony/gptnano_5M_lexinvariant_1e_vanilla_1e_20250905_091927 Text Generation • 17.5M • Updated Sep 5 • 5
stanpony/gptnano_5M_lexinvariant_transition_linear_20250904_150350 Text Generation • 17.5M • Updated Sep 4 • 7
stanpony/gptnano_5M_lexinvariant_0-5_epochs_then_vanilla_20250828_210705 Text Generation • 17.5M • Updated Aug 28 • 7
stanpony/gptnano_5M_lexinvariant_2_epochs_then_vanilla_20250826_114918_lr_1e_5_training Updated Aug 26
stanpony/gptnano_5M_lexinvariant_full_logitgate_30_pct_tke_false_20250821_183922_SPECIAL_tke_untrained Text Generation • 17.5M • Updated Aug 21 • 7
stanpony/gptnano_5M_lexinvariant_full_logitgate_30_pct_20250821_170659_SPECIAL Text Generation • 17.5M • Updated Aug 21 • 5