Commit 6efc7eb (verified) by nicholasKluge · 1 Parent(s): d0b018c

Update training_config_sft.yaml

Files changed (1)
  1. training_config_sft.yaml +1 -1
training_config_sft.yaml CHANGED
@@ -57,7 +57,7 @@ gradient_accumulation_steps: 4
  eval_micro_batch_size: null
  num_train_epochs: 5
  warmup_ratio: 0.1
- max_learning_rate: 0.000085
+ max_learning_rate: 0.000075
  min_learning_rate: 0.0
  muon_learning_rate: null
  weight_decay: 0.0
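
Reviewer note: the change lowers the peak learning rate from 0.000085 to 0.000075, roughly 11.8% lower, while warmup_ratio and min_learning_rate stay the same. The sketch below illustrates how these three keys might interact in a schedule. It is only an assumption that the trainer uses linear warmup followed by cosine decay; the diff does not say which scheduler is used. The function `lr_at` and the value of `TOTAL_STEPS` are hypothetical and not taken from the repository.

```python
import math


def lr_at(step, total_steps, max_lr, min_lr=0.0, warmup_ratio=0.1):
    """Learning rate at a given step.

    Assumed shape (not confirmed by the config): linear warmup up to
    max_lr, then cosine decay down to min_lr.
    """
    warmup_steps = int(total_steps * warmup_ratio)
    if warmup_steps > 0 and step < warmup_steps:
        # Linear ramp that reaches max_lr on the last warmup step.
        return max_lr * (step + 1) / warmup_steps
    decay_steps = max(1, total_steps - warmup_steps)
    progress = min(1.0, (step - warmup_steps) / decay_steps)
    return min_lr + 0.5 * (max_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


OLD_MAX_LR = 0.000085  # value removed by this commit
NEW_MAX_LR = 0.000075  # value added by this commit
TOTAL_STEPS = 1000     # hypothetical; the real step count is not in the diff

# Peak learning rate under each setting, using the unchanged keys
# warmup_ratio: 0.1 and min_learning_rate: 0.0 from the config.
peak_old = max(lr_at(s, TOTAL_STEPS, OLD_MAX_LR) for s in range(TOTAL_STEPS))
peak_new = max(lr_at(s, TOTAL_STEPS, NEW_MAX_LR) for s in range(TOTAL_STEPS))

# Relative change in the peak learning rate introduced by the commit.
relative_change = (NEW_MAX_LR - OLD_MAX_LR) / OLD_MAX_LR

print(f"peak before: {peak_old:.6f}, peak after: {peak_new:.6f}")
print(f"relative change: {relative_change:.1%}")
```

Under this assumed schedule, every step's learning rate scales by the same factor as the peak, because min_learning_rate is 0.0.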