TitleDescriptionData TypeDefault
Maximal input sizeDefines the maximum number of tokens in each training sample.Commonly 128 to 4096 (model-dependent).Calculated automatically according to the size of the input logs.
Validation split ratioThe portion of the training data reserved for validation during training.A number between 0.01 and 0.9.0.2
Max samplesThe number of data points from the dataset used for training.Any positive integer up to the dataset size.Full dataset.
Training epochsIndicates the number of complete passes through the training dataset.Integer between 1 and 100.10
Learning rateSpecifies the step size for updating model weights during training.Typically between 1e-5 and 1e-1.2e-4
Save total limitThe total number of checkpoints to save during training.Typically between 0 and 10.3
Evaluation StepsThe number of steps between each evaluation during training.Typically between 1 and 5000.20