### Reproduction
Similar issue as the one already raised in #3983. However, that issue concerned errors during the training loop, which are indeed fixed by using the correct `transformers` version. But when `eval_on_start` is `True`, we enter evaluation before `current_gradient_accumulation_steps` is first set in the `_inner_training_loop` of the `Trainer`.

In `transformers.Trainer` this is fine, because `current_gradient_accumulation_steps` is not referenced during eval. But in TRL, `current_gradient_accumulation_steps` is used in `_compute_loss`, which is also called during eval, so the attribute lookup fails with an `AttributeError`.
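For reference, a minimal script along the following lines should reproduce the error; the model, dataset, and reward function are placeholders, not taken from my actual setup.

```python
# Hypothetical minimal reproduction: tiny prompt-only dataset and a dummy
# reward function; any small causal LM should do.
from datasets import Dataset
from trl import GRPOConfig, GRPOTrainer

dataset = Dataset.from_dict({"prompt": ["Hello", "What is 2 + 2?"] * 8})

def reward_len(completions, **kwargs):
    # Dummy reward: prefer shorter completions.
    return [-float(len(c)) for c in completions]

args = GRPOConfig(
    output_dir="grpo-repro",
    eval_strategy="steps",
    eval_steps=10,
    eval_on_start=True,  # evaluation runs before the first training step
    per_device_train_batch_size=4,
    per_device_eval_batch_size=4,
    num_generations=4,
    max_completion_length=16,
)

trainer = GRPOTrainer(
    model="Qwen/Qwen2.5-0.5B-Instruct",
    reward_funcs=reward_len,
    args=args,
    train_dataset=dataset,
    eval_dataset=dataset,
)

# Raises AttributeError: 'GRPOTrainer' object has no attribute
# 'current_gradient_accumulation_steps' during the eval_on_start evaluation.
trainer.train()
```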
Fix: I guess we need to set an initial value for `current_gradient_accumulation_steps`, at least when `eval_on_start` is `True`. (We should also check whether the current logic for eval is even correct; maybe we actually need to set `current_gradient_accumulation_steps` ourselves during eval if it can take a different value than during training.) A minimal sketch of one option follows.
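For illustration only, here is one possible workaround (a sketch, not necessarily the right place for the fix): fall back to the configured `gradient_accumulation_steps` whenever the attribute has not been set yet.

```python
# Sketch of a possible workaround in GRPOTrainer._compute_loss: fall back to
# the configured gradient_accumulation_steps when _inner_training_loop has not
# yet set the attribute (as happens during the eval_on_start evaluation).
loss = loss / getattr(
    self, "current_gradient_accumulation_steps", self.args.gradient_accumulation_steps
)
```

An alternative would be to initialize `self.current_gradient_accumulation_steps = args.gradient_accumulation_steps` in `GRPOTrainer.__init__`, so the attribute always exists before the first eval.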
Traceback:

```
Traceback (most recent call last):
  File "<...>/grpo.py", line 328, in main
    trainer.train()
  File "<...>/.venv/lib/python3.12/site-packages/transformers/trainer.py", line 2328, in train
    return inner_training_loop(
           ^^^^^^^^^^^^^^^^^^^^
  File "<...>/.venv/lib/python3.12/site-packages/transformers/trainer.py", line 2581, in _inner_training_loop
    self._evaluate(trial, ignore_keys_for_eval, skip_scheduler=True)
  File "/mnt/task_runtime/.venv/lib/python3.12/site-packages/transformers/trainer.py", line 3176, in _evaluate
    metrics = self.evaluate(ignore_keys=ignore_keys_for_eval)
              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<...>/.venv/lib/python3.12/site-packages/transformers/trainer.py", line 4469, in evaluate
    output = eval_loop(
             ^^^^^^^^^^
  File "<...>/.venv/lib/python3.12/site-packages/transformers/trainer.py", line 4665, in evaluation_loop
    losses, logits, labels = self.prediction_step(model, inputs, prediction_loss_only, ignore_keys=ignore_keys)
                             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<...>/.venv/lib/python3.12/site-packages/trl/trainer/grpo_trainer.py", line 1667, in prediction_step
    loss = self.compute_loss(model, inputs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<...>/.venv/lib/python3.12/site-packages/trl/extras/profiling.py", line 98, in wrapper
    return func(self, *args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<...>/.venv/lib/python3.12/site-packages/trl/trainer/grpo_trainer.py", line 1539, in compute_loss
    return self._compute_loss(model, inputs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<...>/.venv/lib/python3.12/site-packages/trl/trainer/grpo_trainer.py", line 1613, in _compute_loss
    loss = loss / self.current_gradient_accumulation_steps
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'GRPOTrainer' object has no attribute 'current_gradient_accumulation_steps'
```
### System Info

I am on `transformers==4.56.0` and `trl==0.22.2`.
### Checklist

- I have checked that my issue isn't already filed (see open issues)
- I have included my system information
- Any code provided is minimal, complete, and reproducible (more on MREs)
- Any code provided is properly formatted in code blocks (no screenshots; more on code blocks)
- Any traceback provided is complete