Rank 0-only logging #2608

SalmanMohammadi · 2025-05-01T13:17:20Z

Just a start on removing unnecessary duplicate logging across multiple ranks. Will update as I find more cases - @maintainers feel free to add to this branch.

root@eebe878229e8:/workspace/axolotl# axolotl train examples/llama-3/lora-1b.yml 
[2025-05-19 17:03:05,194] [INFO] [real_accelerator.py:219:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2025-05-19 17:03:05,283] [INFO] [root.spawn:77] [PID:98907] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -c /tmp/tmpnwaw2628/test.c -o /tmp/tmpnwaw2628/test.o
[2025-05-19 17:03:05,310] [INFO] [root.spawn:77] [PID:98907] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat /tmp/tmpnwaw2628/test.o -laio -o /tmp/tmpnwaw2628/a.out
[2025-05-19 17:03:05,350] [INFO] [root.spawn:77] [PID:98907] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -c /tmp/tmpc82bup43/test.c -o /tmp/tmpc82bup43/test.o
[2025-05-19 17:03:05,376] [INFO] [root.spawn:77] [PID:98907] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat /tmp/tmpc82bup43/test.o -L/usr/local/cuda -L/usr/local/cuda/lib64 -lcufile -o /tmp/tmpc82bup43/a.out
The following values were not passed to `accelerate launch` and had defaults used instead:
        `--num_processes` was set to a value of `2`
                More than one GPU was found, enabling multi-GPU training.
                If this was unintended please pass in `--num_processes=1`.
        `--num_machines` was set to a value of `1`
        `--mixed_precision` was set to a value of `'no'`
        `--dynamo_backend` was set to a value of `'no'`
To avoid this warning pass in values for each of the problematic parameters or run `accelerate config`.
[2025-05-19 17:03:16,329] [INFO] [real_accelerator.py:219:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2025-05-19 17:03:16,349] [INFO] [real_accelerator.py:219:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2025-05-19 17:03:16,418] [INFO] [root.spawn:77] [PID:99566] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -c /tmp/tmpf4ysnh66/test.c -o /tmp/tmpf4ysnh66/test.o
[2025-05-19 17:03:16,436] [INFO] [root.spawn:77] [PID:99567] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -c /tmp/tmp8bn2f8bx/test.c -o /tmp/tmp8bn2f8bx/test.o
[2025-05-19 17:03:16,438] [INFO] [root.spawn:77] [PID:99566] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat /tmp/tmpf4ysnh66/test.o -laio -o /tmp/tmpf4ysnh66/a.out
[2025-05-19 17:03:16,463] [INFO] [root.spawn:77] [PID:99567] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat /tmp/tmp8bn2f8bx/test.o -laio -o /tmp/tmp8bn2f8bx/a.out
[2025-05-19 17:03:16,468] [INFO] [root.spawn:77] [PID:99566] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -c /tmp/tmp8imc24xx/test.c -o /tmp/tmp8imc24xx/test.o
[2025-05-19 17:03:16,489] [INFO] [root.spawn:77] [PID:99566] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat /tmp/tmp8imc24xx/test.o -L/usr/local/cuda -L/usr/local/cuda/lib64 -lcufile -o /tmp/tmp8imc24xx/a.out
[2025-05-19 17:03:16,497] [INFO] [root.spawn:77] [PID:99567] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -O2 -isystem /root/miniconda3/envs/py3.11/include -fPIC -c /tmp/tmp5i3wtr1z/test.c -o /tmp/tmp5i3wtr1z/test.o
[2025-05-19 17:03:16,522] [INFO] [root.spawn:77] [PID:99567] gcc -pthread -B /root/miniconda3/envs/py3.11/compiler_compat /tmp/tmp5i3wtr1z/test.o -L/usr/local/cuda -L/usr/local/cuda/lib64 -lcufile -o /tmp/tmp5i3wtr1z/a.out
[2025-05-19 17:03:17,418] [INFO] [datasets.<module>:54] [PID:99566] PyTorch version 2.6.0+cu124 available.
[2025-05-19 17:03:17,448] [INFO] [datasets.<module>:54] [PID:99567] PyTorch version 2.6.0+cu124 available.
INFO 05-19 17:03:19 [importing.py:53] Triton module has been replaced with a placeholder.
INFO 05-19 17:03:19 [importing.py:53] Triton module has been replaced with a placeholder.
INFO 05-19 17:03:19 [__init__.py:239] Automatically detected platform cuda.
INFO 05-19 17:03:19 [__init__.py:239] Automatically detected platform cuda.
[2025-05-19 17:03:21,472] [WARNING] [axolotl.utils.schemas.config.hint_lora_8bit:846] [PID:99566] [RANK:0] We recommend setting `load_in_8bit: true` for LORA finetuning
[2025-05-19 17:03:21,556] [DEBUG] [axolotl.utils.config.resolve_dtype:65] [PID:99566] [RANK:0] bf16 support detected, enabling for this configuration.
[2025-05-19 17:03:21,794] [INFO] [axolotl.utils.config.log_gpu_memory_usage:107] [PID:99566] [RANK:0] cuda memory usage baseline: 0.000GB (+0.395GB misc)

     #@@ #@@      @@# @@#
    @@  @@          @@  @@           =@@#                               @@                 #@    =@@#.
    @@    #@@@@@@@@@    @@           #@#@=                              @@                 #@     .=@@
      #@@@@@@@@@@@@@@@@@            =@# @#     ##=     ##    =####=+    @@      =#####+  =#@@###.   @@
    @@@@@@@@@@/  +@@/  +@@          #@  =@=     #@=   @@   =@#+  +#@#   @@    =@#+  +#@#   #@.      @@
    @@@@@@@@@@  ##@@  ##@@         =@#   @#      =@# @#    @@      @@   @@    @@      #@   #@       @@
     @@@@@@@@@@@@@@@@@@@@          #@=+++#@=      =@@#     @@      @@   @@    @@      #@   #@       @@
                                  =@#=====@@     =@# @#    @@      @@   @@    @@      #@   #@       @@
    @@@@@@@@@@@@@@@@  @@@@        #@      #@=   #@=  +@@   #@#    =@#   @@.   =@#    =@#   #@.      @@
                                 =@#       @#  #@=     #@   =#@@@@#=    +#@@=  +#@@@@#=    .##@@+   @@
    @@@@  @@@@@@@@@@@@@@@@

[rank1]:[W519 17:03:22.928749665 ProcessGroupNCCL.cpp:4561] [PG ID 0 PG GUID 0 Rank 1]  using GPU 1 to perform barrier as devices used by this process are currently unknown. This can potentially cause a hang if this rank to GPU mapping is incorrect. Specify device_ids in barrier() to force use of a particular device, or call init_process_group() with a device_id.
[2025-05-19 17:03:24,947] [DEBUG] [axolotl.utils.models.load_tokenizer:461] [PID:99566] [RANK:0] EOS: 128001 / <|end_of_text|>
[2025-05-19 17:03:24,948] [DEBUG] [axolotl.utils.models.load_tokenizer:464] [PID:99566] [RANK:0] PAD: 128001 / <|end_of_text|>
[2025-05-19 17:03:24,948] [DEBUG] [axolotl.utils.models.load_tokenizer:467] [PID:99566] [RANK:0] UNK: None / None
[2025-05-19 17:03:24,948] [INFO] [axolotl.utils.models.load_tokenizer:483] [PID:99566] [RANK:0] No Chat template selected. Consider adding a chat template for easier inference.
[2025-05-19 17:03:24,948] [INFO] [axolotl.utils.data.sft.load_tokenized_prepared_datasets:280] [PID:99566] [RANK:0] Unable to find prepared dataset in last_run_prepared/87233e1e917def7122b2b113697f5e3d
[2025-05-19 17:03:24,948] [INFO] [axolotl.utils.data.sft.load_tokenized_prepared_datasets:283] [PID:99566] [RANK:0] Loading raw datasets...
[2025-05-19 17:03:24,948] [WARNING] [axolotl.utils.data.sft.load_tokenized_prepared_datasets:287] [PID:99566] [RANK:0] Processing datasets during training can lead to VRAM instability. Please pre-process your dataset.
[2025-05-19 17:03:24,948] [INFO] [axolotl.utils.data.sft.load_tokenized_prepared_datasets:294] [PID:99566] [RANK:0] No seed provided, using default seed of 42
Repo card metadata block was not found. Setting CardData to empty.
[2025-05-19 17:03:26,095] [WARNING] [huggingface_hub.repocard.content:108] [PID:99566] Repo card metadata block was not found. Setting CardData to empty.
[2025-05-19 17:03:29,931] [INFO] [axolotl.utils.data.sft.get_dataset_wrapper:509] [PID:99566] [RANK:0] Loading dataset with base_type: alpaca and prompt_style: None
[2025-05-19 17:03:31,005] [INFO] [axolotl.utils.data.utils.drop_long_seq_in_dataset:177] [PID:99566] [RANK:0] min_input_len: 33
[2025-05-19 17:03:31,005] [INFO] [axolotl.utils.data.utils.drop_long_seq_in_dataset:181] [PID:99566] [RANK:0] max_input_len: 638
[2025-05-19 17:03:33,640] [INFO] [axolotl.utils.data.sft.load_tokenized_prepared_datasets:372] [PID:99566] [RANK:0] Saving merged prepared dataset to disk... last_run_prepared/87233e1e917def7122b2b113697f5e3d
Saving the dataset (1/1 shards): 100%|████████████████████████████████████████████████████████████████████████████████████████████| 54568/54568 [00:02<00:00, 21675.57 examples/s]
[rank0]:[W519 17:03:36.010026010 ProcessGroupNCCL.cpp:4561] [PG ID 0 PG GUID 0 Rank 0]  using GPU 0 to perform barrier as devices used by this process are currently unknown. This can potentially cause a hang if this rank to GPU mapping is incorrect. Specify device_ids in barrier() to force use of a particular device, or call init_process_group() with a device_id.
[2025-05-19 17:03:37,221] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:405] [PID:99566] [RANK:0] total_num_tokens: 818_917
[2025-05-19 17:03:37,287] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:425] [PID:99566] [RANK:0] `total_supervised_tokens: 168_730`
Repo card metadata block was not found. Setting CardData to empty.
[2025-05-19 17:03:38,097] [WARNING] [huggingface_hub.repocard.content:108] [PID:99567] Repo card metadata block was not found. Setting CardData to empty.
[2025-05-19 17:03:44,063] [INFO] [axolotl.utils.samplers.multipack.calc_min_len:412] [PID:99566] [RANK:0] gather_len_batches: [201, 201]
[2025-05-19 17:03:44,065] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:481] [PID:99566] [RANK:0] data_loader_len: 50
[2025-05-19 17:03:44,078] [INFO] [axolotl.utils.trainer.calc_sample_packing_eff_est:493] [PID:99566] [RANK:0] sample_packing_eff_est across ranks: [0.9946811199188232, 0.9946811199188232]
[2025-05-19 17:03:44,079] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:505] [PID:99566] [RANK:0] sample_packing_eff_est: None
[2025-05-19 17:03:44,079] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:517] [PID:99566] [RANK:0] total_num_steps: 50
[2025-05-19 17:03:44,160] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:405] [PID:99566] [RANK:0] total_num_tokens: 8_606_215
[2025-05-19 17:03:44,528] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:425] [PID:99566] [RANK:0] `total_supervised_tokens: 6_008_433`
[2025-05-19 17:03:46,210] [INFO] [axolotl.utils.samplers.multipack.calc_min_len:412] [PID:99566] [RANK:0] gather_len_batches: [2103, 2103]
[2025-05-19 17:03:46,211] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:481] [PID:99566] [RANK:0] data_loader_len: 525
[2025-05-19 17:03:46,212] [INFO] [axolotl.utils.trainer.calc_sample_packing_eff_est:493] [PID:99566] [RANK:0] sample_packing_eff_est across ranks: [0.9991092085838318, 0.9991092085838318]
[2025-05-19 17:03:46,212] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:505] [PID:99566] [RANK:0] sample_packing_eff_est: 1.0
[2025-05-19 17:03:46,212] [DEBUG] [axolotl.utils.trainer.calculate_total_num_steps:517] [PID:99566] [RANK:0] total_num_steps: 525
[2025-05-19 17:03:46,215] [INFO] [axolotl.utils.data.sft.prepare_dataset:189] [PID:99566] [RANK:0] Maximum number of steps set at 2
[2025-05-19 17:03:46,240] [DEBUG] [axolotl.train.setup_model_and_tokenizer:63] [PID:99566] [RANK:0] loading tokenizer... NousResearch/Llama-3.2-1B
[2025-05-19 17:03:47,027] [DEBUG] [axolotl.utils.models.load_tokenizer:461] [PID:99566] [RANK:0] EOS: 128001 / <|end_of_text|>
[2025-05-19 17:03:47,027] [DEBUG] [axolotl.utils.models.load_tokenizer:464] [PID:99566] [RANK:0] PAD: 128001 / <|end_of_text|>
[2025-05-19 17:03:47,028] [DEBUG] [axolotl.utils.models.load_tokenizer:467] [PID:99566] [RANK:0] UNK: None / None
[2025-05-19 17:03:47,028] [INFO] [axolotl.utils.models.load_tokenizer:483] [PID:99566] [RANK:0] No Chat template selected. Consider adding a chat template for easier inference.
[2025-05-19 17:03:47,028] [DEBUG] [axolotl.train.setup_model_and_tokenizer:77] [PID:99566] [RANK:0] loading model and peft_config...
[2025-05-19 17:03:58,372] [INFO] [axolotl.utils.models.log_gpu_memory_usage:107] [PID:99566] [RANK:0] cuda memory usage after model load: 2.313GB (+0.014GB cache, +0.794GB misc)
[2025-05-19 17:03:58,385] [INFO] [axolotl.utils.models.load_model:1382] [PID:99566] [RANK:0] Converting modules to torch.bfloat16
trainable params: 11,272,192 || all params: 1,247,086,592 || trainable%: 0.9039
[2025-05-19 17:03:58,698] [INFO] [axolotl.utils.models.log_gpu_memory_usage:107] [PID:99566] [RANK:0] cuda memory usage after adapters: 2.355GB (+1.009GB cache, +0.794GB misc)
/workspace/axolotl/src/axolotl/core/trainers/base.py:64: FutureWarning: `tokenizer` is deprecated and will be removed in version 5.0.0 for `AxolotlTrainer.__init__`. Use `processing_class` instead.
  super().__init__(*_args, **kwargs)
No label_names provided for model class `PeftModelForCausalLM`. Since `PeftModel` hides base models input arguments, if label_names is not given, label_names can't be set automatically within `Trainer`. Note that empty label_names list will be used instead.
/workspace/axolotl/src/axolotl/core/trainers/base.py:64: FutureWarning: `tokenizer` is deprecated and will be removed in version 5.0.0 for `AxolotlTrainer.__init__`. Use `processing_class` instead.
  super().__init__(*_args, **kwargs)
No label_names provided for model class `PeftModelForCausalLM`. Since `PeftModel` hides base models input arguments, if label_names is not given, label_names can't be set automatically within `Trainer`. Note that empty label_names list will be used instead.
[2025-05-19 17:03:59,750] [INFO] [axolotl.train.save_initial_configs:377] [PID:99566] [RANK:0] Pre-saving adapter config to ./outputs/lora-out...
[2025-05-19 17:03:59,760] [INFO] [axolotl.train.save_initial_configs:381] [PID:99566] [RANK:0] Pre-saving tokenizer to ./outputs/lora-out...
[2025-05-19 17:04:00,011] [INFO] [axolotl.train.save_initial_configs:384] [PID:99566] [RANK:0] Pre-saving model config to ./outputs/lora-out...
[2025-05-19 17:04:00,031] [INFO] [axolotl.train.execute_training:215] [PID:99566] [RANK:0] Starting trainer...
[2025-05-19 17:04:02,007] [INFO] [axolotl.utils.samplers.multipack.calc_min_len:412] [PID:99566] [RANK:0] gather_len_batches: [2103, 2103]
  0%|                                                                                                                                                       | 0/2 [00:00<?, ?it/s]You're using a PreTrainedTokenizerFast tokenizer. Please note that with a fast tokenizer, using the `__call__` method is faster than using a method to encode the text followed by a call to the `pad` method to get a padded encoding.
You're using a PreTrainedTokenizerFast tokenizer. Please note that with a fast tokenizer, using the `__call__` method is faster than using a method to encode the text followed by a call to the `pad` method to get a padded encoding.
{'loss': 1.1739, 'grad_norm': 0.2724481523036957, 'learning_rate': 0.0, 'epoch': 0.0}                                                                                             
 50%|███████████████████████████████████████████████████████████████████████▌                                                                       | 1/2 [00:02<00:02,  2.43s/it][2025-05-19 17:04:06,550] [INFO] [axolotl.utils.samplers.multipack.calc_min_len:412] [PID:99566] [RANK:0] gather_len_batches: [201, 201]
{'eval_loss': 1.5100458860397339, 'eval_runtime': 38.9219, 'eval_samples_per_second': 140.204, 'eval_steps_per_second': 35.07, 'epoch': 0.0}                                      
 50%|███████████████████████████████████████████████████████████████████████▌                                                                       | 1/2 [00:41<00:02,  2.43s/it[2025-05-19 17:04:45,804] [INFO] [axolotl.utils.callbacks.log_gpu_memory_usage:107] [PID:99566] [RANK:0] cuda memory usage while training: 2.436GB (+7.439GB cache, +0.816GB misc) 
{'loss': 0.9655, 'grad_norm': 0.2586127519607544, 'learning_rate': 2e-05, 'epoch': 0.0}                                                                                           
100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:43<00:00, 25.22s/it][2025-05-19 17:04:47,704] [INFO] [axolotl.utils.samplers.multipack.calc_min_len:412] [PID:99566] [RANK:0] gather_len_batches: [201, 201]
{'eval_loss': 1.5089274644851685, 'eval_runtime': 38.8681, 'eval_samples_per_second': 140.398, 'eval_steps_per_second': 35.119, 'epoch': 0.0}                                     
{'train_runtime': 84.1981, 'train_samples_per_second': 0.19, 'train_steps_per_second': 0.024, 'train_loss': 1.0697062611579895, 'epoch': 0.0}                                     
100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2/2 [01:24<00:00, 42.06s/it]
[2025-05-19 17:05:26,322] [INFO] [axolotl.train.save_trained_model:234] [PID:99566] [RANK:0] Training completed! Saving pre-trained model to ./outputs/lora-out.

Summary by CodeRabbit

New Features
- Introduced a custom logging utility that ensures logs are emitted only from the main process in distributed environments and supports one-time warning messages.
Refactor
- Replaced standard logging throughout the application and tests with the new custom logging utility for consistent and centralized log management.
- Improved log message formatting and readability in various modules.
Chores
- Updated configuration files to specify optimizer and learning rate scheduler parameters more explicitly.
- Minor formatting and whitespace adjustments in code and configuration files.

src/axolotl/utils/logging.py

codecov · 2025-05-02T14:20:55Z

Codecov Report

Attention: Patch coverage is 67.64706% with 77 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
...c/axolotl/monkeypatch/mistral_attn_hijack_flash.py	0.00%	6 Missing ⚠️
src/axolotl/monkeypatch/unsloth_.py	0.00%	6 Missing ⚠️
src/axolotl/cli/config.py	28.57%	5 Missing ⚠️
src/axolotl/integrations/spectrum/__init__.py	0.00%	5 Missing ⚠️
src/axolotl/integrations/liger/__init__.py	33.33%	4 Missing ⚠️
src/axolotl/monkeypatch/llama_attn_hijack_flash.py	33.33%	4 Missing ⚠️
src/axolotl/core/trainers/mixins/scheduler.py	40.00%	3 Missing ⚠️
...axolotl/integrations/cut_cross_entropy/__init__.py	0.00%	3 Missing ⚠️
.../axolotl/monkeypatch/llama_attn_hijack_xformers.py	0.00%	3 Missing ⚠️
...l/prompt_strategies/bradley_terry/chat_template.py	0.00%	3 Missing ⚠️
... and 20 more

📢 Thoughts on this report? Let us know!

…to dist_logging

* ctx manager for SP * updates * update * further simplifying * simplifying * simplifying * reorg * batch api HF adapter for ring-flash-attn; cleanup and improvements * update * adding all batch ring-flash-attn methods via single adapter * fix * fixes for batch API funcs, simplify * fix * grpo sp support * progress * stronger subclassing of TRL GRPO trainer; custom distributed sampler * subclassing constructor * progress * finalizing SP + GRPO trainer * minimize diffs to GRPO trainer * remove (most of) the custom GRPO trainer logic * debug * debug * update * update * update * progress * cleanup * cleanup * minor changes * update * update * update * small changes * updates * cleanup; torch.compile ring_flash_attn functions to prevent numerical instability; lint * spacing * cleanup; log in pydantic model config only on main process * remove comment * fix sp sampler, update to latest upstream code, doc * add docs * update quartodoc autodoc contents * fix, simplifications * fixes + simplifications * review comments * lint * removing main process only logs in favor of #2608 * fixes, additional smoke test * updatse * more tests * update * fix grad accum bug (sort of) * lint, tests * todo

coderabbitai · 2025-05-15T12:31:17Z

Walkthrough

This update replaces all uses of the standard Python logging module across the codebase with a custom logging utility defined in axolotl.utils.logging. The new utility ensures logging occurs only on the main process in distributed environments and provides features such as warning suppression. Logger initialization and usage are standardized throughout, with minor formatting and readability improvements in some log messages. Additionally, the example configuration for llama-3 LoRA training was updated to remove an obsolete path and explicitly specify optimizer and learning rate scheduler parameters.

Changes

Files/Groups	Change Summary
src/axolotl/utils/logging.py	Introduced a custom logging utility with main-process-only and warning-once features.
src/axolotl/, src/axolotl/cli/, src/axolotl/core/, src/axolotl/loaders/, src/axolotl/monkeypatch/, src/axolotl/prompt_strategies/, src/axolotl/utils/**, src/axolotl/processing_strategies.py	Replaced standard `logging` with custom `get_logger` import and initialization; updated log calls.
src/axolotl/utils/quantization.py, src/axolotl/cli/train.py	Removed unused logging imports and logger initializations.
src/axolotl/utils/data/sft.py, src/axolotl/utils/data/rl.py	Minor log message formatting and seed handling improvements.
src/axolotl/prompt_strategies/chat_template.py, src/axolotl/prompt_strategies/bradley_terry/chat_template.py	Updated log level setting to use string names; minor code formatting for readability.
examples/llama-3/lora-1b.yml	Removed `dataset_prepared_path`; added explicit `optimizer` and `lr_scheduler` fields.
tests/**	Replaced standard logging with custom logger in all test modules; updated logger initialization.

Sequence Diagram(s)

sequenceDiagram
    participant Module
    participant axolotl.utils.logging
    participant MainProcess

    Module->>axolotl.utils.logging: get_logger(__name__)
    axolotl.utils.logging-->>Module: MultiProcessAdapter instance

    Module->>MultiProcessAdapter: LOG.info("message")
    MultiProcessAdapter->>MainProcess: Check if main process
    alt Is main process
        MultiProcessAdapter-->>Module: Emit log
    else Not main process
        MultiProcessAdapter-->>Module: Suppress log
    end

    Module->>MultiProcessAdapter: LOG.warning_once("message")
    MultiProcessAdapter->>MultiProcessAdapter: Check cache
    alt Not warned before
        MultiProcessAdapter-->>Module: Emit warning
    else Already warned
        MultiProcessAdapter-->>Module: Suppress duplicate warning
    end

Suggested reviewers

winglian

Poem

📜 Recent review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between fb5e1d1 and cef31bc.

📒 Files selected for processing (1)

src/axolotl/utils/schemas/config.py (2 hunks)

✅ Files skipped from review due to trivial changes (1)

src/axolotl/utils/schemas/config.py

⏰ Context from checks skipped due to timeout of 90000ms (11)

GitHub Check: PyTest (3.11, 2.5.1)
GitHub Check: PyTest from Source Dist (3.11, 2.6.0)
GitHub Check: pre-commit
GitHub Check: PyTest (3.11, 2.7.0)
GitHub Check: PyTest from Source Dist (3.11, 2.5.1)
GitHub Check: PyTest from Source Dist (3.11, 2.7.0)
GitHub Check: PyTest (3.11, 2.6.0)
GitHub Check: test-axolotl-multigpu (124, 12.4.1, 3.11, 2.6.0, vllm, 2, true)
GitHub Check: test-axolotl-multigpu (126, 12.6.3, 3.11, 2.7.0, 2, true)
GitHub Check: test-axolotl-multigpu (124, 12.4.1, 3.11, 2.5.1, 2, true)
GitHub Check: pre-commit

✨ Finishing Touches

📝 Generate Docstrings

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Explain this complex logic.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai explain this code block.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and explain its main purpose.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR.
@coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 11

🔭 Outside diff range comments (2)

src/axolotl/monkeypatch/transformers_fa_utils.py (1)

src/axolotl/monkeypatch/lora_kernels.py (1)

378-380: ⚠️ Potential issue

Remove unsupported warning_once calls
MultiProcessAdapter does not expose warning_once, causing lint errors. Replace these calls with standard LOG.warning(...) or implement a one-time warning helper.

Proposed diff:

-                    LOG.warning_once(
-                        "Cannot patch some attention QKV projections - requires LoRA adapters with no bias"
-                    )
+                    LOG.warning(
+                        "Cannot patch some attention QKV projections - requires LoRA adapters with no bias"
+                    )
...
-                    LOG.warning_once(
-                        "Cannot patch some attention output projection - requires LoRA adapters with no bias"
-                    )
+                    LOG.warning(
+                        "Cannot patch some attention output projection - requires LoRA adapters with no bias"
+                    )
...
-                    LOG.warning_once(
-                        "Cannot patch some MLP layers - requires LoRA adapters with no bias"
-                    )
+                    LOG.warning(
+                        "Cannot patch some MLP layers - requires LoRA adapters with no bias"
+                    )

Also applies to: 396-398, 413-415

🧰 Tools

🪛 GitHub Actions: lint

[error] 378-413: pylint E1101: Instance of 'MultiProcessAdapter' has no 'warning_once' member at lines 378, 396, and 413.

🧹 Nitpick comments (47)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
Cache: Disabled due to data retention organization setting
Knowledge Base: Disabled due to data retention organization setting

📥 Commits

Reviewing files that changed from the base of the PR and between a27b909 and e930ba7.

📒 Files selected for processing (128)

examples/llama-3/lora-1b.yml (2 hunks)
examples/llama-3/qlora-1b-kto.yaml (1 hunks)
src/axolotl/cli/checks.py (1 hunks)
src/axolotl/cli/config.py (2 hunks)
src/axolotl/cli/evaluate.py (2 hunks)
src/axolotl/cli/inference.py (2 hunks)
src/axolotl/cli/main.py (2 hunks)
src/axolotl/cli/merge_lora.py (2 hunks)
src/axolotl/cli/merge_sharded_fsdp_weights.py (2 hunks)
src/axolotl/cli/preprocess.py (4 hunks)
src/axolotl/cli/train.py (0 hunks)
src/axolotl/cli/utils.py (2 hunks)
src/axolotl/common/datasets.py (4 hunks)
src/axolotl/core/chat/messages.py (2 hunks)
src/axolotl/core/trainer_builder.py (8 hunks)
src/axolotl/core/trainers/base.py (5 hunks)
src/axolotl/core/trainers/grpo/__init__.py (3 hunks)
src/axolotl/core/trainers/mixins/optimizer.py (3 hunks)
src/axolotl/core/trainers/mixins/rng_state_loader.py (2 hunks)
src/axolotl/core/trainers/mixins/scheduler.py (5 hunks)
src/axolotl/datasets.py (3 hunks)
src/axolotl/integrations/base.py (2 hunks)
src/axolotl/integrations/cut_cross_entropy/__init__.py (2 hunks)
src/axolotl/integrations/cut_cross_entropy/args.py (1 hunks)
src/axolotl/integrations/grokfast/__init__.py (1 hunks)
src/axolotl/integrations/liger/__init__.py (3 hunks)
src/axolotl/integrations/liger/args.py (1 hunks)
src/axolotl/integrations/llm_compressor/plugin.py (2 hunks)
src/axolotl/integrations/spectrum/__init__.py (2 hunks)
src/axolotl/monkeypatch/accelerate/fsdp2.py (1 hunks)
src/axolotl/monkeypatch/btlm_attn_hijack_flash.py (1 hunks)
src/axolotl/monkeypatch/llama_attn_hijack_flash.py (6 hunks)
src/axolotl/monkeypatch/llama_attn_hijack_xformers.py (1 hunks)
src/axolotl/monkeypatch/lora_kernels.py (2 hunks)
src/axolotl/monkeypatch/mistral_attn_hijack_flash.py (8 hunks)
src/axolotl/monkeypatch/peft/utils.py (1 hunks)
src/axolotl/monkeypatch/relora.py (5 hunks)
src/axolotl/monkeypatch/stablelm_attn_hijack_flash.py (1 hunks)
src/axolotl/monkeypatch/trainer/lr.py (1 hunks)
src/axolotl/monkeypatch/trainer_accelerator_args.py (2 hunks)
src/axolotl/monkeypatch/trainer_eval_guard.py (2 hunks)
src/axolotl/monkeypatch/trainer_fsdp_optim.py (2 hunks)
src/axolotl/monkeypatch/transformers_fa_utils.py (1 hunks)
src/axolotl/monkeypatch/unsloth_.py (3 hunks)
src/axolotl/prompt_strategies/__init__.py (1 hunks)
src/axolotl/prompt_strategies/base.py (1 hunks)
src/axolotl/prompt_strategies/bradley_terry/__init__.py (1 hunks)
src/axolotl/prompt_strategies/bradley_terry/chat_template.py (1 hunks)
src/axolotl/prompt_strategies/chat_template.py (3 hunks)
src/axolotl/prompt_strategies/llama2_chat.py (2 hunks)
src/axolotl/prompt_strategies/messages/__init__.py (1 hunks)
src/axolotl/prompt_strategies/metharme.py (1 hunks)
src/axolotl/prompt_strategies/pygmalion.py (3 hunks)
src/axolotl/prompt_tokenizers.py (4 hunks)
src/axolotl/prompters.py (2 hunks)
src/axolotl/train.py (2 hunks)
src/axolotl/utils/callbacks/__init__.py (9 hunks)
src/axolotl/utils/callbacks/comet_.py (2 hunks)
src/axolotl/utils/callbacks/lisa.py (2 hunks)
src/axolotl/utils/callbacks/mlflow_.py (2 hunks)
src/axolotl/utils/chat_templates.py (2 hunks)
src/axolotl/utils/comet_.py (1 hunks)
src/axolotl/utils/config/__init__.py (6 hunks)
src/axolotl/utils/data/pretraining.py (2 hunks)
src/axolotl/utils/data/rl.py (4 hunks)
src/axolotl/utils/data/sft.py (10 hunks)
src/axolotl/utils/data/utils.py (4 hunks)
src/axolotl/utils/distributed.py (1 hunks)
src/axolotl/utils/gradient_checkpointing/offload_disk.py (1 hunks)
src/axolotl/utils/logging.py (1 hunks)
src/axolotl/utils/models.py (19 hunks)
src/axolotl/utils/samplers/multipack.py (4 hunks)
src/axolotl/utils/schemas/config.py (34 hunks)
src/axolotl/utils/schemas/deprecated.py (1 hunks)
src/axolotl/utils/schemas/integrations.py (1 hunks)
src/axolotl/utils/schemas/model.py (1 hunks)
src/axolotl/utils/schemas/training.py (2 hunks)
src/axolotl/utils/schemas/utils.py (3 hunks)
src/axolotl/utils/tokenization.py (1 hunks)
src/axolotl/utils/trainer.py (4 hunks)
tests/e2e/multigpu/solo/test_flex.py (1 hunks)
tests/e2e/multigpu/test_eval.py (1 hunks)
tests/e2e/multigpu/test_gemma3.py (1 hunks)
tests/e2e/multigpu/test_llama.py (1 hunks)
tests/e2e/multigpu/test_qwen2.py (1 hunks)
tests/e2e/multigpu/test_ray.py (1 hunks)
tests/e2e/patched/test_4d_multipack_llama.py (1 hunks)
tests/e2e/patched/test_fa_xentropy.py (1 hunks)
tests/e2e/patched/test_falcon_samplepack.py (1 hunks)
tests/e2e/patched/test_fused_llama.py (1 hunks)
tests/e2e/patched/test_llama_s2_attention.py (1 hunks)
tests/e2e/patched/test_lora_llama_multipack.py (1 hunks)
tests/e2e/patched/test_mistral_samplepack.py (1 hunks)
tests/e2e/patched/test_mixtral_samplepack.py (1 hunks)
tests/e2e/patched/test_phi_multipack.py (1 hunks)
tests/e2e/patched/test_resume.py (1 hunks)
tests/e2e/patched/test_unsloth_qlora.py (1 hunks)
tests/e2e/solo/test_flex.py (1 hunks)
tests/e2e/solo/test_relora_llama.py (1 hunks)
tests/e2e/test_deepseekv3.py (1 hunks)
tests/e2e/test_dpo.py (1 hunks)
tests/e2e/test_embeddings_lr.py (1 hunks)
tests/e2e/test_falcon.py (1 hunks)
tests/e2e/test_gemma2.py (1 hunks)
tests/e2e/test_gemma3_text.py (1 hunks)
tests/e2e/test_llama.py (1 hunks)
tests/e2e/test_llama_pretrain.py (1 hunks)
tests/e2e/test_llama_vision.py (1 hunks)
tests/e2e/test_lora_llama.py (1 hunks)
tests/e2e/test_mamba.py (1 hunks)
tests/e2e/test_mistral.py (1 hunks)
tests/e2e/test_mixtral.py (1 hunks)
tests/e2e/test_optimizers.py (1 hunks)
tests/e2e/test_packing_loss.py (1 hunks)
tests/e2e/test_phi.py (1 hunks)
tests/e2e/test_process_reward_model_smollm2.py (1 hunks)
tests/e2e/test_qwen.py (1 hunks)
tests/e2e/test_reward_model_smollm2.py (1 hunks)
tests/e2e/test_schedulers.py (1 hunks)
tests/integrations/test_liger.py (3 hunks)
tests/patched/test_validation.py (15 hunks)
tests/prompt_strategies/messages/test_chat.py (1 hunks)
tests/prompt_strategies/test_chat_templates.py (1 hunks)
tests/prompt_strategies/test_chat_templates_advanced.py (1 hunks)
tests/prompt_strategies/test_chat_templates_thinking.py (1 hunks)
tests/prompt_strategies/test_jinja_template_analyzer.py (5 hunks)
tests/test_prompt_tokenizers.py (6 hunks)
update_logging.py (1 hunks)

💤 Files with no reviewable changes (1)

src/axolotl/cli/train.py

🧰 Additional context used

🧬 Code Graph Analysis (101)

src/axolotl/monkeypatch/peft/utils.py (1)

src/axolotl/monkeypatch/stablelm_attn_hijack_flash.py (1)

src/axolotl/monkeypatch/accelerate/fsdp2.py (1)

tests/e2e/patched/test_unsloth_qlora.py (2)

src/axolotl/cli/inference.py (1)

src/axolotl/core/chat/messages.py (1)

src/axolotl/monkeypatch/trainer_accelerator_args.py (2)

src/axolotl/integrations/cut_cross_entropy/__init__.py (1)

tests/e2e/test_embeddings_lr.py (2)

src/axolotl/core/trainers/mixins/rng_state_loader.py (1)

src/axolotl/cli/merge_sharded_fsdp_weights.py (1)

src/axolotl/cli/evaluate.py (1)

tests/e2e/test_falcon.py (2)

src/axolotl/utils/schemas/model.py (1)

src/axolotl/cli/utils.py (1)

src/axolotl/utils/schemas/deprecated.py (1)

src/axolotl/cli/config.py (1)

tests/e2e/solo/test_flex.py (2)

tests/e2e/test_optimizers.py (2)

src/axolotl/integrations/cut_cross_entropy/args.py (1)

src/axolotl/utils/callbacks/lisa.py (1)

src/axolotl/cli/checks.py (1)

src/axolotl/monkeypatch/unsloth_.py (1)

tests/e2e/test_schedulers.py (2)

tests/e2e/patched/test_resume.py (2)

tests/e2e/multigpu/solo/test_flex.py (2)

src/axolotl/monkeypatch/llama_attn_hijack_xformers.py (1)

src/axolotl/integrations/llm_compressor/plugin.py (1)

tests/e2e/multigpu/test_qwen2.py (1)

tests/e2e/test_reward_model_smollm2.py (2)

tests/e2e/patched/test_phi_multipack.py (2)

tests/e2e/test_process_reward_model_smollm2.py (2)

tests/e2e/test_llama_vision.py (2)

src/axolotl/cli/merge_lora.py (1)

src/axolotl/utils/schemas/training.py (1)

tests/e2e/patched/test_fused_llama.py (2)

tests/e2e/test_lora_llama.py (2)

tests/e2e/test_qwen.py (1)

tests/e2e/patched/test_mixtral_samplepack.py (2)

src/axolotl/prompt_strategies/__init__.py (1)

src/axolotl/utils/schemas/integrations.py (1)

tests/e2e/test_llama_pretrain.py (2)

tests/e2e/patched/test_mistral_samplepack.py (2)

tests/e2e/test_dpo.py (2)

src/axolotl/utils/callbacks/comet_.py (1)

src/axolotl/utils/callbacks/mlflow_.py (1)

src/axolotl/utils/comet_.py (2)

tests/e2e/patched/test_fa_xentropy.py (2)

tests/e2e/patched/test_4d_multipack_llama.py (2)

tests/e2e/test_gemma3_text.py (1)

src/axolotl/monkeypatch/transformers_fa_utils.py (1)

src/axolotl/integrations/liger/args.py (1)

tests/e2e/test_mixtral.py (2)

src/axolotl/prompt_strategies/llama2_chat.py (1)

tests/e2e/patched/test_lora_llama_multipack.py (2)

src/axolotl/utils/data/pretraining.py (1)

tests/e2e/patched/test_llama_s2_attention.py (2)

src/axolotl/integrations/grokfast/__init__.py (2)

tests/e2e/multigpu/test_gemma3.py (2)

src/axolotl/utils/tokenization.py (1)

src/axolotl/prompt_strategies/base.py (1)

tests/e2e/multigpu/test_llama.py (2)

tests/e2e/patched/test_falcon_samplepack.py (2)

src/axolotl/utils/gradient_checkpointing/offload_disk.py (1)

tests/e2e/test_llama.py (2)

src/axolotl/datasets.py (1)

src/axolotl/prompt_strategies/bradley_terry/chat_template.py (1)

src/axolotl/prompt_strategies/messages/__init__.py (1)

tests/e2e/test_mamba.py (2)

src/axolotl/monkeypatch/trainer_fsdp_optim.py (2)

src/axolotl/core/trainers/mixins/scheduler.py (1)

src/axolotl/integrations/spectrum/__init__.py (2)

src/axolotl/utils/data/utils.py (3)

src/axolotl/monkeypatch/trainer/lr.py (1)

src/axolotl/prompters.py (1)

src/axolotl/integrations/base.py (1)

tests/e2e/test_mistral.py (2)

src/axolotl/utils/trainer.py (1)

tests/e2e/test_gemma2.py (1)

tests/e2e/test_phi.py (2)

tests/prompt_strategies/test_chat_templates.py (1)

tests/integrations/test_liger.py (1)

src/axolotl/prompt_strategies/bradley_terry/__init__.py (1)

src/axolotl/monkeypatch/lora_kernels.py (1)

src/axolotl/monkeypatch/relora.py (1)

src/axolotl/prompt_tokenizers.py (2)

src/axolotl/core/trainers/grpo/__init__.py (1)

src/axolotl/monkeypatch/trainer_eval_guard.py (2)

tests/prompt_strategies/test_chat_templates_thinking.py (1)

src/axolotl/utils/schemas/utils.py (1)

tests/prompt_strategies/test_jinja_template_analyzer.py (1)

src/axolotl/common/datasets.py (1)

src/axolotl/core/trainers/base.py (1)

src/axolotl/prompt_strategies/chat_template.py (2)

src/axolotl/utils/logging.py (2)

src/axolotl/monkeypatch/llama_attn_hijack_flash.py (2)

src/axolotl/utils/callbacks/__init__.py (1)

src/axolotl/monkeypatch/mistral_attn_hijack_flash.py (2)

src/axolotl/train.py (1)

src/axolotl/utils/samplers/multipack.py (1)

src/axolotl/utils/chat_templates.py (1)

🪛 GitHub Actions: lint

src/axolotl/monkeypatch/accelerate/fsdp2.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/cli/inference.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/monkeypatch/trainer_accelerator_args.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/integrations/cut_cross_entropy/__init__.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/core/trainers/mixins/rng_state_loader.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/cli/merge_sharded_fsdp_weights.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/cli/evaluate.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/schemas/model.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/prompt_strategies/metharme.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/cli/utils.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/schemas/deprecated.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/cli/config.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/monkeypatch/btlm_attn_hijack_flash.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/integrations/cut_cross_entropy/args.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/callbacks/lisa.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/cli/checks.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/integrations/llm_compressor/plugin.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/cli/merge_lora.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/schemas/training.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/prompt_strategies/__init__.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/schemas/integrations.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/callbacks/comet_.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/callbacks/mlflow_.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/comet_.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/monkeypatch/transformers_fa_utils.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

[error] 50-50: pylint E1101: Instance of 'MultiProcessAdapter' has no 'warning_once' member.

src/axolotl/integrations/liger/args.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/data/pretraining.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/integrations/grokfast/__init__.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/tokenization.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/prompt_strategies/base.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/cli/preprocess.py

[error] 1-1: Pre-commit hook 'black' reformatted this file to fix code style issues.

src/axolotl/datasets.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/prompt_strategies/messages/__init__.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/monkeypatch/trainer_fsdp_optim.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/core/trainers/mixins/scheduler.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/monkeypatch/trainer/lr.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/prompters.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/integrations/base.py

[warning] 27-27: pylint W0611: Unused import 'SchedulerType' from transformers.trainer_utils.

tests/prompt_strategies/test_chat_templates.py

[warning] 25-39: pylint duplicate code detected with tests.prompt_strategies.messages.test_chat.

[warning] 159-167: pylint duplicate code detected with tests.prompt_strategies.messages.test_chat.

[warning] 55-74: pylint duplicate code detected with tests.prompt_strategies.messages.test_chat.

src/axolotl/prompt_strategies/bradley_terry/__init__.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/monkeypatch/lora_kernels.py

[error] 378-413: pylint E1101: Instance of 'MultiProcessAdapter' has no 'warning_once' member at lines 378, 396, and 413.

src/axolotl/monkeypatch/relora.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/core/trainers/mixins/optimizer.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/prompt_tokenizers.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/prompt_strategies/pygmalion.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/core/trainers/grpo/__init__.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

[error] 48-50: mypy error: 'TRLConfig' has no attribute 'vllm'.

src/axolotl/monkeypatch/trainer_eval_guard.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

tests/prompt_strategies/test_chat_templates_thinking.py

[error] 10-10: mypy errors: Module 'axolotl.prompt_strategies.jinja_template_analyzer' has no attribute 'PromptComponentStatus' and 'PromptTemplateAnalyzer'.

tests/prompt_strategies/messages/test_chat.py

[error] 3-8: flake8 F401: Multiple unused imports including 'os', 'transformers.AutoTokenizer', and several unused imports from axolotl.core.chat.messages.

[warning] 49-68: pylint duplicate code detected with tests.prompt_strategies.test_chat_templates.

[warning] 28-42: pylint duplicate code detected with tests.prompt_strategies.test_chat_templates.

src/axolotl/utils/schemas/utils.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

tests/test_prompt_tokenizers.py

[error] 3-221: flake8 and pylint errors: multiple unused imports (e.g., 'unittest', 'transformers.AutoTokenizer'), undefined names (e.g., 'NoSystemPrompter', 'SystemDataPrompter', 'InstructionWSystemPromptTokenizingStrategy', 'Path', 'json', 'Llama2ChatPrompter', 'LLama2ChatTokenizingStrategy', 'load', 'DictDefault'), and E0611 import errors for missing attributes in modules 'axolotl.prompt_tokenizers' and 'axolotl.prompters'.

[error] 9-15: mypy errors: Module 'axolotl.prompt_tokenizers' has no attribute 'ShareGPTPromptTokenizingStrategy'; Module 'axolotl.prompters' has no attribute 'AlpacaInstructionPrompter' and 'ShareGPTPrompter'.

src/axolotl/utils/data/sft.py

[error] 1-1: Pre-commit hook 'trailing-whitespace' failed and fixed trailing whitespace issues in this file.

tests/prompt_strategies/test_jinja_template_analyzer.py

[error] 7-7: flake8 E0611: No name 'PromptComponentStatus' and 'PromptTemplateAnalyzer' in module 'axolotl.prompt_strategies.jinja_template_analyzer'.

src/axolotl/common/datasets.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/data/rl.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/core/trainers/base.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/prompt_strategies/chat_template.py

[warning] 546-546: flake8: line break after binary operator (W504)

src/axolotl/monkeypatch/llama_attn_hijack_flash.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

update_logging.py

[warning] 9-56: pylint warnings: multiple redefined-outer-name warnings for variables 'dry_run', 'base_dir', 'updated_files', 'skipped_files', 'file'; too many nested blocks (6/5); invalid constant name 'base_dir'; unused import 'Path'; and duplicate code detected with tests.prompt_strategies.messages.test_chat and tests.prompt_strategies.test_chat_templates.

src/axolotl/utils/callbacks/__init__.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/monkeypatch/mistral_attn_hijack_flash.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/models.py

[warning] 1384-1384: flake8: line break after binary operator (W504)

src/axolotl/utils/config/__init__.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/core/trainer_builder.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

src/axolotl/utils/samplers/multipack.py

[warning] 9-9: flake8 F401: Unused import 'multiprocessing.get_context'.

src/axolotl/utils/schemas/config.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

[error] 130-130: mypy error: Value of type variable 'SupportsRichComparisonT' of 'min' cannot be 'int | None'.

src/axolotl/utils/chat_templates.py

[error] 1-1: Pre-commit hook 'isort' fixed import sorting issues in this file.

🪛 Ruff (0.11.9)

src/axolotl/integrations/base.py

27-27: transformers.trainer_utils.SchedulerType imported but unused

Remove unused import: transformers.trainer_utils.SchedulerType

(F401)

tests/prompt_strategies/messages/test_chat.py

3-3: os imported but unused

Remove unused import: os

(F401)

6-6: transformers.AutoTokenizer imported but unused

Remove unused import: transformers.AutoTokenizer

(F401)

9-9: axolotl.core.chat.messages.ChatFormattedChats imported but unused

Remove unused import

(F401)

10-10: axolotl.core.chat.messages.Chats imported but unused

Remove unused import

(F401)

11-11: axolotl.core.chat.messages.MessageContents imported but unused

Remove unused import

(F401)

12-12: axolotl.core.chat.messages.MessageContentTypes imported but unused

Remove unused import

(F401)

13-13: axolotl.core.chat.messages.MessageRoles imported but unused

Remove unused import

(F401)

14-14: axolotl.core.chat.messages.Messages imported but unused

Remove unused import

(F401)

tests/test_prompt_tokenizers.py

3-3: unittest imported but unused

Remove unused import: unittest

(F401)

6-6: transformers.AutoTokenizer imported but unused

Remove unused import: transformers.AutoTokenizer

(F401)

11-11: axolotl.prompt_tokenizers.InstructionPromptTokenizingStrategy imported but unused

Remove unused import

(F401)

12-12: axolotl.prompt_tokenizers.PromptTokenizingStrategy imported but unused

Remove unused import

(F401)

13-13: axolotl.prompt_tokenizers.ShareGPTPromptTokenizingStrategy imported but unused

Remove unused import

(F401)

15-15: axolotl.prompters.AlpacaInstructionPrompter imported but unused

Remove unused import

(F401)

15-15: axolotl.prompters.ShareGPTPrompter imported but unused

Remove unused import

(F401)

tests/prompt_strategies/test_jinja_template_analyzer.py

3-3: os imported but unused

Remove unused import: os

(F401)

8-8: axolotl.prompt_strategies.jinja_template_analyzer.PromptComponentStatus imported but unused

Remove unused import: axolotl.prompt_strategies.jinja_template_analyzer.PromptComponentStatus

(F401)

src/axolotl/utils/logging.py

28-29: Use a single if statement instead of nested if statements

Combine if statements using and

(SIM102)

update_logging.py

9-9: pathlib.Path imported but unused

Remove unused import: pathlib.Path

(F401)

86-89: Use ternary operator base_dir = sys.argv[1] if len(sys.argv) > 1 else "tests" instead of if-else-block

Replace if-else-block with base_dir = sys.argv[1] if len(sys.argv) > 1 else "tests"

(SIM108)

src/axolotl/monkeypatch/mistral_attn_hijack_flash.py

168-168: Do not call getattr with a constant attribute value. It is not any safer than normal property access.

Replace getattr with attribute access

(B009)

src/axolotl/core/trainer_builder.py

860-860: Test for membership should be not in

Convert to not in

(E713)

⏰ Context from checks skipped due to timeout of 90000ms (2)

GitHub Check: PyTest from Source Dist (3.11, 2.7.0)
GitHub Check: PyTest (3.11, 2.6.0)

🔇 Additional comments (243)

src/axolotl/monkeypatch/trainer/lr.py

src/axolotl/core/trainers/mixins/rng_state_loader.py

src/axolotl/integrations/llm_compressor/plugin.py

coderabbitai · 2025-05-19T17:49:15Z

src/axolotl/core/trainers/grpo/__init__.py

+            # type: ignore[attr-defined]
+            grpo_args_kwargs["vllm_server_host"] = trl.vllm_server_host or trl.vllm.host
+            # type: ignore[attr-defined]
+            grpo_args_kwargs["vllm_server_port"] = trl.vllm_server_port or trl.vllm.port


⚠️ Potential issue

Address missing vllm attributes in TRLConfig
The # type: ignore[attr-defined] comments hide the root cause: TRLConfig lacks a vllm field, leading to the mypy error TRLConfig has no attribute 'vllm'. To fix:

Update the TRLConfig schema to include a vllm object with host, port, etc., or

Add runtime guards before accessing trl.vllm_*.

This change is critical to restore type safety and prevent silent failures.

🧰 Tools

🪛 GitHub Actions: lint

[error] 48-50: mypy error: 'TRLConfig' has no attribute 'vllm'.

🤖 Prompt for AI Agents

In src/axolotl/core/trainers/grpo/__init__.py around lines 47 to 50, the code accesses attributes on trl.vllm which are missing from the TRLConfig type definition, causing mypy errors. To fix this, update the TRLConfig schema to include a vllm object with the necessary fields like host and port, ensuring type safety. Alternatively, add runtime checks to confirm these attributes exist on trl before accessing them, preventing attribute errors and preserving type correctness.

tests/prompt_strategies/messages/test_chat.py

tests/prompt_strategies/test_jinja_template_analyzer.py

coderabbitai · 2025-05-19T17:49:16Z

tests/prompt_strategies/test_jinja_template_analyzer.py

+from axolotl.prompt_strategies.jinja_template_analyzer import (
+    PromptComponentStatus,
+    PromptTemplateAnalyzer,
+)


💡 Verification agent

🧩 Analysis chain

Fix incorrect imports causing build failures.

The pipeline is failing because it can't find PromptComponentStatus and PromptTemplateAnalyzer in the specified module. Check if these classes exist or if their names have changed.

🏁 Script executed:

#!/bin/bash # Verify if these classes exist in the module rg -A 1 "class PromptComponentStatus|class PromptTemplateAnalyzer" src/axolotl/prompt_strategies/

Length of output: 99

🏁 Script executed:

#!/bin/bash # Locate the jinja_template_analyzer file fd jinja_template_analyzer.py # List all class definitions in that file rg -n "^class " src/axolotl/prompt_strategies/jinja_template_analyzer.py # Search for the specific class names across the entire repo rg -n "PromptComponentStatus" . rg -n "PromptTemplateAnalyzer" .

Length of output: 988

Update test imports to match refactored class names

The tests are importing classes that no longer exist in jinja_template_analyzer.py. You need to update the imports and usages in tests/prompt_strategies/test_jinja_template_analyzer.py:

• At the top of the file (lines 7–10), replace:

-from axolotl.prompt_strategies.jinja_template_analyzer import ( - PromptComponentStatus, - PromptTemplateAnalyzer, -) +from axolotl.prompt_strategies.jinja_template_analyzer import ( + JinjaTemplateAnalysis, + JinjaTemplateAnalyzer, +)

• Throughout the test, rename all occurrences of PromptComponentStatus → JinjaTemplateAnalysis and PromptTemplateAnalyzer → JinjaTemplateAnalyzer.

This will align the tests with the current class definitions in src/axolotl/prompt_strategies/jinja_template_analyzer.py.

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

from axolotl.prompt_strategies.jinja_template_analyzer import (

PromptComponentStatus,

PromptTemplateAnalyzer,

)

from axolotl.prompt_strategies.jinja_template_analyzer import (

JinjaTemplateAnalysis,

JinjaTemplateAnalyzer,

)

🧰 Tools

🪛 Ruff (0.11.9)

8-8: axolotl.prompt_strategies.jinja_template_analyzer.PromptComponentStatus imported but unused

Remove unused import: axolotl.prompt_strategies.jinja_template_analyzer.PromptComponentStatus

(F401)

🪛 GitHub Actions: lint

[error] 7-7: flake8 E0611: No name 'PromptComponentStatus' and 'PromptTemplateAnalyzer' in module 'axolotl.prompt_strategies.jinja_template_analyzer'.

🤖 Prompt for AI Agents

In tests/prompt_strategies/test_jinja_template_analyzer.py around lines 7 to 10, the imports for PromptComponentStatus and PromptTemplateAnalyzer are incorrect due to refactoring. Replace these imports with JinjaTemplateAnalysis and JinjaTemplateAnalyzer respectively, and update all occurrences of PromptComponentStatus to JinjaTemplateAnalysis and PromptTemplateAnalyzer to JinjaTemplateAnalyzer throughout the test file to match the current class names in the source module.

coderabbitai · 2025-05-19T17:49:16Z

tests/test_prompt_tokenizers.py

+import unittest

-from axolotl.prompt_strategies.alpaca_chat import NoSystemPrompter
-from axolotl.prompt_strategies.alpaca_w_system import (
-    InstructionWSystemPromptTokenizingStrategy,
-    SystemDataPrompter,
-)
-from axolotl.prompt_strategies.llama2_chat import (
-    Llama2ChatPrompter,
-    LLama2ChatTokenizingStrategy,
-)
-from axolotl.prompt_strategies.orpo.chat_template import load
-from axolotl.prompt_tokenizers import AlpacaPromptTokenizingStrategy
-from axolotl.prompters import AlpacaPrompter, PromptStyle
-from axolotl.utils.dict import DictDefault
+import pytest
+from transformers import AutoTokenizer

-from tests.hf_offline_utils import enable_hf_offline
+from axolotl.prompt_strategies.alpaca import AlpacaPrompter
+from axolotl.prompt_tokenizers import (
+    AlpacaPromptTokenizingStrategy,
+    InstructionPromptTokenizingStrategy,
+    PromptTokenizingStrategy,
+    ShareGPTPromptTokenizingStrategy,
+)
+from axolotl.prompters import AlpacaInstructionPrompter, PromptStyle, ShareGPTPrompter
+from axolotl.utils.logging import get_logger


💡 Verification agent

🧩 Analysis chain

Fix multiple unused imports.

Several imported modules are not being used in this file, causing linting errors:

unittest

transformers.AutoTokenizer

Multiple unused imports from axolotl.prompt_tokenizers and axolotl.prompters

Additionally, the pipeline failures indicate that some of these imports refer to undefined names, suggesting a mismatch between imports and the actual module contents.

🏁 Script executed:

#!/bin/bash # Check for undefined names in imports grep -r "class ShareGPTPromptTokenizingStrategy\|class AlpacaInstructionPrompter\|class ShareGPTPrompter" src/axolotl/

Length of output: 120

🏁 Script executed:

#!/bin/bash # Display the content of the test file to verify which imports are actually used sed -n '1,200p' tests/test_prompt_tokenizers.py

Length of output: 7225

🏁 Script executed:

#!/bin/bash # List all PromptTokenizingStrategy classes in the prompt_tokenizers module grep -R "class .*PromptTokenizingStrategy" src/axolotl/prompt_tokenizers # Check for InstructionWSystemPromptTokenizingStrategy grep -R "InstructionWSystemPromptTokenizingStrategy" -R src/axolotl # Check for Llama2ChatTokenizingStrategy (case-insensitive for double L) grep -R "Llama2ChatTokenizingStrategy" -R src/axolotl || true grep -R "LLama2ChatTokenizingStrategy" -R src/axolotl || true

Length of output: 1311

🏁 Script executed:

#!/bin/bash # Locate and inspect the prompt_tokenizers module file=$(find src/axolotl -type f -name "prompt_tokenizers.py" -print) echo "Module file: $file" if [[ -n "$file" ]]; then echo "----- Begin prompt_tokenizers.py -----" sed -n '1,200p' "$file" echo "----- End prompt_tokenizers.py -----" else echo "prompt_tokenizers.py not found" fi

Length of output: 6487

Refactor imports in tests/test_prompt_tokenizers.py

Several imports at the top of tests/test_prompt_tokenizers.py are either unused or refer to classes that don’t exist in the modules you’re importing from. At the same time, some of the classes and modules actually used in the tests are never imported. Please update the import block as follows:

• Remove unused/undefined imports:

unittest

transformers.AutoTokenizer

InstructionPromptTokenizingStrategy, PromptTokenizingStrategy, ShareGPTPromptTokenizingStrategy (they’re not in axolotl.prompt_tokenizers)

AlpacaInstructionPrompter, ShareGPTPrompter (never referenced in tests)

• Keep these existing imports:

pytest

get_logger (for LOG = get_logger(…))

PromptStyle (used in TestInstructionWSystemPromptTokenizingStrategy)

• Add missing imports for classes actually used in the tests:

NoSystemPrompter

SystemDataPrompter, InstructionWSystemPromptTokenizingStrategy

Llama2ChatPrompter, LLama2ChatTokenizingStrategy

Built-in modules: json, Path

Here’s a suggested diff for the top of your test file:

- import unittest - import pytest - from transformers import AutoTokenizer - from axolotl.prompt_strategies.alpaca import AlpacaPrompter - from axolotl.prompt_tokenizers import ( - AlpacaPromptTokenizingStrategy, - InstructionPromptTokenizingStrategy, - PromptTokenizingStrategy, - ShareGPTPromptTokenizingStrategy, - ) - from axolotl.prompters import AlpacaInstructionPrompter, PromptStyle, ShareGPTPrompter - from axolotl.utils.logging import get_logger + import pytest + import json + from pathlib import Path + from axolotl.utils.logging import get_logger + from axolotl.prompt_strategies.alpaca import AlpacaPrompter, NoSystemPrompter + from axolotl.prompt_strategies.alpaca_w_system import ( + SystemDataPrompter, + InstructionWSystemPromptTokenizingStrategy, + ) + from axolotl.prompt_strategies.llama2_chat import ( + Llama2ChatPrompter, + LLama2ChatTokenizingStrategy, + ) + from axolotl.prompt_tokenizers import AlpacaPromptTokenizingStrategy + from axolotl.prompters import PromptStyle

These changes will:

Eliminate lint errors for unused imports

Fix the import errors for missing classes

Keep only the imports that are actually referenced by the tests

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

import unittest

from axolotl.prompt_strategies.alpaca_chat import NoSystemPrompter

from axolotl.prompt_strategies.alpaca_w_system import (

InstructionWSystemPromptTokenizingStrategy,

SystemDataPrompter,

)

from axolotl.prompt_strategies.llama2_chat import (

Llama2ChatPrompter,

LLama2ChatTokenizingStrategy,

)

from axolotl.prompt_strategies.orpo.chat_template import load

from axolotl.prompt_tokenizers import AlpacaPromptTokenizingStrategy

from axolotl.prompters import AlpacaPrompter, PromptStyle

from axolotl.utils.dict import DictDefault

import pytest

from transformers import AutoTokenizer

from tests.hf_offline_utils import enable_hf_offline

from axolotl.prompt_strategies.alpaca import AlpacaPrompter

from axolotl.prompt_tokenizers import (

AlpacaPromptTokenizingStrategy,

InstructionPromptTokenizingStrategy,

PromptTokenizingStrategy,

ShareGPTPromptTokenizingStrategy,

)

from axolotl.prompters import AlpacaInstructionPrompter, PromptStyle, ShareGPTPrompter

from axolotl.utils.logging import get_logger

import pytest

import json

from pathlib import Path

from axolotl.utils.logging import get_logger

from axolotl.prompt_strategies.alpaca import AlpacaPrompter, NoSystemPrompter

from axolotl.prompt_strategies.alpaca_w_system import (

SystemDataPrompter,

InstructionWSystemPromptTokenizingStrategy,

)

from axolotl.prompt_strategies.llama2_chat import (

Llama2ChatPrompter,

LLama2ChatTokenizingStrategy,

)

from axolotl.prompt_tokenizers import AlpacaPromptTokenizingStrategy

from axolotl.prompters import PromptStyle

🧰 Tools

🪛 Ruff (0.11.9)

3-3: unittest imported but unused

Remove unused import: unittest

(F401)

6-6: transformers.AutoTokenizer imported but unused

Remove unused import: transformers.AutoTokenizer

(F401)

11-11: axolotl.prompt_tokenizers.InstructionPromptTokenizingStrategy imported but unused

Remove unused import

(F401)

12-12: axolotl.prompt_tokenizers.PromptTokenizingStrategy imported but unused

Remove unused import

(F401)

13-13: axolotl.prompt_tokenizers.ShareGPTPromptTokenizingStrategy imported but unused

Remove unused import

(F401)

15-15: axolotl.prompters.AlpacaInstructionPrompter imported but unused

Remove unused import

(F401)

15-15: axolotl.prompters.ShareGPTPrompter imported but unused

Remove unused import

(F401)

🪛 GitHub Actions: lint

[error] 3-221: flake8 and pylint errors: multiple unused imports (e.g., 'unittest', 'transformers.AutoTokenizer'), undefined names (e.g., 'NoSystemPrompter', 'SystemDataPrompter', 'InstructionWSystemPromptTokenizingStrategy', 'Path', 'json', 'Llama2ChatPrompter', 'LLama2ChatTokenizingStrategy', 'load', 'DictDefault'), and E0611 import errors for missing attributes in modules 'axolotl.prompt_tokenizers' and 'axolotl.prompters'.

[error] 9-15: mypy errors: Module 'axolotl.prompt_tokenizers' has no attribute 'ShareGPTPromptTokenizingStrategy'; Module 'axolotl.prompters' has no attribute 'AlpacaInstructionPrompter' and 'ShareGPTPrompter'.

🤖 Prompt for AI Agents

In tests/test_prompt_tokenizers.py lines 3 to 16, the import statements include unused modules like unittest and transformers.AutoTokenizer, and also import classes that do not exist in the specified modules, causing lint and import errors. Remove unused imports unittest, transformers.AutoTokenizer, InstructionPromptTokenizingStrategy, PromptTokenizingStrategy, ShareGPTPromptTokenizingStrategy, AlpacaInstructionPrompter, and ShareGPTPrompter. Retain pytest, get_logger, and PromptStyle imports. Add missing imports for NoSystemPrompter, SystemDataPrompter, InstructionWSystemPromptTokenizingStrategy, Llama2ChatPrompter, LLama2ChatTokenizingStrategy, and built-in modules json and Path to match the classes actually used in the tests. This will fix lint errors and import mismatches.

src/axolotl/utils/schemas/config.py

src/axolotl/utils/samplers/multipack.py

winglian · 2025-05-20T15:55:49Z

@SalmanMohammadi black and flake8 doesn't like a lot of the formatting changes

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (2)

🧹 Nitpick comments (2)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 3dbe0f4 and faa1e03.

📒 Files selected for processing (3)

src/axolotl/integrations/cut_cross_entropy/__init__.py (2 hunks)
src/axolotl/integrations/liger/__init__.py (4 hunks)
src/axolotl/utils/logging.py (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (2)

src/axolotl/integrations/liger/init.py
src/axolotl/integrations/cut_cross_entropy/init.py

🧰 Additional context used

🪛 Ruff (0.11.9)

src/axolotl/utils/logging.py

37-37: Use of functools.lru_cache or functools.cache on methods can lead to memory leaks

(B019)

⏰ Context from checks skipped due to timeout of 90000ms (6)

GitHub Check: PyTest from Source Dist (3.11, 2.7.0)
GitHub Check: PyTest (3.11, 2.7.0)
GitHub Check: PyTest from Source Dist (3.11, 2.6.0)
GitHub Check: PyTest (3.11, 2.6.0)
GitHub Check: PyTest from Source Dist (3.11, 2.5.1)
GitHub Check: PyTest (3.11, 2.5.1)

🔇 Additional comments (2)

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (1)

src/axolotl/utils/logging.py (1)

41-50: ⚠️ Potential issue

Fix potential memory leak in warning_once method.

Using lru_cache on methods can lead to memory leaks because it keeps references to self and all arguments, preventing garbage collection. Although setting maxsize=10 limits the cache size, it doesn't fully solve the issue.

Consider implementing a class-level cache instead:

-@functools.lru_cache(maxsize=10)
 def warning_once(self, *args, **kwargs):
     """
     This method is identical to `logger.warning()`, but will emit the warning with the same message only once

     Note: The cache is for the function arguments, so 2 different callers using the same arguments will hit the
     cache. The assumption here is that all warning messages are unique across the code. If they aren't then need to
     switch to another type of cache that includes the caller frame information in the hashing function.
     """
-    self.warning(*args, **kwargs)
+    # Use a class-level cache to avoid memory leaks
+    if not hasattr(MultiProcessAdapter, "_warning_cache"):
+        MultiProcessAdapter._warning_cache = set()
+    
+    # Create a cache key from the arguments
+    key = str(args) + str(sorted(kwargs.items()))
+    
+    if key not in MultiProcessAdapter._warning_cache:
+        MultiProcessAdapter._warning_cache.add(key)
+        self.warning(*args, **kwargs)
+        
+        # Limit cache size to prevent unbounded growth
+        if len(MultiProcessAdapter._warning_cache) > 10:
+            MultiProcessAdapter._warning_cache.pop()

This implementation avoids the memory leak while maintaining the same functionality.

🧰 Tools

🪛 Ruff (0.11.9)

41-41: Use of functools.lru_cache or functools.cache on methods can lead to memory leaks

(B019)

🧹 Nitpick comments (4)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 9dcb21d and 9c8403d.

📒 Files selected for processing (6)

src/axolotl/cli/config.py (5 hunks)
src/axolotl/integrations/cut_cross_entropy/__init__.py (2 hunks)
src/axolotl/integrations/liger/__init__.py (4 hunks)
src/axolotl/utils/logging.py (1 hunks)
src/axolotl/utils/schemas/config.py (7 hunks)
src/axolotl/utils/schemas/model.py (2 hunks)

✅ Files skipped from review due to trivial changes (1)

src/axolotl/utils/schemas/model.py

🚧 Files skipped from review as they are similar to previous changes (4)

src/axolotl/cli/config.py
src/axolotl/integrations/liger/init.py
src/axolotl/integrations/cut_cross_entropy/init.py
src/axolotl/utils/schemas/config.py

🧰 Additional context used

🪛 Ruff (0.11.9)

src/axolotl/utils/logging.py

41-41: Use of functools.lru_cache or functools.cache on methods can lead to memory leaks

(B019)

⏰ Context from checks skipped due to timeout of 90000ms (10)

GitHub Check: PyTest from Source Dist (3.11, 2.7.0)
GitHub Check: PyTest from Source Dist (3.11, 2.6.0)
GitHub Check: PyTest from Source Dist (3.11, 2.5.1)
GitHub Check: PyTest (3.11, 2.7.0)
GitHub Check: PyTest (3.11, 2.6.0)
GitHub Check: test-axolotl-multigpu (126, 12.6.3, 3.11, 2.7.0, 2, true)
GitHub Check: test-axolotl-multigpu (124, 12.4.1, 3.11, 2.5.1, 2, true)
GitHub Check: PyTest (3.11, 2.5.1)
GitHub Check: test-axolotl-multigpu (124, 12.4.1, 3.11, 2.6.0, vllm, 2, true)
GitHub Check: pre-commit

djsaunde

great stuff, thanks for doing this. just need to rebase and (IMO) make the LOG statements one-liners where possible.

examples/llama-3/qlora-1b-kto.yaml

djsaunde · 2025-05-23T15:41:13Z

src/axolotl/utils/logging.py

src/axolotl/utils/data/utils.py

djsaunde · 2025-05-23T15:49:45Z

src/axolotl/utils/logging.py

+        )
+
+    def log(self, level, msg, *args, **kwargs):
+        use_environ = kwargs.pop("use_environ", self.use_environ)


nice, I like that you can set this at different levels like this

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (1)

src/axolotl/utils/logging.py (1)

41-50: ⚠️ Potential issue

Fix potential memory leak in warning_once method

Using functools.lru_cache on a method can lead to memory leaks because it keeps a reference to self. While setting maxsize=10 is better than None, it still risks memory leaks.

Consider using a class-level cache instead:

-    @functools.lru_cache(maxsize=10)
     def warning_once(self, *args, **kwargs):
         """
         This method is identical to `logger.warning()`, but will emit the warning with the same message only once
@@ -48,4 +47,11 @@
         cache. The assumption here is that all warning messages are unique across the code. If they aren't then need to
         switch to another type of cache that includes the caller frame information in the hashing function.
         """
-        self.warning(*args, **kwargs)
+        # Use a class-level cache
+        if not hasattr(MultiProcessAdapter, "_warning_cache"):
+            MultiProcessAdapter._warning_cache = {}
+        
+        key = str(args) + str(sorted(kwargs.items()))
+        if key not in MultiProcessAdapter._warning_cache or len(MultiProcessAdapter._warning_cache) > 10:
+            MultiProcessAdapter._warning_cache[key] = True
+            self.warning(*args, **kwargs)

🧰 Tools

🪛 Ruff (0.11.9)

41-41: Use of functools.lru_cache or functools.cache on methods can lead to memory leaks

(B019)

🧹 Nitpick comments (3)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 9c8403d and 9dfd461.

📒 Files selected for processing (7)

src/axolotl/cli/config.py (5 hunks)
src/axolotl/monkeypatch/unsloth_.py (3 hunks)
src/axolotl/utils/data/sft.py (7 hunks)
src/axolotl/utils/logging.py (1 hunks)
src/axolotl/utils/models.py (10 hunks)
src/axolotl/utils/schemas/config.py (4 hunks)
src/axolotl/utils/schemas/model.py (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (6)

src/axolotl/utils/schemas/model.py
src/axolotl/monkeypatch/unsloth_.py
src/axolotl/cli/config.py
src/axolotl/utils/schemas/config.py
src/axolotl/utils/models.py
src/axolotl/utils/data/sft.py

🧰 Additional context used

🪛 Ruff (0.11.9)

src/axolotl/utils/logging.py

41-41: Use of functools.lru_cache or functools.cache on methods can lead to memory leaks

(B019)

⏰ Context from checks skipped due to timeout of 90000ms (8)

GitHub Check: PyTest from Source Dist (3.11, 2.7.0)
GitHub Check: PyTest from Source Dist (3.11, 2.5.1)
GitHub Check: PyTest from Source Dist (3.11, 2.6.0)
GitHub Check: PyTest (3.11, 2.5.1)
GitHub Check: PyTest (3.11, 2.7.0)
GitHub Check: PyTest (3.11, 2.6.0)
GitHub Check: test-axolotl-multigpu (126, 12.6.3, 3.11, 2.7.0, 2, true)
GitHub Check: test-axolotl-multigpu (124, 12.4.1, 3.11, 2.5.1, 2, true)

🔇 Additional comments (3)

djsaunde

Approving now since this largely looks good, but there's still a some cases of trailing commas in the LOG commands that should be fixed before merge. For example, in src/axolotl/utils/data/sft.py:

    LOG.info(
        f"Maximum number of steps set at {total_num_steps}",
    )

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between dd76a13 and 1df853f.

📒 Files selected for processing (15)

src/axolotl/cli/utils.py (1 hunks)
src/axolotl/common/datasets.py (1 hunks)
src/axolotl/core/trainer_builder.py (1 hunks)
src/axolotl/integrations/base.py (2 hunks)
src/axolotl/integrations/liger/__init__.py (4 hunks)
src/axolotl/monkeypatch/gradient_checkpointing/offload_disk.py (1 hunks)
src/axolotl/monkeypatch/peft/utils.py (1 hunks)
src/axolotl/prompt_strategies/chat_template.py (3 hunks)
src/axolotl/train.py (3 hunks)
src/axolotl/utils/config/__init__.py (1 hunks)
src/axolotl/utils/data/rl.py (3 hunks)
src/axolotl/utils/samplers/multipack.py (1 hunks)
src/axolotl/utils/schemas/config.py (4 hunks)
tests/e2e/multigpu/solo/test_grpo.py (1 hunks)
tests/patched/test_validation.py (15 hunks)

✅ Files skipped from review due to trivial changes (2)

src/axolotl/monkeypatch/gradient_checkpointing/offload_disk.py
src/axolotl/integrations/liger/init.py

🚧 Files skipped from review as they are similar to previous changes (11)

src/axolotl/integrations/base.py
src/axolotl/utils/schemas/config.py
src/axolotl/utils/config/init.py
src/axolotl/monkeypatch/peft/utils.py
src/axolotl/cli/utils.py
src/axolotl/common/datasets.py
src/axolotl/utils/samplers/multipack.py
src/axolotl/utils/data/rl.py
src/axolotl/prompt_strategies/chat_template.py
src/axolotl/train.py
tests/patched/test_validation.py

🧰 Additional context used

🪛 Ruff (0.11.9)

tests/e2e/multigpu/solo/test_grpo.py

121-124: Use contextlib.suppress(FileNotFoundError) instead of try-except-pass

Replace with contextlib.suppress(FileNotFoundError)

(SIM105)

src/axolotl/core/trainer_builder.py

92-95: Use contextlib.suppress(ImportError) instead of try-except-pass

(SIM105)

🔇 Additional comments (2)

quick start

2d68a1f

djsaunde reviewed May 1, 2025

View reviewed changes

src/axolotl/utils/logging.py Outdated Show resolved Hide resolved

winglian reviewed May 1, 2025

View reviewed changes

src/axolotl/utils/logging.py Outdated Show resolved Hide resolved

SalmanMohammadi and others added 2 commits May 2, 2025 09:36

quick start

730fe0d

refactor log rank zero funcs

66162cb

winglian force-pushed the dist_logging branch from 2d68a1f to 66162cb Compare May 2, 2025 13:36

use multi process logging adapter similar to accelerate

89d44dd

winglian added this to the Axolotl v0.10.0 milestone May 7, 2025

Merge branch 'dist_logging' of github.com:axolotl-ai-cloud/axolotl in…

823338a

…to dist_logging

SalmanMohammadi mentioned this pull request May 9, 2025

SP GRPO support + batch SP fixes #2643

Merged

djsaunde added a commit that referenced this pull request May 9, 2025

removing main process only logs in favor of #2608

ef1a795

djsaunde added a commit that referenced this pull request May 10, 2025

removing main process only logs in favor of #2608

978ae1b

djsaunde added a commit that referenced this pull request May 12, 2025

removing main process only logs in favor of #2608

1263243

djsaunde added a commit that referenced this pull request May 12, 2025

removing main process only logs in favor of #2608

6aa733e

SalmanMohammadi and others added 3 commits May 15, 2025 08:29

quick start

39c2690

refactor log rank zero funcs

adb78c7

use multi process logging adapter similar to accelerate

aa97c92

winglian force-pushed the dist_logging branch from 89d44dd to aa97c92 Compare May 15, 2025 12:31

SalmanMohammadi added 5 commits May 15, 2025 17:56

wip replacing calls

cf7ed4e

replacing more calls - testing on dist setup

014f499

updating

d0a30f1

seems to be working

07ae995

Merge branch 'main' into dist_logging

e930ba7

SalmanMohammadi marked this pull request as ready for review May 19, 2025 17:38

SalmanMohammadi requested a review from winglian May 19, 2025 17:39

coderabbitai bot reviewed May 19, 2025

View reviewed changes

coderabbitai bot reviewed May 22, 2025

View reviewed changes

SalmanMohammadi added 8 commits May 22, 2025 16:16

fixing logging

598fcf6

fixing logging

9dcb21d

debugging

a4f00f2

debugging

e37ff32

debugging

7ce9baa

debugging

7f12bde

fixed logging

42921f6

configuring use_environ with get_logger

9c8403d

coderabbitai bot reviewed May 23, 2025

View reviewed changes

SalmanMohammadi requested review from djsaunde and winglian May 23, 2025 12:28

djsaunde reviewed May 23, 2025

View reviewed changes

comments

9dfd461

coderabbitai bot reviewed May 23, 2025

View reviewed changes

djsaunde self-requested a review May 23, 2025 17:57

SalmanMohammadi mentioned this pull request May 27, 2025

fix: suppress non-axolotl logs unless it's warning or higher #2724

Merged

djsaunde approved these changes May 27, 2025

View reviewed changes

djsaunde mentioned this pull request May 27, 2025

Data loader refactor #2707

Merged

SalmanMohammadi added 3 commits May 27, 2025 18:36

comments-fixing test

dd76a13

merge conflicts

c198524

CI

1df853f

coderabbitai bot reviewed May 28, 2025

View reviewed changes

SalmanMohammadi added 4 commits May 28, 2025 11:12

merging

20a4621

merging

ba46957

fixing trailing commas

fb5e1d1

fixing trailing commas

cef31bc

SalmanMohammadi merged commit 65c5481 into main May 28, 2025
14 of 16 checks passed

SalmanMohammadi deleted the dist_logging branch May 28, 2025 13:57

dhruvmullick mentioned this pull request May 28, 2025

vLLM serve fails with Liger plugin #2736

Closed

8 tasks

Uh oh!

Rank 0-only logging #2608

Rank 0-only logging #2608

Uh oh!

Conversation

SalmanMohammadi commented May 1, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

Uh oh!

Uh oh!

codecov bot commented May 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

coderabbitai bot commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Suggested reviewers

Poem

Chat

Support

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot May 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot May 19, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot May 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

winglian commented May 20, 2025

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

djsaunde left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

djsaunde May 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

djsaunde May 23, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

djsaunde left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

SalmanMohammadi commented May 1, 2025 •

edited by coderabbitai bot

Loading

codecov bot commented May 2, 2025 •

edited

Loading

coderabbitai bot commented May 15, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)

djsaunde left a comment •

edited

Loading