Skip to content

Actions: hiyouga/LLaMA-Factory

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
4,580 workflow runs
4,580 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[trainer] support early stop (#7797)
tests #2461: Commit 7f3c31f pushed by hiyouga
April 21, 2025 17:59 8m 2s main
April 21, 2025 17:59 8m 2s
[trainer] support early stop
tests #2460: Pull request #7797 opened by hiyouga
April 21, 2025 17:48 8m 58s hiyouga/early_stop
April 21, 2025 17:48 8m 58s
[data] improve mmplugin (#7795)
tests #2459: Commit 92101f3 pushed by hiyouga
April 21, 2025 17:25 7m 44s main
April 21, 2025 17:25 7m 44s
[data] improve mmplugin
tests #2458: Pull request #7795 synchronize by hiyouga
April 21, 2025 17:17 6m 39s hiyouga/plugin
April 21, 2025 17:17 6m 39s
[data] improve mmplugin
tests #2457: Pull request #7795 opened by hiyouga
April 21, 2025 17:11 6m 5s hiyouga/plugin
April 21, 2025 17:11 6m 5s
[example] add bash usage (#7794)
tests #2456: Commit a62cba3 pushed by hiyouga
April 21, 2025 16:25 7m 22s main
April 21, 2025 16:25 7m 22s
[example] add bash usage
tests #2455: Pull request #7794 opened by hiyouga
April 21, 2025 16:18 7m 0s hiyouga/bash
April 21, 2025 16:18 7m 0s
[trainer] Add Muon Optimizer (#7749)
tests #2454: Commit d128382 pushed by hiyouga
April 21, 2025 15:38 13m 55s main
April 21, 2025 15:38 13m 55s
[parser] support omegaconf (#7793)
tests #2453: Commit 278df43 pushed by hiyouga
April 21, 2025 15:30 14m 54s main
April 21, 2025 15:30 14m 54s
Add Muon Optimizer
tests #2452: Pull request #7749 synchronize by hiyouga
April 21, 2025 15:30 12m 50s tianshijing:main
April 21, 2025 15:30 12m 50s
[data] Fix wrong position ids with packed attention masks (#7754)
tests #2450: Commit 81768df pushed by hiyouga
April 21, 2025 15:19 16m 11s main
April 21, 2025 15:19 16m 11s
[misc] fix new tokens adding (#7253)
tests #2449: Commit 1302ca3 pushed by hiyouga
April 21, 2025 15:19 5m 41s main
April 21, 2025 15:19 5m 41s
[misc] fix new tokens adding
tests #2448: Pull request #7253 synchronize by hiyouga
April 21, 2025 15:12 19m 9s flashJd:fix_issue_branch
April 21, 2025 15:12 19m 9s
[parser] support omegaconf
tests #2447: Pull request #7793 synchronize by hiyouga
April 21, 2025 15:09 18m 47s hiyouga/conf
April 21, 2025 15:09 18m 47s
[model] fix gemma3 export (#7786)
tests #2446: Commit b8cddbc pushed by hiyouga
April 21, 2025 15:07 29m 1s main
April 21, 2025 15:07 29m 1s
[misc] fix bug in constant (#7765)
tests #2445: Commit ec7257e pushed by hiyouga
April 21, 2025 15:06 11m 9s main
April 21, 2025 15:06 11m 9s
[parser] support omegaconf
tests #2443: Pull request #7793 synchronize by hiyouga
April 21, 2025 15:00 9m 36s hiyouga/conf
April 21, 2025 15:00 9m 36s
fix Gemma export
tests #2442: Pull request #7786 synchronize by hiyouga
April 21, 2025 14:53 6m 52s ddddng:main
April 21, 2025 14:53 6m 52s
[parser] support omegaconf
tests #2441: Pull request #7793 opened by hiyouga
April 21, 2025 14:48 7m 10s hiyouga/conf
April 21, 2025 14:48 7m 10s
UnboundLocalError: local variable 'image_seqlen' referenced before assignment
label_issue #2845: Issue #7791 opened by YuTinH
April 21, 2025 12:50 7s
April 21, 2025 12:50 7s
是否可以支持更长训练的训练,比如150K+的一些训练方法?
label_issue #2844: Issue #7790 opened by h123fire
April 21, 2025 12:30 10s
April 21, 2025 12:30 10s
InternVL3和2.5训练出错
label_issue #2843: Issue #7789 opened by Vincent-HKUSTGZ
April 21, 2025 10:20 6s
April 21, 2025 10:20 6s
学习率导致不同任务学习能力差别问题请教
label_issue #2842: Issue #7788 opened by Wangman1
April 21, 2025 08:16 9s
April 21, 2025 08:16 9s
mac 无法加载模型
label_issue #2841: Issue #7787 opened by SuJ1213
April 21, 2025 08:08 9s
April 21, 2025 08:08 9s