Default Branch

edd65a84dd · Merge pull request #6362 from hpcaitech/CI/test_build_on_schedule · Updated 2025-07-15 06:25:10 +00:00

Branches

6019434ac9 · Merge pull request #6370 from ChosenQC/feature/pdf-rag · Updated 2025-07-23 06:26:08 +00:00

0
4

57e92104a2 · hotfix entropy calculation (#6364) · Updated 2025-07-22 02:02:02 +00:00

45
114

5c5cb1863b · hotfix · Updated 2025-07-21 10:04:20 +00:00

45
114

9f8c97d028 · add entropy · Updated 2025-07-16 08:44:23 +00:00

45
113

16450a5ac6 · [pre-commit.ci] auto fixes from pre-commit.com hooks · Updated 2025-07-14 17:21:12 +00:00

3
2

973dea21c7 · remove assert · Updated 2025-06-27 06:16:23 +00:00

45
114

9379a89677 · [feat][npu] Merge from grpo-latest (#6346) · Updated 2025-06-23 03:49:13 +00:00

45
78

c7d3d0dc8f · remove unused parameter · Updated 2025-06-19 07:14:16 +00:00

45
109

2db255bf15 · add profiling, implement memory efficient logprob calculation · Updated 2025-06-18 10:08:22 +00:00

45
98

2f02a28777 · Update README.md · Updated 2025-06-12 03:21:31 +00:00

45
99

9ca920c1af · [pre-commit.ci] auto fixes from pre-commit.com hooks · Updated 2025-06-09 01:48:20 +00:00

45
92

96faf54542 · fix typ and parameter description · Updated 2025-06-05 07:41:14 +00:00

45
87

e00c9bbf38 · upgrade python · Updated 2025-06-03 10:51:39 +00:00

5
0
Included

5890c8ecdd · Merge pull request #6335 from wangbluo/lazy_deepseek · Updated 2025-06-02 03:30:11 +00:00

35
114

f8bd2db33f · add uuid to rollout log · Updated 2025-05-20 01:45:56 +00:00

45
68

18f2247a10 · update consumer · Updated 2025-05-14 10:19:47 +00:00

45
58

367ae3f233 · Revert "Support evaluation during training" · Updated 2025-05-07 02:52:08 +00:00

45
58

16169d1f22 · Revert "[feat] Update reward verification" · Updated 2025-05-06 04:59:30 +00:00

45
54

4d18e7d772 · spot a possible bug · Updated 2025-05-05 10:48:42 +00:00

45
59

d4a6b6c4a7 · update evaluation parameters · Updated 2025-05-04 08:41:27 +00:00

45
57