KTO loss #410

vulkomilev · 2024-11-27T20:00:35Z

Summary

This is the kto loss implemented by references from other projects

Details

I am not sure about the correctness (because this is my first PR) of the final results so I expect a lot of comments

Testing Done

I have done the basic testing inspired from cpo

pramodith · 2024-11-29T19:01:57Z

test/chunked_loss/test_cpo_loss.py

@@ -126,7 +126,7 @@ def test_correctness(
        input1, weight1, target, bias1, alpha=alpha
    )
    loss2 = LigerFusedLinearCPOFunction.apply(
-        input2, weight2, target, bias2, ignore_index, beta, alpha, True
+        input2, weight2, target, bias2, ignore_index, beta, alpha, False


Why are we changing the test case for an unrelated alignment algo?

Sorry my bad.

pramodith

Hey, I think this code needs to be refactored to make things a bit cleaner and easier to understand. Could you also write out the equations for KTO in the description to the PR so that its easier for a reviewer to understand?

pramodith · 2024-11-29T19:04:00Z

src/liger_kernel/chunked_loss/fused_linear_preference_kto.py

+from torch.nn import functional as F
+
+
+class LigerFusedLinearKTOPreferenceBase(torch.autograd.Function):


Why is this class needed, can't you reuse https://github.com/linkedin/Liger-Kernel/blob/main/src/liger_kernel/chunked_loss/fused_linear_preference.py?

I am getting this error
E RuntimeError: CUDA error: device-side assert triggered E CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. E For debugging consider passing CUDA_LAUNCH_BLOCKING=1 E Compile with TORCH_USE_CUDA_DSA` to enable device-side assertions.

src/liger_kernel/chunked_loss/fused_linear_preference.py:210: RuntimeError
---------------------------------------------------------------------------------------------------------------- Captured stdout call -----------------------------------------------------------------------------------------------------------------

---------------------------------------------------------------------------------------------------------------- Captured stderr call -----------------------------------------------------------------------------------------------------------------
NoneType: None
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [6,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [7,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [12,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [83,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [32,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [43,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [54,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [59,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [62,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
=============================================================================================================== short test summary info ===============================================================================================================
FAILED test/chunked_loss/test_kto_loss.py::test_correctness[-100-0.1-1.0-False-1.0-dtype0-0.005-0.005-3-47-31-123] - RuntimeError: CUDA error: device-side assert triggered
================================================================================================================== 1 failed in 1.86s ============================================`

I will do the equations and the formatting. Also I need two arguments 'reference_chosen_logps' and 'reference_rejected_logps' to my custom loss function.

vulkomilev added 3 commits November 21, 2024 00:07

working on tests

ef59f91

test are working but I have problem with assertions

b053b0c

basic test working

2461a33

vulkomilev mentioned this pull request Nov 28, 2024

[RFC] Liger FlexChunkLoss: Alignment and Distillation loss #371

Open

12 tasks

pramodith reviewed Nov 29, 2024

View reviewed changes

pramodith requested changes Nov 29, 2024

View reviewed changes

returned to fused loss

5deb7f9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KTO loss #410

KTO loss #410

vulkomilev commented Nov 27, 2024

pramodith Nov 29, 2024

vulkomilev Nov 30, 2024

pramodith left a comment

pramodith Nov 29, 2024

vulkomilev Dec 1, 2024

vulkomilev Dec 1, 2024

		from torch.nn import functional as F


		class LigerFusedLinearKTOPreferenceBase(torch.autograd.Function):

KTO loss #410

Are you sure you want to change the base?

KTO loss #410

Conversation

vulkomilev commented Nov 27, 2024

Summary

Details

Testing Done

pramodith Nov 29, 2024

Choose a reason for hiding this comment

vulkomilev Nov 30, 2024

Choose a reason for hiding this comment

pramodith left a comment

Choose a reason for hiding this comment

pramodith Nov 29, 2024

Choose a reason for hiding this comment

vulkomilev Dec 1, 2024

Choose a reason for hiding this comment

vulkomilev Dec 1, 2024

Choose a reason for hiding this comment