You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Validating and improving how profiling is done. Internal Ipex profiling example works in isolation, but with this Megatron-DeepSpeed we are missing the XPU outputs.
Timeline output with XPU, including kernels and communication
Compare support and output of the PyTorch legacy profiler
Check whether IPEX_ZE_TRACING=1 is needed and add that to relevant scripts
Generate smaller profiler output files by leveraging torch.profiler.schedule and profiler.step(). For example wait 2 steps, warmup for 2 steps, and then profile 2 steps and write the output.
The text was updated successfully, but these errors were encountered:
Validating and improving how profiling is done. Internal Ipex profiling example works in isolation, but with this Megatron-DeepSpeed we are missing the XPU outputs.
IPEX_ZE_TRACING=1
is needed and add that to relevant scriptstorch.profiler.schedule
andprofiler.step()
. For example wait 2 steps, warmup for 2 steps, and then profile 2 steps and write the output.The text was updated successfully, but these errors were encountered: