[TRTLLM-7440][fix] Split fused_input_embed to separate out host sync
#7280
Loading
fused_input_embed to separate out host sync
#7280