Thanks for sharing your work, it's great. Can we use Qwen2-7B for pre-training and fine-tuning? Or Phi-3 or Phi-4?
Thanks for sharing your work, it's great. Can we use Qwen2-7B for pre-training and fine-tuning? Or Phi-3 or Phi-4?