Skip to content

Latest commit

 

History

History
113 lines (90 loc) · 7.96 KB

File metadata and controls

113 lines (90 loc) · 7.96 KB

ITDR: An Instruction Tuning Dataset for Enhancing Large Language Models in Recommendations

Paper

🤖Model

The modles are available at ModelScope.

All fine-tuned LoRA adapters are available via Google Drive.

🔥News

  • [Nov. 2025]: ITDR has been accepted to KDD2026!

Introduction

Large language models (LLMs) have demonstrated outstanding performance in natural language processing tasks. However, in the field of recommendation systems, due to the structural differences between user behavior data and natural language, LLMs struggle to effectively model the associations between user preferences and items. Although prompt-based methods can generate recommendation results, their inadequate understanding of recommendation tasks leads to constrained performance. To address this gap, in this work, we construct a sufficient instruction tuning dataset, ITDR, which encompasses 7 subtasks across two core root tasks—useritem interaction and user-item understanding. The dataset integrates data from 13 public recommendation datasets and is built using manually crafted standardized templates, comprising approximately 200,000 instances. Experimental results demonstrate that ITDR significantly enhances the performance of mainstream open-source LLMs such as GLM-4, Qwen2.5, Qwen2.5-Instruct and LLaMA-3.2 on recommendation tasks. Furthermore, we analyze the correlations between tasks and explore the impact of task descriptions and data scale on instruction tuning effectiveness. Finally, we perform comparative experiments against closed-source LLMs with substantial parameters.

Architecture of ITDR

academic_sankey

Perfomance

Comparison_before_and_after_fine-tuning

All Source Data We Used In The Paper

Dataset Name Link
Anime Dataset 2023 https://www.kaggle.com/datasets/dbdmobile/myanimelist-dataset
MovieLens 1M/32M https://grouplens.org/datasets/movielens/
Amazon Reviews 2023 https://huggingface.co/datasets/McAuley-Lab/Amazon-Reviews-2023/tree/main
MicroLens https://github.com/westlake-repl/MicroLens
PixelRec https://github.com/westlake-repl/PixelRec
BookCrossing https://www.kaggle.com/datasets/ruchi798/bookcrossing-dataset
Amazon Books Reviews https://www.kaggle.com/datasets/mohamedbakhet/amazon-books-reviews?select=books_data.csv
MIND https://msnews.github.io/
Steam https://github.com/kang205/SASRec?tab=readme-ov-file
Yelp https://www.kaggle.com/datasets/yelp-dataset/yelp-dataset
Last.FM 360K http://ocelma.net/MusicRecommendationDataset/lastfm-360K.html
Last.FM 1K https://yann.lecun.com/exdb/mnist/

Finetune

We use LLaMA-Factory for model fine-tuning. Below is an example of fine-tuning GLM-4:

CUDA_VISIBLE_DEVICES=0 python src/train.py \
--model_name_or_path /root/shared-nvme/models/glm-4-9b/ZhipuAI/glm-4-9b \
--trust_remote_code \
--stage sft \
--do_train \
--dataset train \
--template glm4 \
--finetuning_type lora \
--output_dir saves/glm-4-9b/sft \
--overwrite_cache \
--per_device_train_batch_size 1 \
--gradient_accumulation_steps 8 \
--lr_scheduler_type cosine \
--logging_steps 100 \
--save_steps 500 \
--learning_rate 1e-4 \
--num_train_epochs 2.0 \
--plot_loss \
--bf16

Deployment and Inference

We use vLLM for deployment and inference. Below is an example:

CUDA_VISIBLE_DEVICES=0 vllm serve /root/ITDR-GLM-4-9B \
    --host 0.0.0.0 \
    --port 8098 \
    --max-model-len 8192 \
    --tensor-parallel-size 1 \
    --disable-log-requests \
    --gpu-memory-utilization 0.9 \
    --enable-prefix-caching \
    --trust-remote-code

The statistics of ITDR

statistics_of_datasets

Details of the main experimental results

Performance on the user-item interaction tasks Performance on the user-item understanding tasks1 Performance on the user-item understanding tasks2

Details of the root tasks ablation experiment results

remove different root tasks_uii remove different root tasks_uiu1 remove different root tasks_uiu2

Details of the subtasks ablation experiment results

subtasks_ablation_uii subtasks_ablation_uiu1 subtasks_ablation_uiu2

Details of experimental results for closed source LLMs

closed_source LLMs_uii closed_source LLMs_uiu1 closed_source LLMs_uiu2

Data examples

RT TKR CDR NIR UAP IR TUI