HedraRAG: Artifact Evaluation README

This document provides the instructions to reproduce the experimental environment for the HedraRAG artifact. The following components and versions are required for successful setup and evaluation.

System Requirements

Operating System: Ubuntu 20.04 (Linux x86_64)
Python Version: 3.11
CUDA Version: 12.4
PyTorch Version: 2.5.1

We recommend using the official PyTorch Docker image: pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel Available at: https://hub.docker.com/r/pytorch/pytorch

Environment Setup

Clone Repository

git clone <your-repo-url>

cd <repo-root>
Create and Activate Conda Environment (Recommended)

conda create -n heterag python=3.9 -y

conda activate heterag
Install Dependencies

bash Dependency.sh
Build HedraRAG

bash Install.sh
Build LangChain (baseline) [optional]

cd LangChain

pip install -r requirements.txt

Dataset Preparation

The original paper uses a large Wikipedia page index (>100GB), which may be inconvenient for quick prototyping or evaluation. To simplify the setup, we provide a smaller pre-built index based on the MS MARCO passage corpus (~36GB) to help users efficiently build and test the pipeline.

1. Download Pre-built Index and Corpus

Please download the index and from the following link: https://doi.org/10.5281/zenodo.16663591

2. Configure Dataset Paths

Update data.conf before running the pipeline:

export index_path=/path/to/ivf.index

export corpus_path=Tevatron/msmarco-passage-corpus

export model_path=/huggingface/model_path

index_path: Path to the downloaded FAISS index file
corpus_path: Defaults to the Tevatron MS MARCO passage corpus on HuggingFace
model_path: HuggingFace model path used for generation

3. Using Custom Corpus and Index (Optional)

You can also use your own corpus and corresponding index by updating the paths accordingly:

Set index_path to your own FAISS index
Set corpus_path to either a local path or HuggingFace dataset

If you want to build your own FAISS IVF index, we recommend using the intfloat/e5-large-v2 model to encode your documents.

4. Build the Corpus Used in the Paper

The corpus and index used in the paper are based on Wikipedia passages up to the end of 2022, available at https://zenodo.org/records/16849723, and encoded with intfloat/e5-large-v2.

Since the index is large, we recommend building it locally. On a high-performance CPU+GPU machine, this process may take several days.

Steps:

Download the corpus file

Download text-list-100-sec.jsonl from the above link.
Run the build script

Use the provided build_index.sh and modify the first two lines: corpus_path=/path/to/text-list-100-sec.jsonl save_dir=/path/to/save_dir
- corpus_path: Path to the downloaded text-list-100-sec.jsonl file
- save_dir: Output directory; the generated ivf.index will be stored here
Preprocessing and storage optimization
- The build script supports a checkpoint mechanism for resuming. If the run fails midway, you can re-execute build_index.sh to resume and continue.
- The process may consume up to 240GB of storage space and take over 30 hours on our CPU.
- After preprocessing is complete, you can delete emb_e5.memmap to save storage space.
Update the configuration In data.conf, set: export corpus_path=/path/to/text-list-100-sec.jsonl export index_path=/path/to/save_dir/ivf.index

You can then run the paper experiments directly.

Running Experiments

Once the environment is set up, you can run the evaluation scripts to reproduce the experimental results.

We provide a series of scripts named run_fig[X].sh, each corresponding to Figure [X] in the paper. These scripts execute the experiments and generate the associated plots.

All individual execution and plotting scripts are located in the evaluation/ directory.

The final plots can be found in the evaluation/output_figure directory.

Expected Resource Usage

The experiments in this artifact are designed to be executable on modern GPU-CPU servers with large memory capacity.

GPU requirements:
- To reproduce the full experiments in the paper, we recommend at least 200 GB of CPU memory and 80 GB of GPU memory.
Runtime per experiment:
- run_fig12.sh: ~1 hour
- All other run_fig[X].sh scripts: ≤0.5 hour

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
HedraRAG		HedraRAG
LangChain		LangChain
evaluation		evaluation
faiss		faiss
.gitignore		.gitignore
Dependency.sh		Dependency.sh
Install.sh		Install.sh
ReadMe.md		ReadMe.md
build_index.sh		build_index.sh
data.conf		data.conf
requirement.txt		requirement.txt
run_fig12.sh		run_fig12.sh
run_fig13.sh		run_fig13.sh
run_fig14.sh		run_fig14.sh
run_fig15.sh		run_fig15.sh
run_fig16.sh		run_fig16.sh
run_fig17.sh		run_fig17.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HedraRAG: Artifact Evaluation README

System Requirements

Environment Setup

Dataset Preparation

1. Download Pre-built Index and Corpus

2. Configure Dataset Paths

3. Using Custom Corpus and Index (Optional)

4. Build the Corpus Used in the Paper

Running Experiments

Expected Resource Usage

About

Uh oh!

Releases

Packages

Contributors 2

Languages

Leo9660/HedraRAG_AE

Folders and files

Latest commit

History

Repository files navigation

HedraRAG: Artifact Evaluation README

System Requirements

Environment Setup

Dataset Preparation

1. Download Pre-built Index and Corpus

2. Configure Dataset Paths

3. Using Custom Corpus and Index (Optional)

4. Build the Corpus Used in the Paper

Running Experiments

Expected Resource Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages