This repository contains the implementation and pre-trained models for FANformer, a novel architecture that enhances Large Language Models through effective periodicity modeling.
🎉 Our work has been accepted to NeurIPS'25.
- Revised Architecture: Implemented in `olmo/model.py` (a minimal sketch of the core FAN layer appears after this list)
- Model Scale: 1B-parameter pre-trained model
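
For orientation, below is a minimal PyTorch sketch of a FAN-style layer, which models periodicity by concatenating cosine and sine projections of the input with a conventional activated projection, following the FAN paper. The split ratio `p_ratio`, the activation choice, and the dimensions are illustrative assumptions; `olmo/model.py` remains the authoritative implementation and shows how this is integrated into the architecture.

```python
import torch
import torch.nn as nn

class FANLayer(nn.Module):
    """Sketch of a FAN (Fourier Analysis Network) style layer.

    The output concatenates periodic features cos(W_p x) and sin(W_p x)
    with a standard activated projection act(W_g x + b_g). The split
    ratio below is an illustrative assumption, not the repository's value.
    """

    def __init__(self, d_in: int, d_out: int, p_ratio: float = 0.25):
        super().__init__()
        d_p = int(d_out * p_ratio)   # width of the periodic part (shared by cos and sin)
        d_g = d_out - 2 * d_p        # width of the non-periodic part
        self.proj_p = nn.Linear(d_in, d_p, bias=False)
        self.proj_g = nn.Linear(d_in, d_g)
        self.act = nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        p = self.proj_p(x)
        # [cos(W_p x) || sin(W_p x) || act(W_g x + b_g)]
        return torch.cat([torch.cos(p), torch.sin(p), self.act(self.proj_g(x))], dim=-1)

# Example: map 2048-dimensional hidden states to 2048-dimensional features.
x = torch.randn(2, 16, 2048)
print(FANLayer(2048, 2048)(x).shape)  # torch.Size([2, 16, 2048])
```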
Launch distributed training with 8 GPUs:
```bash
torchrun --nproc_per_node=8 scripts/train.py configs/test/FANformer-1B-pretrain.yaml
```

Run comprehensive evaluation using the OLMo benchmark suite (via OLMES):
```bash
olmes --model ${MODEL_PATH} --task main_suite::olmo1 --output-dir ${OUTPUT_DIR}
```

| Model | Non-embedding Parameters | Training Tokens | Download |
|---|---|---|---|
| FANformer-1B | 1.1B | 1T | Hugging Face or Google Drive |
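
If the downloaded checkpoint is in a Transformers-compatible format, it can be loaded roughly as follows. This is a sketch under assumptions: the path is a placeholder for the downloaded checkpoint, and `trust_remote_code=True` assumes the custom architecture ships its own modeling code.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder path: point this at the downloaded FANformer-1B checkpoint (or its hub id).
model_path = "path/to/FANformer-1B"

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)

# Quick smoke test: greedy generation from a short prompt.
inputs = tokenizer("The capital of France is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```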
```bibtex
@article{dong2025fanformer,
  title={FANformer: Improving Large Language Models Through Effective Periodicity Modeling},
  author={Dong, Yihong and Li, Ge and Jiang, Xue and Tao, Yongding and Zhang, Kechi and Zhu, Hao and Liu, Huanyu and Ding, Jiazheng and Li, Jia and Deng, Jinliang and Mei, Hong},
  journal={arXiv preprint arXiv:2502.21309},
  year={2025}
}

@article{dong2024fan,
  title={FAN: Fourier Analysis Networks},
  author={Dong, Yihong and Li, Ge and Tao, Yongding and Jiang, Xue and Zhang, Kechi and Li, Jia and Su, Jing and Zhang, Jun and Xu, Jingjing},
  journal={arXiv preprint arXiv:2410.02675},
  year={2024}
}
```