Skip to content

Commit 360d737

Browse files
author
stephenlei
committed
update readme
1 parent 35537d9 commit 360d737

File tree

1 file changed

+13
-5
lines changed

1 file changed

+13
-5
lines changed

README.md

Lines changed: 13 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,14 +1,17 @@
11
# SongGeneration
2+
23
<p align="center"><img src="img/logo.jpg" width="40%"></p>
34
<p align="center">
45
<a href="https://levo-demo.github.io/">Demo</a> &nbsp;|&nbsp; <a href="https://arxiv.org/abs/2506.07520">Paper</a> &nbsp;|&nbsp; <a href="https://huggingface.co/waytan22/SongGeneration">Hugging Face</a> &nbsp;|&nbsp; <a href="https://huggingface.co/spaces/waytan22/SongGeneration-LeVo">Space Demo</a>
56
</p>
67

78

8-
This repository is the official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment. In this repository, we provide the SongGeneration model, inference scripts, and checkpoints trained on the Million Song Dataset that support both Chinese and English.
9+
10+
This repository is the official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment. In this repository, we provide the SongGeneration model, inference scripts, checkpoints and some tools such as Dataset Pipeline.
911

1012
## News and Updates
1113

14+
* **2025.09.23 🔥**: We have released the [Data Processing Pipeline](https://github.com/tencent-ailab/SongPrep), which is capable of **analyzing the structure and lyrics** of entire songs and **providing precise timestamps** without the need for additional source separation. On the human-annotated test set [SSLD-200](https://huggingface.co/datasets/waytan22/SSLD-200), the model’s performance outperforms mainstream models including Gemini-2.5, Seed-ASR, and Qwen3-ASR.
1215
* **2025.07.25 🔥**: SongGeneration can now run with as little as **10GB of GPU memory**.
1316
* **2025.07.18 🔥**: SongGeneration now supports generation of **pure music**, **pure vocals**, and **dual-track (vocals + accompaniment separately)** outputs.
1417
* **2025.06.16 🔥**: We have released the **SongGeneration** series.
@@ -18,14 +21,15 @@ This repository is the official code repository for LeVo: High-Quality Song Gene
1821
- [ ] Update full time model.
1922
- [ ] Update English enhanced model.
2023
- [ ] Release finetuning scripts.
21-
- [x] Update Low memory usage model.
24+
- [x] Release data processing pipeline.
25+
- [x] Update low memory usage model.
2226
- [x] Support single vocal/bgm track generation.
2327

2428
## Model Versions
2529

26-
| Model | HuggingFace |
27-
| :----------------------: | :----------------------------------------------------------: |
28-
| SongGeneration-base | <a href="https://huggingface.co/tencent/SongGeneration/tree/main/ckpt/songgeneration_base">v20250520</a> |
30+
| Model | HuggingFace |
31+
| :------------------------: | :----------------------------------------------------------: |
32+
| SongGeneration-base | <a href="https://huggingface.co/tencent/SongGeneration/tree/main/ckpt/songgeneration_base">v20250520</a> |
2933
| SongGeneration-base(zh&en) | Coming soon |
3034
| SongGeneration-full(zh&en) | Coming soon |
3135

@@ -90,6 +94,7 @@ sh generate.sh ckpt_path lyrics.jsonl output_path
9094
- **Note:** If certain optional fields are not required, they can be omitted.
9195

9296
- Outputs of the loader `output_path`:
97+
9398
- `audio`: generated audio files
9499
- `jsonl`: output jsonls
95100

@@ -197,6 +202,7 @@ The `descriptions` field allows you to control various musical attributes of the
197202
## Tools
198203

199204
### Gradio UI
205+
200206
You can start up the UI with the following command:
201207

202208
```bash
@@ -220,9 +226,11 @@ The code and weights in this repository is released in the [LICENSE](LICENSE) f
220226

221227

222228
## Contact
229+
223230
Use WeChat or QQ to scan blow QR code.
224231

225232
<div style="display: flex; justify-content: center; gap: 20px; width: 100%;">
226233
<img src="img/contact.jpg" height="300" />
227234
<img src="img/contactQQ.jpg" height="300" />
228235
</div>
236+

0 commit comments

Comments
 (0)