Highlights
Pinned Loading
-
inference-preference-optimization
inference-preference-optimization PublicInference Preference Optimization (IPO) builds on GRPO by integrating memory retrieval into chain-of-thought reasoning for personalized inference.
Python 5
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

