Fix rope scaling factor #1605
This looks good! Can we add some tests?
@abuelnasr0 also apologies, we just changed our entire directory structure in #1608 (Hopefully for good reason, we want to allow But it does mean everything will need an annoying merge/rebase. If it'd help for me to do any of those and push to this branch, just lmk!
4c76fe6 to f93a813
I can add tests, but not until Sunday. Sorry about that; I will be AFK until then.
No rush at all! And thanks so much for all the major contributions to the library :) I am just getting back from vacation myself, slowly catching up on all the review.
LGTM! If tests pass, let's ship it!
You're welcome. I am trying to give back to the community as much as I can. And actually, contributing to the library has helped me improve; I am learning new things with each PR. Thank you and the other authors for creating the library, and thank you for all your reviews; they were really helpful to me.
Check out these lines. What is implemented in this PR is
I think the use of `scaling_factor` is wrong in the `RotaryEmbedding` layer. It is used to scale the positions, not the frequencies.

References:
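To make the positions-versus-frequencies point concrete, here is a minimal NumPy sketch of rotary angles with linear position scaling. It is not taken from the library source: the function name `rope_angles` and its parameters are hypothetical, chosen only for illustration. The assumption is that `scaling_factor` divides the position indices while the inverse frequencies depend only on the feature index.

```python
import numpy as np


def rope_angles(seq_len, rotary_dim, max_wavelength=10000.0, scaling_factor=1.0):
    """Return the (seq_len, rotary_dim // 2) matrix of rotary angles.

    Hypothetical helper for illustration; not the library's API.
    """
    # Inverse frequencies depend only on the feature index,
    # so scaling_factor does not appear here.
    freq_range = np.arange(0, rotary_dim, 2, dtype=np.float64) / rotary_dim
    inverse_freq = 1.0 / (max_wavelength ** freq_range)

    # scaling_factor compresses the positions (linear position scaling).
    positions = np.arange(seq_len, dtype=np.float64) / scaling_factor

    # Outer product: one angle per (position, frequency) pair.
    return np.outer(positions, inverse_freq)


base = rope_angles(seq_len=8, rotary_dim=4)
scaled = rope_angles(seq_len=16, rotary_dim=4, scaling_factor=2.0)

# With scaling_factor=2, every second scaled position reproduces an
# unscaled position, so 16 tokens span the angle range of 8.
assert np.allclose(scaled[::2], base)
```

Under this assumption, a sequence twice as long is mapped into the same angle range the model saw at the original length, which is the behavior a test for this change could check.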