Pull requests: beehive-lab/GPULlama3.java
Pull requests list
[WIP] Manipulation of Q8_0 tensors with Tornado ByteArrays
#79
opened Dec 4, 2025 by
orionpapadakis
•
Draft
[FP16] Improved performance by fusing dequantize with compute in kernels: 20-30% Inference Speedup
#78
opened Dec 3, 2025 by
mikepapadim
RMS normalization kernel optimization by fusing the reduction kernel and context mapping kernel
#77
opened Dec 1, 2025 by
yrq0208
Copy-in embeddings in reduced precision and handle precision conversion during inference
#73
opened Nov 26, 2025 by
mikepapadim
Add INTEGRATIONS.md showcasing LangChain4j and Quarkus integration ex…
#68
opened Nov 14, 2025 by
mikepapadim
Add Q4_0 quantization support for all models in TornadoVM path
#67
opened Nov 13, 2025 by
mikepapadim
•
Draft