Lucebox optimization hub: hand-tuned LLM inference, built for specific consumer hardware.
kernel cuda cuda-kernels nvidia-cuda luce rtx3090 llama-cpp local-ai qwen speculative-decoding dflash megakernel speculative-prefill pflash lucebox
-
Updated
May 5, 2026 - C++