Cactus Blog
Deep dives into on-device AI, inference optimization, and the engineering behind Cactus.
Latest
Models, Applications
The Sweet Spot for Mac Code Use: Reviewing LFM2 24B MoE A2B with Cactus
LFM2-24B-A2B has 24B total parameters but activates only 2B per token during inference. We break down the MoE architecture, GQA, and gated convolutions, and show how to run it locally with Cactus.
Noah Cylich
Henry Ndubuaku
