Low-Latency AI Engine For Mobiles & Wearables
by cactus-compute
4.4k
Fast, lightweight inference framework for energy-efficient on-device AI: a numerical computation-graph API, an OpenAI-compatible inference engine, INT8 optimizations, and models and tooling for compact, low-power deployments.
#llm, #framework, #ai
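Since the engine advertises OpenAI compatibility, a client should be able to send a standard chat-completions request body to it. A minimal sketch of such a payload is below; the endpoint URL and model identifier are assumptions for illustration, not documented values from the project.

```python
import json

# Hypothetical local endpoint for an OpenAI-compatible server
# (the host, port, and path are assumptions, not project defaults).
BASE_URL = "http://localhost:8080/v1/chat/completions"

# Standard OpenAI-style chat-completions request body; the model id
# "local-int8-model" is a placeholder, not a real cactus model name.
payload = {
    "model": "local-int8-model",
    "messages": [
        {"role": "system", "content": "You are a concise assistant."},
        {"role": "user", "content": "Summarize INT8 quantization in one sentence."},
    ],
    "max_tokens": 64,
    "temperature": 0.2,
}

# Serialize to JSON, as an HTTP client would POST it to BASE_URL.
body = json.dumps(payload)
print(len(body) > 0)
```

Any HTTP client (or the official OpenAI SDK pointed at a custom base URL) could then POST this body to the local server.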