Our new, fastest-ever Cactus v1 SDK is here!

Oxford Seed Fund · Google for Startups

The fastest way to deploy mobile AI

Deploy text, vision, and speech models locally on smartphones.

Minimize latency, guarantee privacy, decrease costs.

3.0k+ stars

<50 ms time to first token

Up to 300 tokens per second

Zero data leaves the device

Why on-device?

Predictable cost. Guaranteed privacy. Realtime performance.

Offline-ready

For secure facilities or internet-disabled devices.

Private

No data transmission by default and complete user privacy.

Multimodal

Deploy text, vision, and speech models through a single SDK.

Cloud fallback

Fall back to cloud inference if needed for longer or asynchronous tasks.
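As a rough illustration of the local-first routing described above, the sketch below decides per request whether to stay on device or fall back to the cloud. Everything in it is an assumption made for illustration: `chooseRoute`, `InferenceTask`, `RoutingPolicy`, and the token budget are hypothetical names and values, not part of the Cactus SDK.

```typescript
// Hypothetical routing helper; none of these names come from the Cactus SDK.
type Route = "local" | "cloud";

interface InferenceTask {
  promptTokens: number;     // estimated prompt length in tokens
  maxOutputTokens: number;  // requested generation budget
  asynchronous: boolean;    // true if the caller does not need a realtime reply
}

interface RoutingPolicy {
  localTokenBudget: number; // assumed cap on prompt + output tokens handled on device
  cloudAllowed: boolean;    // false for offline or privacy-restricted deployments
}

function chooseRoute(task: InferenceTask, policy: RoutingPolicy): Route {
  // Offline-ready and private by default: never leave the device unless allowed.
  if (!policy.cloudAllowed) return "local";
  // Asynchronous tasks can tolerate network latency, so send them to the cloud.
  if (task.asynchronous) return "cloud";
  // Longer tasks exceed the assumed on-device budget and fall back to the cloud.
  const totalTokens = task.promptTokens + task.maxOutputTokens;
  return totalTokens > policy.localTokenBudget ? "cloud" : "local";
}
```

With a budget of 2048 tokens, a short chat turn stays local, while a long summarization request or a background job is routed to the cloud only when the policy permits it.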

Agentic

Built-in mobile tool calling and agentic workflows.

Native Support

Pre-compiled builds for iOS, Android, and other platforms.

Platform Support

Get started with your preferred framework

Flutter

Dart package for Flutter apps

flutter pub add cactus


React Native

NPM package for React Native

npm install cactus-react-native


Kotlin Multiplatform

Kotlin package for KMP

// see docs for installation


Built-in Telemetry

One-line initialization with a CACTUS_TELEMETRY_TOKEN.

Track device engagement in real time

Monitor user activity, model usage, device performance, and inference types. Understand your user patterns without additional setup or configuration.

Get instant visibility into device-level metrics, inference throughput, latency, and user engagement.

Daily active devices

Daily active devices across your Cactus projects

Error rate

Daily error rate across your events
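To make the "daily error rate" metric concrete, here is a minimal sketch of one way such a figure could be computed from raw events: errors divided by total events, grouped by UTC calendar day. The `TelemetryEvent` shape and the `dailyErrorRate` function are hypothetical; the page does not describe how the dashboard actually aggregates its data.

```typescript
// Hypothetical event shape; not the Cactus telemetry schema.
interface TelemetryEvent {
  timestampMs: number; // Unix epoch time in milliseconds
  isError: boolean;    // true if the event recorded a failure
}

// Returns a map from UTC day ("YYYY-MM-DD") to errors / total events on that day.
function dailyErrorRate(events: TelemetryEvent[]): Record<string, number> {
  const counts: Record<string, { errors: number; total: number }> = {};
  for (const event of events) {
    // toISOString() is always UTC, so the first 10 characters are the UTC date.
    const day = new Date(event.timestampMs).toISOString().slice(0, 10);
    if (!counts[day]) counts[day] = { errors: 0, total: 0 };
    counts[day].total += 1;
    if (event.isError) counts[day].errors += 1;
  }
  const rates: Record<string, number> = {};
  for (const day of Object.keys(counts)) {
    rates[day] = counts[day].errors / counts[day].total;
  }
  return rates;
}
```

Grouping by UTC day keeps the result independent of the time zone of the machine running the aggregation; a per-device local-time grouping would be an equally valid choice.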

Optimize workflow performance

Capture error rates across your deployments. Identify problematic patterns or workflow performance degradation in real time.

Run out-of-the-box analytics to ensure your AI features remain reliable and performant.


Agent Builder Canvas

Create complex workflows on a simple interface


Performance benchmarks

Real-world performance data on popular consumer devices

Tokens per Second

Real-world decode performance measured through our demo apps
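The page does not state how these figures are measured, so the following is only a sketch of one common convention: time to first token is the delay from request start to the first generated token, and decode throughput is the number of tokens after the first divided by the time spent producing them. `DecodeTrace` and both function names are hypothetical.

```typescript
// Hypothetical trace of a single generation; not a Cactus SDK type.
interface DecodeTrace {
  requestStartMs: number; // when the request was issued
  tokenTimesMs: number[]; // arrival time of each generated token, ascending
}

// Delay between issuing the request and receiving the first token.
function timeToFirstTokenMs(trace: DecodeTrace): number {
  if (trace.tokenTimesMs.length === 0) {
    throw new Error("trace contains no tokens");
  }
  return trace.tokenTimesMs[0] - trace.requestStartMs;
}

// Decode throughput: tokens produced after the first one, per second.
// The first token is excluded because its latency is dominated by prompt processing.
function decodeTokensPerSecond(trace: DecodeTrace): number {
  const count = trace.tokenTimesMs.length;
  if (count < 2) return 0;
  const elapsedMs = trace.tokenTimesMs[count - 1] - trace.tokenTimesMs[0];
  if (elapsedMs <= 0) return 0;
  return ((count - 1) * 1000) / elapsedMs;
}
```

For a trace that starts at 0 ms and receives tokens at 40, 50, 60, 70, and 80 ms, this gives a time to first token of 40 ms and a decode rate of 100 tokens per second.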


Frequently Asked Questions

Everything you need to know about deploying AI on mobile

Which smartphones does Cactus support?

How does on-device inference compare to cloud APIs?

What are the minimum device requirements?

How do I handle model updates?

Is my user data secure?

Is Cactus free?

Does Cactus run on desktop?

Try demo apps

Experience the Cactus SDK in action with our demo applications, available on the App Store and Google Play.

Ready to find your Edge?

Join thousands of developers building the future of mobile AI