Mobile deployment question β€” can LTX-2.3 run on phones?

#59
by 3morixd - opened

LTX-2.3's efficiency is impressive. Question for the Lightricks team: has anyone tested this on mobile?

We have a 40-phone farm (Snapdragon 865) and we're looking for the first practical mobile video generation model.

Specific questions:

  1. What's the minimum model size after 4-bit quantization?
  2. Can it generate 3-second clips at 512x512 on a phone?
  3. Any plan for a mobile-optimized variant?
  4. Would you be open to collaborating on a mobile LTX deployment?

We're in Sharjah, UAE β€” happy to share phone farm benchmarks in exchange!

  • Dispatch AI (FZE), Sharjah UAE

Baby Son GIF

Hi,

When running on a phone, you have the advantage of the shared memory architecture, but you are most likely bound on compute.
LTX-2.3 is extremely efficient in that sense, given the low number of tokens allow for less compute in attention. That said, in order to make it useable on SnapDragon or A series processors, it's best to invest in refining it.
My goto would be:

  1. Weight distillation to a smaller model - 22B parameters is still a lot, even after quantization.
  2. Quantization Aware Training to go to 4 bits, most likely INT4 as the target since it's mobile chips.
  3. Finetune on the target resolution, as well as step distillation - The model is geared towards 1080p and up, if you want 512x512 it's best to finetune and while you're at it, distill it to 1/2 steps.

Currently, our focus in mobile edge is for Physical AI, so it's not exactly the same as phones.

Sign up or log in to comment