How to run LLaMA (and other LLMs) on Android.

llama@lemmy.dbzer0.com · edited 4 months ago

projectmoon@forum.agnos.is · 4 months ago

@llama@lemmy.dbzer0.com It depends on the inference engine. Some of them will keep trying to load the model until they run out of memory and crash, which can cause its own problems, but that won’t overheat the phone. If you DO use a model that the phone can run, though, it will heat the phone up like any intense computation. Best not to run a long inference prompt while the phone is in your pocket, I think.

llama@lemmy.dbzer0.com · edited 4 months ago

Thanks for your comment. That is certainly something to look out for. It is really important to know what you’re running and what its limitations might be. That’s not what the original comment said, though.
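One rough way to check a limit like memory before loading anything is to compare the model size against what the kernel reports as available. The sketch below is only an illustration: the function names `fits_in_memory` and `mem_available_kb`, the 20% headroom, and the placeholder model size are all assumptions, and it assumes `/proc/meminfo` is readable from your shell.

```shell
#!/bin/sh
# Hypothetical helpers, not part of Termux or Ollama.

# Print the kernel's MemAvailable figure in kB (assumes /proc/meminfo is readable).
mem_available_kb() {
    awk '/^MemAvailable:/ {print $2}' /proc/meminfo
}

# fits_in_memory MODEL_BYTES AVAILABLE_KB [HEADROOM_PERCENT]
# Succeeds if the model size plus headroom fits in the available memory.
# The default 20% headroom is an assumed rule of thumb, not a measured value.
fits_in_memory() {
    model_bytes=$1
    avail_kb=$2
    headroom=${3:-20}
    needed_kb=$(( model_bytes / 1024 * (100 + headroom) / 100 ))
    [ "$needed_kb" -le "$avail_kb" ]
}

# Example use (placeholder size; substitute the size of the model you pull):
#   MODEL_BYTES=1300000000
#   if fits_in_memory "$MODEL_BYTES" "$(mem_available_kb)"; then
#       echo "probably fits"
#   else
#       echo "probably too large for this phone"
#   fi
```

This only tells you whether loading is likely to succeed; it says nothing about how warm the phone gets once inference is running.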


Step 1: Install Termux

Step 2: Set Up proot-distro and Install Debian

Step 3: Install Dependencies

Step 4: Install Ollama

Step 5: Download and Run the Llama3.2:1B Model
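
The steps above can be sketched roughly as the commands below. This is a sketch under assumptions, not necessarily the exact commands from the original post: it assumes Termux is already installed (Step 1), that `curl` is the only extra dependency needed, that Ollama is installed with its official Linux install script, and that a few seconds is enough for the server to start. The function names are hypothetical; they only group the commands so that pasting the block runs nothing by itself. You can just as well type the commands one at a time.

```shell
#!/bin/sh
# Step 1 (installing the Termux app) happens outside the shell.

# Step 2: run in Termux. Installs proot-distro and a Debian root filesystem.
setup_debian() {
    pkg update && pkg upgrade -y
    pkg install -y proot-distro
    proot-distro install debian
    # Afterwards, enter Debian with: proot-distro login debian
}

# Steps 3-4: run inside Debian (you are root there, so no sudo is needed).
install_ollama() {
    apt update && apt upgrade -y
    apt install -y curl                 # assumed dependency: used to fetch the installer
    curl -fsSL https://ollama.com/install.sh | sh
}

# Step 5: run inside Debian. Starts the server, then pulls and runs the model.
run_model() {
    ollama serve >/dev/null 2>&1 &      # server in the background
    sleep 5                             # assumed wait for the server to come up
    ollama run llama3.2:1b              # downloads the model on first run
}
```

In this layout you would call `setup_debian` in Termux, log in with `proot-distro login debian`, and then call `install_ollama` followed by `run_model` inside Debian.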