Skip to content

Commit

Permalink
Update run_llm_inference.sh with recommended models.
Browse files Browse the repository at this point in the history
PiperOrigin-RevId: 698802697
  • Loading branch information
schmidt-sebastian authored and copybara-github committed Nov 21, 2024
1 parent 2fab0dd commit df03022
Showing 1 changed file with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions run_llm_inference.sh
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,12 @@

# This is a simple script to run LLM inference on Android via the MediaPipe
# LLM inference engine.
#
# This script allows running transformer-based LLM models in *.task or *.bin
# format. We recommend using `gemma2-2b-it-cpu-int8.task` (from
# https://www.kaggle.com/models/google/gemma-2/tfLite/gemma2-2b-it-cpu-int8) or
# the smaller `gemma-1.1-2b-it-cpu-int4.bin` model (from
# https://www.kaggle.com/models/google/gemma/tfLite/gemma-1.1-2b-it-cpu-int4).

MODEL_FILENAME="gemma2-2b-it-cpu-int8.task"
ADB_WORK_DIR="/data/local/tmp"
Expand Down

0 comments on commit df03022

Please sign in to comment.