
egorfine 1 day ago | on: I ran Gemma 4 as a local model in Codex CLI

> That too was broken in mlx-lm (it crashed), but has since been fixed on the main branch

Unfortunately, I have had zero success running Gemma with the mlx-lm main branch. Can you point me to the right way to do it? I have zero experience with mlx-lm.


Confiks 1 day ago

Create a venv in ./venv, activate it, and run:

> pip3 install git+https://github.com/ml-explore/mlx-lm.git

> ./venv/bin/mlx_lm.generate --model "$MODEL" --temp 1.0 --top-p 0.95 --top-k 64 --max-tokens 128000 --prompt "Hello world"

Where $MODEL is an unsloth model like:

- unsloth/gemma-4-E4B-it-UD-MLX-4bit

- unsloth/gemma-4-26b-a4b-it-UD-MLX-4bit
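For anyone who would rather script these steps, here is a minimal sketch in Python that assembles the same commands as argument lists. It assumes the venv lives in ./venv and is created with `python3 -m venv`, which the commands above imply but do not state. The helper names `build_commands` and `run_all` are hypothetical, and all flags are copied from the commands above.

```python
import subprocess
from pathlib import Path

# Install source taken from the pip3 command above (mlx-lm main branch).
MLX_LM_REPO = "git+https://github.com/ml-explore/mlx-lm.git"


def build_commands(model, prompt, venv_dir="venv"):
    """Return argv lists for: create the venv, install mlx-lm, run generation.

    Assumption: a POSIX-style venv layout with executables under <venv>/bin.
    """
    bin_dir = Path(venv_dir) / "bin"
    return [
        # Step 1 (assumed): create the virtual environment.
        ["python3", "-m", "venv", str(venv_dir)],
        # Step 2: install mlx-lm from the main branch into that venv.
        [str(bin_dir / "pip3"), "install", MLX_LM_REPO],
        # Step 3: generate, using the sampling flags from the command above.
        [
            str(bin_dir / "mlx_lm.generate"),
            "--model", model,
            "--temp", "1.0",
            "--top-p", "0.95",
            "--top-k", "64",
            "--max-tokens", "128000",
            "--prompt", prompt,
        ],
    ]


def run_all(model, prompt, venv_dir="venv"):
    """Run each step in order, stopping at the first failure."""
    for argv in build_commands(model, prompt, venv_dir):
        subprocess.run(argv, check=True)
```

Calling `run_all("unsloth/gemma-4-E4B-it-UD-MLX-4bit", "Hello world")` would run all three steps. Nothing is executed on import, so you can print the output of `build_commands` first to check the commands.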

egorfine 23 hours ago

Thanks!
