ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

Why use it

Best for

beginner

Deployment options

compose · docker

Resource requirements

server

Alternative to

No mapping yet

Common setup stack

Reverse proxy · HTTPS certs · auth gateway · backups

Alternatives & similar apps

1000UserGuide

Compare with ipex-llm

13ft

Compare with ipex-llm

1backend

Compare with ipex-llm

4gaBoards

Compare with ipex-llm