This repository provides a minimal CPU-only Ollama Docker image, specifically designed to run on systems without GPU support. At just 70MB, this image is significantly smaller than the official Ollama image, which is around 4GB.
```
REPOSITORY   TAG      IMAGE ID       CREATED       SIZE
ollama       latest   b99944c07117   3 hours ago   69.3MB
```
[***]
[***]
[***]
- **Lightweight:** The official Ollama image is over 4GB, which is overkill for systems that only need CPU-based inference. At roughly 70MB, this image is much faster to download and deploy.
- **CPU-only support:** This image is tailored for systems without GPUs, so you can run Ollama efficiently even in basic or resource-constrained environments, without specialized hardware.
- **Run anywhere:** Whether you're working on local servers, edge devices, or cloud environments that don't offer GPU resources, this image lets you run Ollama anywhere, focusing purely on CPU-based operations.
```bash
docker pull alpine/ollama
```
Remove any existing container and start the Ollama server:

```bash
docker rm -f ollama
docker run -d -p ***:*** -v ~/.ollama:/root/.ollama --name ollama alpine/ollama
```
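To confirm the server started cleanly, you can tail the container logs (a standard Docker command, nothing specific to this image):

```bash
docker logs -f ollama
```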
Pull the llama3.2 model. This only needs to run once: the model is saved in the mounted volume, so you can reuse it later.

```bash
docker exec -ti ollama ollama pull llama3.2
```
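Once the model is pulled, you can also chat with it interactively from inside the container using the standard `ollama run` command:

```bash
docker exec -ti ollama ollama run llama3.2
```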
If you don't want to download the model yourself, you can use the alpine/llama3.2 image directly. I created it with the "llama3.2" model already integrated.
```bash
docker run -d -p ***:*** --name llama3.2 alpine/llama3.2
```
```bash
$ curl http://localhost:***/api/generate -d '{ "model": "llama3.2", "prompt": "Why is the sky blue?" }'
{"model":"llama3.2","created_at":"2024-10-16T00:25:58.59931201Z","response":"The","done":false}
{"model":"llama3.2","created_at":"2024-10-16T00:25:58.695826838Z","response":" sky","done":false}
{"model":"llama3.2","created_at":"2024-10-16T00:25:58.780917761Z","response":" appears","done":false}
{"model":"llama3.2","created_at":"2024-10-16T00:25:58.992556209Z","response":" blue","done":false}
{"model":"llama3.2","created_at":"2024-10-16T00:25:59.085970606Z","response":" because","done":false}
{"model":"llama3.2","created_at":"2024-10-16T00:25:59.30869749Z","response":" of","done":false}
...
```
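By default the generate endpoint streams one JSON object per token. If you would rather receive a single response object, the Ollama API also accepts a `stream` flag (same port placeholder as above):

```bash
curl http://localhost:***/api/generate -d '{
  "model": "llama3.2",
  "prompt": "Why is the sky blue?",
  "stream": false
}'
```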
If you monitor CPU usage while the model is generating, for example with htop, you will see it spike, since all inference runs on the CPU.
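You can get the same per-container view with a standard Docker command:

```bash
docker stats ollama
```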
You can also deploy an Ollama web UI to chat with the model directly. Many tools are available; I won't recommend a specific one.
This image can be deployed to any environment. For example, in a Kubernetes cluster you can use it to analyze logs, streamlining them with local LLMs; a rough sketch follows.
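As a minimal sketch (the deployment name is illustrative, and I'm assuming Ollama's default listening port of 11434 inside the container):

```bash
# Run the CPU-only image as a Deployment (the name "ollama" is illustrative)
kubectl create deployment ollama --image=alpine/ollama

# Expose it inside the cluster; 11434 is Ollama's default API port
kubectl expose deployment ollama --port=11434

# Pull a model into the running pod
kubectl exec -it deploy/ollama -- ollama pull llama3.2
```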
