Local LLM Inventory
Last verified: 2026-06-28
Host: sol.system42.one local archive
This page records the local language-model assets currently present on disk under /home/david/.cache/models, the local runtimes available on this machine, and whether each model family should be considered open-weight.
Local runtimes
- ollama at ~/.local/bin/ollama
- llama-server at ~/.local/bin/llama-server
- llama-cli at ~/.local/bin/llama-cli
Ollama is installed, but ollama list was empty at the time of verification. The local model inventory below is therefore based on GGUF files present on disk rather than Ollama-managed registrations.
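A disk-based inventory like the one below could be gathered with a short script. This is a minimal sketch, not the procedure actually used for this page: the helper names `human_size` and `list_gguf` are hypothetical, and it assumes the script runs as the user whose home directory holds `.cache/models`.

```python
from pathlib import Path


def human_size(num_bytes: int) -> str:
    """Format a byte count with binary units, roughly like `ls -lh`."""
    size = float(num_bytes)
    for unit in ("B", "K", "M", "G", "T"):
        if size < 1024 or unit == "T":
            return f"{int(size)}B" if unit == "B" else f"{size:.1f}{unit}"
        size /= 1024


def list_gguf(model_dir: Path) -> list:
    """Return (filename, size in bytes) for every *.gguf file, sorted by name."""
    return sorted(
        (p.name, p.stat().st_size)
        for p in model_dir.glob("*.gguf")
        if p.is_file()
    )


if __name__ == "__main__":
    # Assumed location; adjust if the models live elsewhere.
    model_dir = Path.home() / ".cache" / "models"
    for name, size in list_gguf(model_dir):
        print(f"{human_size(size):>8}  {name}")
```

The listing only reports what is on disk; it says nothing about whether a file loads.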
Installed model files
1. DeepSeek-R1-Distill-Qwen-7B
- File: /home/david/.cache/models/DeepSeek-R1-Distill-Qwen-7B-Q4_K_M.gguf
- Size: 4.4G
- Format: GGUF
- Likely role: local reasoning / general chat
2. Llama-3.2-1B-Instruct
- File: /home/david/.cache/models/Llama-3.2-1B-Instruct-Q4_K_M.gguf
- Size: 771M
- Format: GGUF
- Likely role: lightweight local chat, lower memory footprint
3. Qwen2.5-7B-Instruct
- File: /home/david/.cache/models/Qwen2.5-7B-Instruct-Q4_K_M.gguf
- Size: 163M
- Format: GGUF
- Status note: the file has a valid GGUF header, but the size is much smaller than expected for a 7B Q4 model and should be treated as suspicious or incomplete until load-tested
4. Qwen3VL-8B-Thinking
- File: /home/david/.cache/models/Qwen3VL-8B-Thinking-Q4_K_M.gguf
- Size: 4.7G
- Format: GGUF
- Likely role: multimodal reasoning when paired with the projection weights listed below
5. gemma-3-4b-it
- File: /home/david/.cache/models/gemma-3-4b-it-Q4_K_M.gguf
- Size: 2.4G
- Format: GGUF
- Likely role: compact local instruction model
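The status note on the Qwen2.5-7B-Instruct file rests on two checks: the header is valid, and the size is implausible. Both can be sketched as follows. The header layout assumed here is GGUF version 2 or later, little-endian (4-byte magic, uint32 version, uint64 tensor count, uint64 metadata key-value count). The figure of about 4.5 bits per weight for a Q4_K_M quantization and the 50% tolerance are rough assumptions for illustration, not measured values, and the function names are hypothetical.

```python
import struct
from pathlib import Path

GGUF_MAGIC = b"GGUF"
HEADER_LEN = 24  # 4-byte magic + uint32 version + two uint64 counts


def read_gguf_header(path: Path) -> dict:
    """Parse the fixed-size GGUF header (assumes GGUF v2+, little-endian)."""
    with open(path, "rb") as fh:
        raw = fh.read(HEADER_LEN)
    if len(raw) < HEADER_LEN or raw[:4] != GGUF_MAGIC:
        raise ValueError(f"{path}: missing or incomplete GGUF header")
    version, tensor_count, kv_count = struct.unpack("<IQQ", raw[4:])
    return {"version": version, "tensor_count": tensor_count, "kv_count": kv_count}


def expected_size_bytes(n_params: float, bits_per_weight: float = 4.5) -> float:
    """Rough on-disk size: parameters times bits per weight, in bytes."""
    return n_params * bits_per_weight / 8


def looks_truncated(actual_bytes: int, n_params: float,
                    bits_per_weight: float = 4.5, tolerance: float = 0.5) -> bool:
    """Flag files smaller than `tolerance` times the rough expected size."""
    return actual_bytes < tolerance * expected_size_bytes(n_params, bits_per_weight)


if __name__ == "__main__":
    # 163M on disk versus a nominal 7e9 parameters (expected: several GB).
    print(looks_truncated(163 * 1024**2, 7e9))
```

A passing header check is necessary but not sufficient: an interrupted download keeps the first bytes intact, which is consistent with a valid header on a file that is far too small.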
Multimodal projection files
These are not separate LLMs. They are auxiliary weights used for multimodal vision-language pipelines.
- /home/david/.cache/models/mmproj-Qwen3VL-8B-Thinking-F16.gguf (1.1G)
- /home/david/.cache/models/mmproj-model-f16.gguf (812M)
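One way to decide which projection file belongs to which model is to match on file names. The sketch below assumes a naming convention of `mmproj-<model base name>-<precision>.gguf`, which holds for the Qwen3VL pair on this machine but is not a general rule. The function `match_projector` is hypothetical. A generically named file such as mmproj-model-f16.gguf cannot be paired this way and needs to be traced back to its download source.

```python
from typing import List, Optional


def match_projector(model_name: str, projector_names: List[str]) -> Optional[str]:
    """Return the projector whose name embeds the model's base name, if any.

    Assumes the model file ends in "-<quantization>.gguf" and the projector
    is named "mmproj-<base>-<precision>.gguf".
    """
    stem = model_name[:-len(".gguf")] if model_name.endswith(".gguf") else model_name
    base = stem.rsplit("-", 1)[0]  # drop the quantization suffix, e.g. Q4_K_M
    for proj in projector_names:
        if proj.startswith(f"mmproj-{base}-"):
            return proj
    return None


if __name__ == "__main__":
    projectors = ["mmproj-Qwen3VL-8B-Thinking-F16.gguf", "mmproj-model-f16.gguf"]
    for model in ["Qwen3VL-8B-Thinking-Q4_K_M.gguf", "gemma-3-4b-it-Q4_K_M.gguf"]:
        print(model, "->", match_projector(model, projectors))
```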
Open-weight status
Open-weight with standard permissive licensing
# DeepSeek-R1-Distill-Qwen-7B
- Classification: open-weight
- License position: published with available weights; model card indicates MIT for the DeepSeek release
- Source: <https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B>
# Qwen2.5-7B-Instruct
- Classification: open-weight
- License position: Apache 2.0
- Source: <https://huggingface.co/Qwen/Qwen2.5-7B-Instruct>
# Qwen3-VL-8B-Thinking
- Classification: open-weight
- License position: Apache 2.0 on the official GGUF release
- Source: <https://huggingface.co/Qwen/Qwen3-VL-8B-Thinking-GGUF>
Open-weight, but under custom or non-OSI terms
# gemma-3-4b-it
- Classification: open-weight
- License position: Google publishes Gemma as open weights, but under Gemma-specific terms rather than a standard OSI open-source license
- Sources:
- <https://ai.google.dev/gemma/docs/core/model_card_3>
- <https://ai.google.dev/gemma/terms>
# Llama-3.2-1B-Instruct
- Classification: publicly available weights
- License position: Meta Llama 3.2 Community License, not a standard open-source license
- Source: <https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct>
Summary
Models on this machine that are clearly open-weight and permissively licensed
- DeepSeek-R1-Distill-Qwen-7B
- Qwen2.5-7B-Instruct
- Qwen3VL-8B-Thinking
Models on this machine with public/open weights but custom licensing
- gemma-3-4b-it
- Llama-3.2-1B-Instruct
Notes
- The inventory reflects files physically present on disk on 2026-06-28.
- Runtime availability does not guarantee that every file is intact or currently tuned for usable inference.
- The Qwen2.5-7B-Instruct GGUF should be considered the main candidate for re-download or integrity verification because its size is unexpectedly small.
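
For the integrity verification mentioned above, one common approach is to compare a SHA-256 digest of the local file against a checksum published by the upstream repository, if one is available. The sketch below only computes the digest; the function name `sha256_of` is hypothetical, and no reference checksum is recorded on this page.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream a file through SHA-256 so multi-gigabyte models fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in 1 MiB chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    target = Path.home() / ".cache" / "models" / "Qwen2.5-7B-Instruct-Q4_K_M.gguf"
    if target.is_file():
        print(sha256_of(target))
```

A mismatch, or a file that cannot be checked at all, is a reasonable trigger for re-downloading.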