LLM_API_Demonstrations

This repository contains demonstration functions for carrying various tasks with LLMs using the API, against different backends (Ollama, with either native or OpenAI APIs, and Hugging Face)

demo_ollama_backend.py

To use, first create a conda environment

conda create --name ollama

and install the ollama API

pip install ollama

and, optionally, the OpenAI API (f you want to use that component of the demo)

pip install openai

demo_huggingface_local_model.py

This demo loads a model from the local ../models/ folder (Qwen3.5-0.8B by default) and runs inference locally via the Hugging Face transformers library. The conversation pattern is identical to demo_ollama_backend.py; only the inference call changes.

Create a fresh conda environment in this folder:

conda create --prefix ../envs/hf-local python=3.11
conda activate ../envs/hf-local

and install the dependencies:

pip install torch torchvision --index-url https://download.pytorch.org/whl/cu124
pip install transformers accelerate pillow safetensors

(the explicit --index-url pins torch to a CUDA 12.4 build, which is what works on our hardware; on a different machine you may want a different CUDA build or the CPU-only one).

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
demo_huggingface_local_model.py		demo_huggingface_local_model.py
demo_ollama_backend.py		demo_ollama_backend.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM_API_Demonstrations

demo_ollama_backend.py

demo_huggingface_local_model.py

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM_API_Demonstrations

demo_ollama_backend.py

demo_huggingface_local_model.py

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages