Tutorials

Getting Started with LM Studio for Local LLMs

Download, install, and configure LM Studio to run local LLMs with a graphical interface, OpenAI-compatible API, and model library.

Robson PereiraMay 31, 20268 min read

LM Studio graphical interface running a local language model.

Getting Started with LM Studio for Local LLMs

LM Studio is one of the most accessible ways to run local language models on a personal machine. It bundles model discovery, download, inference, and an OpenAI-compatible API into a single desktop application, making it an attractive option for people who want local AI without the command-line overhead.

What you need before installing

LM Studio runs on macOS, Windows, and Linux. It works best on systems with a dedicated GPU and at least 16 GB of system memory, but smaller quantised models can run on machines with less hardware. Download the installer from the official website and run through the standard setup steps.

For a broader view of hardware requirements, read Best Hardware for Self-Hosted AI.

Browse and download models

The built-in model browser connects to Hugging Face and lets you search, filter, and download models directly. Start with a popular quantised model in the 7B to 13B parameter range — these offer a good balance of quality and performance on consumer hardware.

Understand GGUF format

LM Studio uses GGUF-format models, which are pre-quantised for efficient CPU and GPU inference. The format keeps model files smaller and loading times faster compared to raw model weights.

Run your first inference session

Once the model is downloaded, select it from the model list, adjust the context length if your GPU memory allows, and start chatting. The interface shows token generation speed, memory usage, and model metadata so you can track performance.

If you prefer a web-based interface, compare the experience in Open WebUI vs AnythingLLM.

Enable the local API server

LM Studio includes a local HTTP server that exposes an OpenAI-compatible endpoint. This is useful for connecting LM Studio to other tools, automation workflows, and chat interfaces. Enable it from the server tab, note the port, and point your tools to `http://localhost:1234/v1`.

For workflow automation ideas, see Build Your Own AI Assistant with n8n.

Switch models without restarting

One of LM Studio's strengths is the ability to load and unload models on the fly. You can switch between a fast coding model and a larger reasoning model within the same session, keeping the API server running throughout.

Conclusion

LM Studio lowers the barrier for local LLM experimentation. Its graphical interface, built-in model browser, and OpenAI-compatible API make it a strong choice for beginners and intermediate users who prefer a desktop experience over terminal-based workflows.

FAQ

Is LM Studio free?

Yes. The application is free to download and use with any compatible model from Hugging Face.

Can LM Studio use multiple GPUs?

It supports offloading layers across GPU and CPU, but full multi-GPU support depends on the backend and model format.

Does LM Studio support vision models?

Vision-language models are supported through the same interface when the model is compatible with the underlying inference engine.

Getting Started with LM Studio for Local LLMs

Getting Started with LM Studio for Local LLMs

What you need before installing

Browse and download models

Understand GGUF format

Run your first inference session

Enable the local API server

Switch models without restarting

Conclusion

FAQ

Is LM Studio free?

Can LM Studio use multiple GPUs?

Does LM Studio support vision models?

Related articles

How to Add Local Documents to Open WebUI with RAG and Ollama

How to Deploy Open WebUI and Ollama on a Private LAN with Docker Compose

How to Build a Self-Hosted AI Workstation with Docker and Multiple Model Runners