🧠 Using Microsoft AI Toolkit with FastFlowLM in VS Code
This guide explains how to run FastFlowLM locally on Windows and connect it to Microsoft AI Toolkit in Visual Studio Code.
✅ 1. Install Visual Studio Code (Windows)
- Go to: https://code.visualstudio.com
- Download the User Installer for Windows
- Run the installer:
- ✅ Check “Add to PATH”
- ✅ (Optional) Create a desktop icon
- ✅ Accept the license agreement
- Complete the installation
📦 2. Install AI Toolkit Extension
- Launch VS Code
- Open the Extensions panel:
Ctrl + Shift + X
- Search:
AI Toolkit for Visual Studio Code
- Click Install
You’ll now see the AI Toolkit icon on the sidebar.
🧠 3. Install FastFlowLM & Pull a Model
Download & install FastFlowLM: (../../install.md)
-
Open PowerShell
-
Pull a base model:
flm pull llama3.2:1b
- Confirm it’s installed:
flm list
You should see models like llama3.2:1b
listed.
🧩 4. Add FastFlowLM Model in AI Toolkit via Custom Endpoint
- In VS Code, open the AI Toolkit panel
- Navigate to Models → Catalog
- Click ➕ Add Your Own Model
- In the top bar, select Add Custom Model
- Enter OpenAI compatible chat completion endpoint URI:
http://localhost:11434/v1/chat/completions
Click Enter
- Enter the exact model name as in the API:
llama3.2:1b
Click Enter
- Enter display model name:
flm-llama3.2:1b
Click Enter
- Enter API key:
dummy
Click Enter
- You will now see the model under My Models
📡 5. Activate Server Mode with a Model
Open powershell, enter
flm serve llama3.2:1b
💬 6. Use the Model in Playground
- Switch to the Playground tab in AI Toolkit
- Choose your custom FastFlowLM model from the dropdown
- Type a prompt, e.g.:
What are the benefits of local inference?
- Click Send
- The model response will stream from your local FastFlowLM instance
🗑️ 7. Remove a Model from “My Models” in AI Toolkit
To remove a previously added model from the My Models section in the AI Toolkit:
- Navigate to AI Toolkit → My Models
- Expand the Custom section
- Right-click the model you wish to remove (e.g.,
llama3.2:1b
) - Select Delete
🧹 This action removes the model’s reference from the AI Toolkit interface but does not delete the model files from local disk.