🧩 Model Card: Qwen3-0.6B

▶️ Run with FastFlowLM in PowerShell:

flm run qwen3:0.6b

📝 Note:

  • CLI: Type /think to toggle on/off interactively.
  • Server Mode: Set the "think" flag in the request payload.

🧩 Model Card: Qwen3-1.7B

▶️ Run with FastFlowLM in PowerShell:

flm run qwen3:0.6b

📝 Note:

  • CLI: Type /think to toggle on/off interactively.
  • Server Mode: Set the "think" flag in the request payload.

🧩 Model Card: Qwen3-4B

▶️ Run with FastFlowLM in PowerShell:

flm run qwen3:4b

📝 Note:

  • CLI: Type /think to toggle on/off interactively.
  • Server Mode: Set the "think" flag in the request payload.

🧩 Model Card: Qwen3-8B

▶️ Run with FastFlowLM in PowerShell:

flm run qwen3:8b

📝 Note:

  • CLI: Type /think to toggle on/off interactively.
  • Server Mode: Set the "think" flag in the request payload.

🧩 Model Card: Qwen3-4B-Thinking-2507

▶️ Run with FastFlowLM in PowerShell:

flm run qwen3-tk:4b

🧩 Model Card: Qwen3-4B-Instruct-2507

▶️ Run with FastFlowLM in PowerShell:

flm run qwen3-it:4b

🧩 Model Card: Qwen3-VL-4B-Instruct

▶️ Run with FastFlowLM in PowerShell:

flm run qwen3vl-it:4b

📝 Note

  • Image understanding adapts to image size. Image TTFT can range from under 1 second to ~200 seconds depending on resolution. Use lower-resolution images (720p or below) unless high resolution is required (e.g. OCR on small text).
  • Video understanding is not supported yet.