Tag: Python

Guide: Run Gemma 4 NVFP4 From ComfyUI

Published by Zanno on 19 April 2026

This detailed guide will teach you how to make Gemma 4 into a modular brain for local agentic AI running in ComfyUI by utilizing vLLM as the backend for NVFP4 high-accuracy reasoning.

ComfyUI Installation Tutorial

ComfyUI Installation Tutorial

Published by Zanno on 14 November 2025

Follow our complete step-by-step guide to perform a clean portable installation, downgrade PyTorch, and fix the common xformers dependency error.

Ultimate Guide: Converting Models to Safetensors

Ultimate Guide: Converting Models to Safetensors

Published by Zanno on 4 July 2025

I created a single script that can handle virtually any PyTorch-based model you throw at it and convert to a safetensors file.

Flash Attention for Windows: Get Warp Speed!

Flash Attention for Windows: Get Warp Speed!

Published by Zanno on 30 May 2025

Today, I’m sharing a guide on how to compile and install Flash Attention v2.7.4.post1 on your Windows machine.