Skip to content
#

int4

Here are 12 public repositories matching this topic...

Research and training stack for AVA β€” a tool-using, memory-aware virtual assistant targeting 4 GB VRAM. Spans custom transformers, verifier-RL, external memory, multi-domain benchmarks, and Gemma 4 inference optimization.

  • Updated May 8, 2026
  • Python

PyTorch implementation of TRIAD-PTQ (Trace-Router-Interaction-Aware Decomposition) β€” weight-only INT3/INT4 PTQ for compact LLMs and edge CNNs/ViTs, with real benchmarks on SmolLM/TinyLlama/MobileNetV2/EfficientNet-B0/MobileViT-S.

  • Updated May 5, 2026
  • Python

Improve this page

Add a description, image, and links to the int4 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the int4 topic, visit your repo's landing page and select "manage topics."

Learn more