tt - Tuned Tensor CLI

tt is the command-line tool for Tuned Tensor. Use it to define behaviour specs, validate them, launch fine-tuning runs, manage datasets, and download or serve trained models.

The main CLI documentation lives at tunedtensor.com/docs/cli. This README is a short install and development reference.

Install

npm install -g @tuned-tensor/cli
tt --version

Run from source:

git clone https://github.com/tunedtensor/tuned-tensor-cli.git
cd tuned-tensor-cli
npm install
npm run build
npm link

Quick Start

tt auth login
tt init --name "Customer Support Bot" --model Qwen/Qwen3.5-2B

# Edit tunedtensor.json, then:
tt eval
tt push
tt runs estimate <spec-id>
tt runs start <spec-id>
tt runs watch <run-id>

To continue training from a completed fine-tuned model artifact:

tt runs start <spec-id> --parent-model <model-id>

Useful discovery commands:

tt specs list
tt datasets list
tt runs list
tt models list
tt models base
tt balance

Label real, unlabeled data with a teacher model (JSONL with {"input": ...} rows, or CSV with --input-column; up to 50,000 rows / 50 MB). Labeling runs as a managed cloud workflow — upload and disconnect; the teacher drafts outputs under your spec's system prompt and you review before anything trains:

tt label upload tickets.csv --spec <spec-id> --input-column body --watch
tt label watch <job-id>                     # re-attach to progress any time
tt label rows <job-id> --status labeled     # review the teacher's drafts
tt label accept <job-id> --all              # or accept/reject/edit by row
tt label promote <job-id> --name tickets-v1 # becomes a validated dataset
tt runs start <spec-id> --dataset <dataset-id>

Export a model to GGUF and package it for Ollama (so it's pluggable like any other local model, e.g. in OpenClaw via Ollama's native /api/chat):

# Convert + quantize to GGUF, write a Modelfile, and run `ollama create`
tt models export <model-id> --format gguf --quant q4_k_m --ollama

# Inspect the planned llama.cpp / ollama commands without running them
tt models export <model-id> --quant q8_0 --ollama --print-command

This wraps llama.cpp's convert_hf_to_gguf.py + llama-quantize and Ollama's ollama create. Point tt at your llama.cpp checkout with --llama-cpp <dir> (or --convert-script / --quantize-bin); with --ollama the behaviour spec's system prompt is embedded as the Modelfile SYSTEM block.

For the full command reference, including dataset-backed runs, long-example policies, eval token budgets, preflight run estimates, continued fine-tuning, evaluation caps, local model serving, configuration, and billing, see the CLI docs.

Development

npm install
npm run build
npm run dev
npm run typecheck
npm test

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
.github/workflows		.github/workflows
src		src
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
tsup.config.ts		tsup.config.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tt - Tuned Tensor CLI

Install

Quick Start

Development

License

About

Uh oh!

Releases 24

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

tt - Tuned Tensor CLI

Install

Quick Start

Development

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 24

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages