# Getting Started

This quick-start guide will help you build and run your first encoderfile in under 10 minutes.

## Prerequisites

### encoderfile CLI Tool

You need the `encoderfile` CLI tool installed:

* **Pre-built binary** (recommended) - Fastest setup for Linux/macOS users

  ```bash
  curl -fsSL https://raw.githubusercontent.com/mozilla-ai/encoderfile/main/install.sh | sh
  ```
* **Build from source** - Required for Windows, or for the latest development features
  * See [our guide on building encoderfile CLI from source](/encoderfile/reference/building.md)
* **Docker** - Best for CI/CD or isolated builds without installing dependencies
  * Check out our guide on [Building Encoderfiles with Docker](/encoderfile/building-encoderfiles/docker.md)

### Python with Optimum

For exporting models to ONNX:

> Requires Python 3.13+

```bash
pip install "optimum[onnxruntime]" onnxruntime
```

The ONNX Runtime and Hugging Face Optimum documentation cover the runtime itself, which Hugging Face models it supports, and how to export a model to this format.

* * *

## Your First Encoderfile

Let's build a sentiment analysis model as an example.

### Step 1: Export Model to ONNX

Export a Hugging Face model to ONNX format:

```bash
optimum-cli export onnx \
  --model distilbert-base-uncased-finetuned-sst-2-english \
  --task text-classification \
  ./sentiment-model
```

This creates a directory with the required files:

```
sentiment-model/
├── config.json
├── model.onnx          # ONNX weights
├── tokenizer.json      # Tokenizer
└── ... (other files)
```

**Available task types:**

* `feature-extraction` - For embedding models
* `text-classification` - For sequence classification
* `token-classification` - For NER/token tagging

### Step 2: Create Configuration File

Create `sentiment-config.yml`:

```yaml
encoderfile:
  name: sentiment-analyzer
  version: "1.0.0"
  path: ./sentiment-model
  model_type: sequence_classification
  output_path: ./build/sentiment-analyzer.encoderfile
```

**Key fields:**

* `name` - Model identifier (used in API responses)
* `path` - Path to the model directory with ONNX weights
* `model_type` - `embedding`, `sequence_classification`, or `token_classification`
* `output_path` - Where to output the binary (optional, defaults to `./.encoderfile`)

### Step 3: Build the Binary

Build your encoderfile:

```bash
encoderfile build -f sentiment-config.yml
```

> **Note:** If you built the CLI from source, use: `./target/release/encoderfile build -f sentiment-config.yml`

The binary will be created at `./build/sentiment-analyzer.encoderfile`.
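
Before running the build in Step 3, it can help to confirm that the export in Step 1 produced the files the guide lists. The sketch below is a hypothetical helper, not part of the `encoderfile` CLI; the function name `check_model_dir` is an assumption made for illustration, and it checks only the three files named in Step 1.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the encoderfile CLI): verify that an
# exported model directory contains the files listed in Step 1.
check_model_dir() {
  local dir="$1"
  local missing=0
  local f
  for f in config.json model.onnx tokenizer.json; do
    if [ ! -f "$dir/$f" ]; then
      # Report each missing file on stderr and remember the failure.
      echo "missing: $dir/$f" >&2
      missing=1
    fi
  done
  # Exit status 0 when every file is present, 1 otherwise.
  return "$missing"
}
```

One way to use it is to gate the build on the check, for example `check_model_dir ./sentiment-model && encoderfile build -f sentiment-config.yml`.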
### Step 4: Run the Server

Start your encoderfile server:

```bash
chmod +x ./build/sentiment-analyzer.encoderfile
./build/sentiment-analyzer.encoderfile serve
```

You should see:

```
Starting HTTP server on 0.0.0.0:8080
Starting gRPC server on [::]:50051
```

### Step 5: Make Predictions

Test with curl:

```bash
curl -X POST http://localhost:8080/predict \
  -H "Content-Type: application/json" \
  -d '{
    "inputs": [
      "This product is amazing!",
      "Terrible experience, very disappointed"
    ]
  }'
```

Expected response:

```json
{
  "results": [
    {
      "logits": [-4.123, 4.567],
      "scores": [0.0001, 0.9999],
      "predicted_index": 1,
      "predicted_label": "POSITIVE"
    },
    {
      "logits": [4.234, -3.987],
      "scores": [0.9998, 0.0002],
      "predicted_index": 0,
      "predicted_label": "NEGATIVE"
    }
  ],
  "model_id": "sentiment-analyzer"
}
```

## Quick Examples

### Embedding Model

```bash
# Export
optimum-cli export onnx \
  --model sentence-transformers/all-MiniLM-L6-v2 \
  --task feature-extraction \
  ./embedding-model

# Config
cat > embedding-config.yml <…
```
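
The quick examples follow the same export, config, build pattern as the walkthrough, with only the task and `model_type` changing. As a rough sketch of the config step, the helper below writes a config file using the fields documented in Step 2. The function name `write_encoderfile_config` is hypothetical, and the `version` and `output_path` values it emits are assumptions copied from the Step 2 example rather than requirements of the tool.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the encoderfile CLI): write a config file
# using the fields documented in Step 2.
# Usage: write_encoderfile_config NAME MODEL_DIR MODEL_TYPE CONFIG_FILE
write_encoderfile_config() {
  local name="$1"
  local model_dir="$2"
  local model_type="$3"
  local config_file="$4"

  # Accept only the model types listed under "Key fields" in Step 2.
  case "$model_type" in
    embedding|sequence_classification|token_classification) ;;
    *)
      echo "unsupported model_type: $model_type" >&2
      return 1
      ;;
  esac

  # The version and output_path values are illustrative assumptions.
  cat > "$config_file" <<EOF
encoderfile:
  name: $name
  version: "1.0.0"
  path: $model_dir
  model_type: $model_type
  output_path: ./build/$name.encoderfile
EOF
}
```

For the embedding model exported above, a call such as `write_encoderfile_config my-embedder ./embedding-model embedding embedding-config.yml` (where `my-embedder` is a placeholder name) would produce a file that can then be passed to `encoderfile build -f embedding-config.yml`.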