Skip to content

Configuration Reference

Strata is configured via environment variables (prefixed with STRATA_) or a [tool.strata] section in pyproject.toml.

Precedence: defaults < pyproject.toml < environment variables < programmatic overrides

Server

Variable Default Description
STRATA_HOST 127.0.0.1 Server bind address
STRATA_PORT 8765 Server port
STRATA_DEPLOYMENT_MODE service personal or service
STRATA_ALLOW_REMOTE_CLIENTS_IN_PERSONAL false Allow non-localhost clients in personal mode

Cache

Variable Default Description
STRATA_CACHE_DIR ~/.strata/cache Disk cache location
STRATA_MAX_CACHE_SIZE_BYTES 10737418240 (10 GB) Max cache size
STRATA_CACHE_GRANULARITY row_group_projection row_group_projection or row_group

Fetcher

Variable Default Description
STRATA_BATCH_SIZE 65536 Rows per batch
STRATA_FETCH_PARALLELISM 4 Max concurrent fetches per scan
STRATA_MAX_FETCH_WORKERS 32 Max threads in fetch pool
STRATA_FETCH_TIMEOUT_SECONDS 60.0 Per-fetch timeout

Resource Limits

Variable Default Description
STRATA_MAX_CONCURRENT_SCANS 100 Max concurrent scans
STRATA_MAX_TASKS_PER_SCAN 1000 Max row groups per scan
STRATA_PLAN_TIMEOUT_SECONDS 30.0 Planning timeout
STRATA_SCAN_TIMEOUT_SECONDS 300.0 Scan streaming timeout
STRATA_MAX_RESPONSE_BYTES 536870912 (512 MB) Max response size (413 if exceeded)

QoS (Two-Tier Admission)

Variable Default Description
STRATA_INTERACTIVE_SLOTS 32 Interactive tier concurrency
STRATA_BULK_SLOTS 8 Bulk tier concurrency
STRATA_INTERACTIVE_MAX_BYTES 10485760 (10 MB) Max bytes for interactive classification
STRATA_INTERACTIVE_MAX_COLUMNS 10 Max columns for interactive classification

Metadata

Variable Default Description
STRATA_METADATA_DB None SQLite path for metadata persistence

S3 Storage

Variable Default Description
STRATA_S3_REGION None AWS region
STRATA_S3_ENDPOINT_URL None Custom endpoint (MinIO, LocalStack)
STRATA_S3_ACCESS_KEY None Access key (falls back to AWS_ACCESS_KEY_ID)
STRATA_S3_SECRET_KEY None Secret key (falls back to AWS_SECRET_ACCESS_KEY)
STRATA_S3_ANONYMOUS false Use anonymous access

Artifact Storage

Variable Default Description
STRATA_ARTIFACT_DIR None Artifact store directory
STRATA_ARTIFACT_BLOB_BACKEND local local, s3, gcs, or azure
STRATA_ARTIFACT_S3_BUCKET None S3 bucket for artifacts
STRATA_ARTIFACT_S3_PREFIX artifacts S3 key prefix
STRATA_ARTIFACT_GCS_BUCKET None GCS bucket for artifacts
STRATA_ARTIFACT_GCS_PREFIX artifacts GCS prefix
STRATA_ARTIFACT_AZURE_CONTAINER None Azure container
STRATA_ARTIFACT_AZURE_PREFIX artifacts Azure prefix

Authentication

Variable Default Description
STRATA_AUTH_MODE none none or trusted_proxy
STRATA_PROXY_TOKEN None Shared secret for proxy verification
STRATA_PRINCIPAL_HEADER X-Strata-Principal Header for user identity
STRATA_SCOPES_HEADER X-Strata-Scopes Header for permission scopes
STRATA_HIDE_FORBIDDEN_AS_NOT_FOUND true Return 404 instead of 403

Multi-Tenancy

Variable Default Description
STRATA_MULTI_TENANT_ENABLED false Enable multi-tenant mode
STRATA_TENANT_HEADER X-Tenant-ID Header for tenant identification
STRATA_REQUIRE_TENANT_HEADER false Require tenant header on all requests

Notebook

Variable Default Description
STRATA_NOTEBOOK_STORAGE_DIR /tmp/strata-notebooks Default notebook storage directory
STRATA_NOTEBOOK_PYTHON_VERSIONS current server Python minor Available Python versions (JSON array or comma-separated list)

Rate Limiting

Variable Default Description
STRATA_RATE_LIMIT_ENABLED true Enable rate limiting
STRATA_RATE_LIMIT_GLOBAL_RPS 1000.0 Global requests per second
STRATA_RATE_LIMIT_CLIENT_RPS 100.0 Per-client requests per second
STRATA_RATE_LIMIT_SCAN_RPS 50.0 Scan endpoint rate limit

Observability

Variable Default Description
STRATA_LOG_LEVEL INFO Log level
STRATA_LOG_FORMAT json json or text
STRATA_TRACING_ENABLED false Enable OpenTelemetry
OTEL_EXPORTER_OTLP_ENDPOINT None OTLP collector endpoint
OTEL_SERVICE_NAME strata Service name for traces

AI / LLM Assistant

Variable Default Description
STRATA_AI_BASE_URL None OpenAI-compatible API base URL
STRATA_AI_MODEL None Model identifier (e.g. claude-sonnet-4-20250514, gpt-4o)
STRATA_AI_API_KEY None API key (generic, works with any provider)
STRATA_AI_MAX_CONTEXT_TOKENS 100000 Max context tokens sent to the model
STRATA_AI_MAX_OUTPUT_TOKENS 4096 Max output tokens requested
STRATA_AI_TIMEOUT_SECONDS 60.0 LLM request timeout
ANTHROPIC_API_KEY None Anthropic API key (auto-sets base URL + model)
OPENAI_API_KEY None OpenAI API key (auto-sets base URL + model)
GEMINI_API_KEY None Google Gemini API key (auto-sets base URL + model)
MISTRAL_API_KEY None Mistral API key (auto-sets base URL + model)

Provider-specific keys auto-configure base_url and model defaults. STRATA_AI_* variables override provider defaults. Notebook-level [ai] config in notebook.toml can override api_key, base_url, model, max_context_tokens, max_output_tokens, and timeout_seconds.

Timeouts

Variable Default Description
STRATA_S3_CONNECT_TIMEOUT_SECONDS 10.0 S3 connection timeout
STRATA_S3_REQUEST_TIMEOUT_SECONDS 30.0 S3 request timeout
STRATA_PLAN_TIMEOUT_SECONDS 30.0 Planning phase timeout
STRATA_SCAN_TIMEOUT_SECONDS 300.0 Scan streaming timeout
STRATA_FETCH_TIMEOUT_SECONDS 60.0 Per-fetch timeout