🗄️ Database Systems 🔬 Zenodo ✍️ Medium Article 🦀 Rust 🤖 LLM-Native ⚡ Information-Theoretic

SEMANTIX

Learned Semantic Cost Models for LLM-Native Relational Engines

Treating AI/LLM inference as a core database primitive

A paradigm shift in relational query optimization. SEMANTIX couples LLM inference with learned semantic cost estimation to cut token waste and semantic misalignment. On an extended TPC-H benchmark it uses 3.2× fewer inference tokens and runs 1.8× faster than classical PostgreSQL, at 97.1% semantic accuracy.

⚡ Rust implementation • PostgreSQL 14+ • Production-ready v0.1.0

"Current systems decouple LLM retrieval from cost-aware query planning."

This architectural choice results in token waste, semantic misalignment, and unbounded latency. The database doesn't know how to optimize for AI. The AI doesn't know its cost. They operate in isolation. SEMANTIX unifies them.

The Solution: Semantic Cost Modeling

SEMANTIX treats LLM inference as a first-class database primitive, coupled with learned semantic cost estimation through information-theoretic foundations.

Four-Phase Architecture

┌─────────────────────────────────────────────────────────────┐
│                  SEMANTIX Query Optimizer                   │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│  Phase 1: Semantic Parsing                                  │
│  ├─ NL Query → Bidirectional Semantic Anchor                │
│  └─ Output: LogicalPlan + Initial Cost Estimates            │
│                                                             │
│  Phase 2: Cost Refinement                                   │
│  ├─ Learned Cost Model (GBDT)                               │
│  └─ Output: Refined token cost estimates                    │
│                                                             │
│  Phase 3: Adaptive Token Scheduling                         │
│  ├─ Constrained Optimization (Lagrangian)                   │
│  └─ Output: Token allocation schedule                       │
│                                                             │
│  Phase 4: Execution + Feedback Loop                         │
│  ├─ Execute with schedule                                   │
│  └─ Update cost model with actual execution data            │
│                                                             │
└─────────────────────────────────────────────────────────────┘

Key Innovations

📊 Formal Cost Architecture

Unified cost model embedding semantic entropy, relational context preservation, and execution schedule conditioning.

🔗 Bidirectional Semantic Anchors

Learned projections mapping NL intent to cost-parametric logical plans with full provenance.

⏱️ Adaptive Token Scheduling

Dynamic token allocation under latency constraints using Lagrangian relaxation and iterative refinement.

🔄 Continuous Learning

Feedback loop integrating actual execution metrics to refine cost models in real-time.
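
One way to picture the feedback loop is as a running correction applied to the model's token estimates. The sketch below is a simplified stand-in, not the SEMANTIX implementation (which retrains a GBDT model): it keeps an exponentially weighted ratio of observed to predicted token cost and rescales later predictions by it. `FeedbackCorrector`, `smoothing`, and the method names are hypothetical.

```rust
/// Hypothetical online corrector: tracks an exponential moving average of
/// the observed/predicted token ratio and rescales future estimates by it.
#[derive(Debug, Clone)]
pub struct FeedbackCorrector {
    ratio: f64,     // running observed/predicted ratio, starts at 1.0
    smoothing: f64, // EMA weight in (0, 1]; higher reacts faster
}

impl FeedbackCorrector {
    pub fn new(smoothing: f64) -> Self {
        assert!(
            smoothing > 0.0 && smoothing <= 1.0,
            "smoothing must be in (0, 1]"
        );
        Self { ratio: 1.0, smoothing }
    }

    /// Fold one executed query into the running ratio.
    /// Samples with a non-positive prediction are ignored.
    pub fn observe(&mut self, predicted_tokens: f64, actual_tokens: f64) {
        if predicted_tokens <= 0.0 {
            return;
        }
        let sample = actual_tokens / predicted_tokens;
        self.ratio = (1.0 - self.smoothing) * self.ratio + self.smoothing * sample;
    }

    /// Apply the current correction to a fresh model prediction.
    pub fn correct(&self, predicted_tokens: f64) -> f64 {
        predicted_tokens * self.ratio
    }
}
```

A multiplicative correction is only one possible design; it adapts quickly to systematic over- or under-estimation but cannot capture per-operator errors the way a retrained model can.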

Performance Results

Evaluated on extended TPC-H with semantic annotations. SEMANTIX demonstrates:

Inference Token Cost: 3.2× reduction vs PostgreSQL
End-to-End Latency: 1.8× speedup (25.3 ms vs 45.3 ms)
Semantic Accuracy: 97.1% maintained
Energy Reduction: 65.6% vs classical systems

Comparison Against Baselines

System	Tokens (K)	Latency (ms)	Accuracy (%)	Energy (Wh)
SEMANTIX	3.1	25.3	97.1	1.24
Classical PostgreSQL	9.9	45.3	89.4	3.61
RAG-Optimized	8.2	42.1	91.3	3.04
Semantic Entropy	5.4	33.7	94.8	1.89
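
The headline figures can be re-derived from the SEMANTIX and Classical PostgreSQL rows of the table. The snippet below is only an arithmetic check; the `Row` type and function names are illustrative and not part of the project.

```rust
/// One row of the baseline comparison table.
#[derive(Debug, Clone, Copy)]
pub struct Row {
    pub tokens_k: f64,
    pub latency_ms: f64,
    pub accuracy_pct: f64,
    pub energy_wh: f64,
}

pub const SEMANTIX: Row = Row { tokens_k: 3.1, latency_ms: 25.3, accuracy_pct: 97.1, energy_wh: 1.24 };
pub const POSTGRES: Row = Row { tokens_k: 9.9, latency_ms: 45.3, accuracy_pct: 89.4, energy_wh: 3.61 };

/// How many times fewer tokens `ours` uses than `base`.
pub fn token_reduction(base: &Row, ours: &Row) -> f64 {
    base.tokens_k / ours.tokens_k
}

/// Latency speedup of `ours` over `base`.
pub fn speedup(base: &Row, ours: &Row) -> f64 {
    base.latency_ms / ours.latency_ms
}

/// Energy saved by `ours` relative to `base`, in percent.
pub fn energy_reduction_pct(base: &Row, ours: &Row) -> f64 {
    (1.0 - ours.energy_wh / base.energy_wh) * 100.0
}
```

With the table values these evaluate to roughly 3.19×, 1.79×, and 65.65%, which agrees with the rounded figures quoted above.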

Mathematical Foundations

Equation 1: Semantic Token Cost

The core cost model combines information-theoretic entropy with execution schedule conditioning:

C_{\text{sem}}(\pi, \sigma) = \sum_{i=1}^{n} \left[ H(i \mid \Sigma^{\text{ctx}}(i)) + \gamma \cdot \text{delay}(o_j, \sigma) + \beta \cdot \text{staleness}(o_j, \sigma) \right]

where:

H(i | Σ^ctx(i)) = Conditional semantic entropy of token i given context
γ = Delay weight parameter (controls latency penalty)
β = Staleness weight parameter (controls stale context penalty)
σ = Execution schedule (allocation of computational resources)
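
As a rough illustration of how Equation 1 is evaluated, the sketch below sums the three contributions for terms whose entropy, delay, and staleness have already been computed under a fixed schedule σ. The `CostTerm` struct and `semantic_cost` function are hypothetical names; how SEMANTIX actually estimates each quantity is not shown here.

```rust
/// One summand of Equation 1, pre-evaluated for a fixed schedule σ.
/// (Hypothetical type: the real estimators for each field are not shown.)
#[derive(Debug, Clone, Copy)]
pub struct CostTerm {
    pub entropy: f64,   // H(i | Σ^ctx(i))
    pub delay: f64,     // delay(o_j, σ)
    pub staleness: f64, // staleness(o_j, σ)
}

/// C_sem = Σ [ entropy + γ · delay + β · staleness ]
pub fn semantic_cost(terms: &[CostTerm], gamma: f64, beta: f64) -> f64 {
    terms
        .iter()
        .map(|t| t.entropy + gamma * t.delay + beta * t.staleness)
        .sum()
}
```

For example, with γ = 0.3 and β = 0.5 (the `delay_weight` and `staleness_weight` values from the sample configuration), two terms (2.0, 10.0, 4.0) and (1.5, 0.0, 2.0) give 7.0 + 2.5 = 9.5.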

Equation 4: Bidirectional Semantic Anchor

Maps natural language queries to cost-parametric logical plans:

\varphi(\text{NL query}) = (\text{LogicalPlan}, \{c_1, \ldots, c_k\})
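
The output shape of φ can be pictured as a plan tree paired with a list of cost estimates. The types below are a hypothetical sketch, not the crate's actual API: the operator set is borrowed from the profiler example (Scan, Filter, Join), and the check that k equals the number of operators is an assumption made for illustration.

```rust
/// Hypothetical logical plan tree with three operator kinds.
#[derive(Debug, Clone, PartialEq)]
pub enum LogicalPlan {
    Scan { table: String },
    Filter { predicate: String, input: Box<LogicalPlan> },
    Join { left: Box<LogicalPlan>, right: Box<LogicalPlan> },
}

impl LogicalPlan {
    /// Number of operators in the tree.
    pub fn operator_count(&self) -> usize {
        match self {
            LogicalPlan::Scan { .. } => 1,
            LogicalPlan::Filter { input, .. } => 1 + input.operator_count(),
            LogicalPlan::Join { left, right } => {
                1 + left.operator_count() + right.operator_count()
            }
        }
    }
}

/// The pair (LogicalPlan, {c_1, ..., c_k}) produced by the anchor.
#[derive(Debug, Clone, PartialEq)]
pub struct AnchoredPlan {
    pub plan: LogicalPlan,
    pub costs: Vec<f64>,
}

impl AnchoredPlan {
    /// Assumption: one cost estimate per operator, so k = operator_count().
    pub fn new(plan: LogicalPlan, costs: Vec<f64>) -> Result<Self, String> {
        let k = plan.operator_count();
        if costs.len() != k {
            return Err(format!("expected {} cost estimates, got {}", k, costs.len()));
        }
        Ok(Self { plan, costs })
    }

    /// Sum of the initial per-operator estimates.
    pub fn total_cost(&self) -> f64 {
        self.costs.iter().sum()
    }
}
```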

Algorithm 1: Adaptive Token Scheduling

Solves the constrained optimization problem using Lagrangian relaxation:

\begin{aligned} \text{minimize} & \quad \sum_{j=1}^{m} c_j^{\text{allocated}} \\ \text{subject to} & \quad \sum_{j=1}^{m} \text{latency}(o_j, c_j^{\text{allocated}}) \leq L_{\max} \\ & \quad c_j^{\min} \leq c_j^{\text{allocated}} \leq c_j^{\max} \end{aligned}

Convergence: The iterative algorithm converges to an ε-optimal solution in O(log(1/ε)) iterations, where ε is set by the configurable convergence_threshold (default 0.001) and the loop is capped at max_iterations (default 1000).
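
To make the optimization concrete, here is a minimal sketch of Lagrangian relaxation on this problem. It is not the SEMANTIX scheduler. It assumes a latency model that the source does not specify, latency(o_j, c) = base_ms + work / c, which decreases as more tokens are allocated. Under that assumption the relaxed problem has the closed-form minimizer c_j = sqrt(λ · work_j), clamped to [c_min, c_max], and the multiplier λ is found by bisection, which gives the logarithmic iteration count. All names (`Operator`, `schedule`, `work`, `base_ms`) are hypothetical.

```rust
/// Hypothetical operator description. Assumed latency model:
/// latency(c) = base_ms + work / c, with work > 0 and c_min <= c_max.
#[derive(Debug, Clone, Copy)]
pub struct Operator {
    pub base_ms: f64,
    pub work: f64,
    pub c_min: f64,
    pub c_max: f64,
}

fn latency(op: &Operator, c: f64) -> f64 {
    op.base_ms + op.work / c
}

pub fn total_latency(ops: &[Operator], alloc: &[f64]) -> f64 {
    ops.iter().zip(alloc).map(|(op, &c)| latency(op, c)).sum()
}

/// Minimizer of c + λ · latency(c) for each operator, clamped to its bounds.
fn allocate(ops: &[Operator], lambda: f64) -> Vec<f64> {
    ops.iter()
        .map(|op| (lambda * op.work).sqrt().clamp(op.c_min, op.c_max))
        .collect()
}

/// Bisection on the multiplier λ. Returns None when even the maximum
/// budgets cannot meet `l_max`.
pub fn schedule(ops: &[Operator], l_max: f64, eps: f64, max_iter: usize) -> Option<Vec<f64>> {
    let at_max: Vec<f64> = ops.iter().map(|o| o.c_max).collect();
    if total_latency(ops, &at_max) > l_max {
        return None; // infeasible
    }
    let at_min = allocate(ops, 0.0); // λ = 0 yields the minimum budgets
    if total_latency(ops, &at_min) <= l_max {
        return Some(at_min); // latency constraint is slack
    }
    // Invariant: `lo` violates the latency bound, `hi` satisfies it.
    let mut lo = 0.0;
    let mut hi = ops
        .iter()
        .map(|o| o.c_max * o.c_max / o.work)
        .fold(0.0, f64::max);
    for _ in 0..max_iter {
        if hi - lo <= eps * hi {
            break;
        }
        let mid = 0.5 * (lo + hi);
        if total_latency(ops, &allocate(ops, mid)) <= l_max {
            hi = mid;
        } else {
            lo = mid;
        }
    }
    Some(allocate(ops, hi))
}
```

Because total latency is non-increasing in λ under this model, each bisection step halves the bracket, so reaching a relative tolerance ε takes O(log(1/ε)) steps. A different latency model would need a different inner minimizer, and possibly a subgradient update on λ instead of bisection.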

Installation & Setup

Prerequisites

Rust 1.70+
PostgreSQL 14+
Python 3.10+ — for data generation scripts
8GB RAM, 4-core CPU minimum — NVIDIA GPU optional

Quick Start

# Clone repository
git clone https://github.com/novas-workshop-2026/learned-semantic-costs.git semantix
cd semantix

# Build project (release optimized)
cargo build --release

# Create PostgreSQL database
createdb semantix

# Initialize schema
psql -d semantix -f schema/tpch_schema.sql

# Generate TPC-H with semantic annotations
cargo run --release --bin data-generator

# Load data (\copy reads the CSV on the client side, relative to the current directory)
psql -d semantix -c "\copy orders FROM 'tpch_orders_semantic.csv' CSV HEADER"

# Profile operator latencies
cargo run --release --bin cost-profiler

# Run benchmark
cargo run --release --bin benchmark

Start the SEMANTIX Daemon

cargo run --release --bin semantix-daemon

Usage

Programmatic API (Rust)

use semantix::SemanticQueryOptimizer;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    // Initialize optimizer
    let mut optimizer = SemanticQueryOptimizer::new(
        "postgresql://localhost/semantix"
    ).await?;

    // Execute query with full semantic optimization
    let result = optimizer.optimize_and_execute(
        "SELECT * FROM orders WHERE custkey = 1"
    ).await?;

    // Check metrics
    let metrics = optimizer.get_metrics();
    println!("Tokens: {}, Latency: {}ms, Accuracy: {:.2}%",
        metrics.avg_token_cost,
        metrics.avg_latency_ms,
        metrics.avg_semantic_accuracy * 100.0
    );

    // Provide feedback for continuous learning
    optimizer.feedback(&result.context);

    Ok(())
}

Command-Line Interface

# Profile specific query
cargo run --release --bin benchmark -- --query "SELECT * FROM orders LIMIT 100"

# Generate data with custom scale
cargo run --release --bin data-generator -- --scale-factor 10

# Profile specific operators
cargo run --release --bin cost-profiler -- --operators "Scan,Filter,Join"

Configuration

Configuration File (semantix.toml)

[anchor_config]
encoder_model_path = "models/bert-encoder-semantic.bin"
decoder_model_path = "models/bert-decoder-semantic.bin"
max_sequence_length = 512
embedding_dim = 768
semantic_drift_threshold = 0.15

[cost_model_config]
model_type = "gbdt"
model_path = "models/cost_model.xgb"
entropy_weight = 1.0
delay_weight = 0.3
staleness_weight = 0.5
min_token_budget = 100
max_token_budget = 10000

[scheduler_config]
max_latency_ms = 50
latency_sigma = 0.1
alpha = 0.01
convergence_threshold = 0.001
max_iterations = 1000

[database]
url = "postgresql://localhost/semantix"
log_level = "info"
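
For readers wiring this file into code, the `[scheduler_config]` table could map onto a plain struct like the one below. This is a sketch under assumptions: the field types and the `validate` checks are guesses, and the crate's real `config.rs` may differ. Only the field names and default values come from the sample file above. Parsing the TOML itself would need an external crate and is omitted.

```rust
/// Hypothetical mirror of the [scheduler_config] table.
#[derive(Debug, Clone, PartialEq)]
pub struct SchedulerConfig {
    pub max_latency_ms: u64,
    pub latency_sigma: f64,
    pub alpha: f64,
    pub convergence_threshold: f64,
    pub max_iterations: u32,
}

impl Default for SchedulerConfig {
    /// Defaults copied from the sample semantix.toml.
    fn default() -> Self {
        Self {
            max_latency_ms: 50,
            latency_sigma: 0.1,
            alpha: 0.01,
            convergence_threshold: 0.001,
            max_iterations: 1000,
        }
    }
}

impl SchedulerConfig {
    /// Basic sanity checks (assumed, not taken from the project).
    pub fn validate(&self) -> Result<(), String> {
        if self.max_latency_ms == 0 {
            return Err("max_latency_ms must be positive".into());
        }
        if !(self.convergence_threshold > 0.0) {
            return Err("convergence_threshold must be positive".into());
        }
        if self.max_iterations == 0 {
            return Err("max_iterations must be positive".into());
        }
        Ok(())
    }
}
```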

Environment Variables

export DATABASE_URL="postgresql://user:password@localhost/semantix"
export LOG_LEVEL="debug"
export SEMANTIX_CONFIG="path/to/semantix.toml"

Testing & Profiling

Test Suite

# Run all tests
cargo test
cargo test --doc
cargo test --all-features

# Integration tests (requires PostgreSQL)
cargo test --test integration_tests -- --test-threads=1

# Benchmark tests
cargo bench

Performance Profiling

# CPU profiling with flamegraph
cargo install flamegraph
cargo flamegraph --bin benchmark
# Open flamegraph.svg in browser

# Memory profiling
valgrind --tool=massif ./target/release/benchmark
ms_print massif.out.<pid>   # Massif names its output massif.out.<pid>; substitute the PID of the profiled run

# Latency profiling
cargo run --release --bin cost-profiler -- --detailed-report

Project Structure

semantix/
├── Cargo.toml                 # Rust dependencies
├── src/
│   ├── lib.rs               # Main library exports
│   ├── semantic_anchors.rs  # NL → LogicalPlan translation
│   ├── cost_model.rs        # Learned cost estimation
│   ├── scheduler.rs         # Adaptive token scheduling (Alg 1)
│   ├── database.rs          # PostgreSQL integration
│   ├── executor.rs          # Query execution engine
│   ├── metrics.rs           # Performance tracking
│   ├── config.rs            # Configuration management
│   ├── errors.rs            # Error types
│   └── bin/
│       ├── daemon.rs        # Main optimizer service
│       ├── profiler.rs      # Latency profiler
│       ├── data_gen.rs      # TPC-H data generation
│       └── benchmark.rs     # Performance evaluation
├── schema/
│   └── tpch_schema.sql      # PostgreSQL schema
├── tests/
│   ├── integration_tests.rs # End-to-end tests
│   └── unit_tests.rs        # Component tests
├── docker/
│   ├── Dockerfile           # Container image
│   └── docker-compose.yml   # Multi-container setup
└── README.md

Contributing

We welcome contributions! The project uses:

🦀 Rust 1.70+

Full type safety and memory safety guarantees.

✓ Tests

Unit and integration tests for all components.

📝 Documentation

Inline docs, rustdoc, and comprehensive guides.

# Fork, create branch, and submit PR
git checkout -b feature/amazing-feature
git commit -m 'Add amazing feature'
git push origin feature/amazing-feature

# Ensure code quality
cargo test
cargo fmt
cargo clippy -- -D warnings

Citation

@inproceedings{semantix2026,
  title={Learned Semantic Cost Models for Adaptive Token-Efficient 
         Query Optimization in LLM-Native Relational Engines},
  author={Prakul Sunil Hiremath},
  year={2026}
}

Status: v0.1.0

SEMANTIX — LLM-Native Relational Engines with Learned Semantic Costs

Apache License 2.0 · Rust 1.70+ · PostgreSQL 14+ · Open Source

"The database finally learned to talk to AI."