Flipmap · Semantic Layer

NL2SQL Semantic Resolution Pipeline

Natural language to verified, executable SQL — end-to-end intelligence in milliseconds

Production Architecture · LLM + RAG + Compiler · PE-2361 · Semantic Layer MVP

Why this is hard to build

NL2SQL is a solved problem for simple schemas — but enterprise blockchain analytics requires multi-signal RAG retrieval, schema-aware generation, compiler-backed validation, and a self-improving feedback loop. Every stage below is a separate engineering challenge. SQL compilation is only one small piece of the puzzle.

5 Pipeline Phases · 4 RAG Strategies · 40 Agent Steps Max · 0.75 Confidence Gate
User Input — Natural Language Query
"What is the market cap of USDC on Ethereum over the last 30 days?"
01 · Analysis (LLM Structured Parse)
1. Structural Parse: LLM decomposes NL into structured signals
2. Intent Extraction: primary + secondary intent with confidence
3. Component Mapping: metric signal, entity signal, domain hint
4. Filter Normalisation: time range, value filters, limit/order
5. Canonical Form: normalised representation for few-shot matching
6. Address Scan: regex extraction of blockchain addresses (0x...)
7. Confidence Score: base + ambiguity bonus → clamp [0, 1]
Output: NormalizedQuery — intent, atomic components, normalised filters, canonical form, entities, and processing trace
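The address scan and confidence clamp above are small, deterministic pieces and can be sketched directly. Everything here — the field layout of `NormalizedQuery`, the function names — is illustrative, not the production schema:

```python
import re
from dataclasses import dataclass, field

# EVM-style address: "0x" followed by 40 hex characters (illustrative pattern)
ADDRESS_RE = re.compile(r"0x[a-fA-F0-9]{40}")

@dataclass
class NormalizedQuery:
    """Assumed shape of the Analysis phase output (field names are guesses)."""
    intent: str
    metric_signal: str
    entity_signal: str
    domain_hint: str
    filters: dict = field(default_factory=dict)
    canonical_form: str = ""
    entities: list = field(default_factory=list)
    confidence: float = 0.0

def scan_addresses(text: str) -> list:
    """Step 6: regex extraction of blockchain addresses."""
    return ADDRESS_RE.findall(text)

def score_confidence(base: float, ambiguity_bonus: float) -> float:
    """Step 7: base + ambiguity bonus, clamped to [0, 1]."""
    return max(0.0, min(1.0, base + ambiguity_bonus))
```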
02 · Retrieval (RAG · Hybrid Search · Embedding)
Model Retrieval Per Component
1. Probe Building: construct semantic search query from metric signal + entity + domain hint
2. Vector ANN Search: embed probe → approximate nearest neighbours on model namespace (1024-dim)
3. BM25 Full-Text Search: keyword matching on model metadata — names, descriptions, domains
4. Reciprocal Rank Fusion: fuse vector + BM25 rankings → deduplicated candidate list
5. Multi-Signal Scoring: 4-signal weighted blend → composite score per candidate
6. Model Selection + Fallback: top-1 candidate, or LLM-suggested model (conf ×0.9) when no results
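Step 4's fusion can be sketched as standard Reciprocal Rank Fusion. The smoothing constant k=60 is the common default from the RRF literature and an assumption here, not a value taken from the pipeline:

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse multiple ranked candidate lists (e.g. vector ANN + BM25) into one
    deduplicated list. Each candidate's score is the sum of 1/(k + rank)
    over every list it appears in; higher total wins."""
    scores = {}
    for ranking in rankings:
        for rank, candidate in enumerate(ranking, start=1):
            scores[candidate] = scores.get(candidate, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)
```

A candidate ranked well by both signals beats one ranked top by a single signal, which is exactly the deduplicating behaviour the step describes.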
Scoring Weights: base score ×0.82 · domain match ×0.08 · historical prior ×0.07 · anti-pattern −0.03
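A minimal sketch of the 4-signal blend using the weights above; the assumption that each input signal lies in [0, 1] is ours:

```python
def composite_score(base, domain_match, hist_prior, anti_pattern):
    """Weighted blend per candidate model, per the Scoring Weights above.
    The anti-pattern signal is a penalty, hence the subtraction."""
    return (base * 0.82
            + domain_match * 0.08
            + hist_prior * 0.07
            - anti_pattern * 0.03)
```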
Few-Shot Retrieval (Dual Strategy)
1. Canonical Embedding: embed the normalised canonical form from the Analysis phase
2. Vector Search on Gold Records: cosine similarity against known question→query pairs
3. Jaccard Pattern Matching: tokenised query pattern similarity as secondary signal
4. Composite Scoring: vector ×0.7 + pattern ×0.3 → ranked matches
5. Top-5 Selection: best-matching examples with similarity scores for prompt injection
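The dual scoring in steps 3 and 4 reduces to two small functions. The whitespace tokenisation and the assumption that both similarities are already in [0, 1] are ours:

```python
def jaccard(a_tokens, b_tokens):
    """Step 3: token-set Jaccard similarity as the secondary signal."""
    a, b = set(a_tokens), set(b_tokens)
    return len(a & b) / len(a | b) if a | b else 0.0

def few_shot_score(vector_sim, pattern_sim):
    """Step 4: composite score, vector x0.7 + pattern x0.3."""
    return vector_sim * 0.7 + pattern_sim * 0.3
```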
Strategy Selection
· Ground Truth Service — two-pass retrieval against curated gold records
· Vector Store — direct embedding search on question namespace
Post-merge: Join paths mapped across multi-model queries · Warnings flagged for missing joins · Output: RetrievalContext — per-component results, multi-model context, few-shot matches, interpretation context
03 · Generation (LLM Structured Output + Validation)
1. Prompt Assembly: schema + few-shots + intent + context
2. LLM Generation: structured output → SemanticQuery candidate
3. Schema Validation: FQN member names, valid SQL operators
4. Semantic Validation: measures belong to resolved models
5. Completeness Check: all query components addressed
6. Filter Injection: deterministic override of conflicting LLM clauses
7. Confidence Score: composite weighted score → [0, 1]
Output Confidence Score
Overall = LLM self-confidence ×0.8 + few-shot similarity ×0.2 → output range [0, 1]
Output: SemanticQuery (measures, dimensions, filters, time dimensions, segments, order, limit) + resolved models + full confidence breakdown + reasoning trace
◇ Decision Gate: Overall Confidence ≥ 0.75? (AGENTIC_FALLBACK_THRESHOLD)
✓ YES (high confidence) → SQL Compilation
⚡ NO / SQL error → Agentic Fallback
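Taken together, the confidence formula and the gate amount to a few lines. The routing labels and the `sql_error` flag are illustrative:

```python
AGENTIC_FALLBACK_THRESHOLD = 0.75

def overall_confidence(llm_self_conf, few_shot_sim):
    """Overall = LLM self-confidence x0.8 + few-shot similarity x0.2."""
    return llm_self_conf * 0.8 + few_shot_sim * 0.2

def route(confidence, sql_error=False):
    """Decision gate: high confidence and no SQL error -> compile directly;
    anything else -> agentic fallback."""
    if confidence >= AGENTIC_FALLBACK_THRESHOLD and not sql_error:
        return "sql_compilation"
    return "agentic_fallback"
```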
04a · SQL Compilation & Execution (Schema Compiler · Snowflake)
1. Schema Translation: SemanticQuery → compiler YAML schema with validated joins
2. Query Compilation: measures, dimensions, filters → Snowflake SQL via dialect adapter
3. Time Resolution: relative expressions → absolute ISO date pairs
4. Parameter Inlining: parameterised query → runnable SQL
5. Execute Against Database: run SQL → rows + metadata + execution time
⚠ SQL execution error → triggers Agentic Fallback →
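Step 3's time resolution, sketched for the "last N days" case only. The expression grammar the compiler actually supports is not specified here, so this is an illustrative subset:

```python
from datetime import date, timedelta

def resolve_time_range(expr, today=None):
    """Resolve a relative expression like 'last 30 days' into an absolute
    ISO date pair (start, end). Handles only this one pattern."""
    today = today or date.today()
    parts = expr.lower().split()
    if len(parts) == 3 and parts[0] == "last" and parts[2] == "days":
        start = today - timedelta(days=int(parts[1]))
        return start.isoformat(), today.isoformat()
    raise ValueError(f"unsupported time expression: {expr!r}")
```

For the sample query above, "last 30 days" anchored at 2024-01-31 would resolve to the pair ("2024-01-01", "2024-01-31").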
04b · Agentic Fallback (Max 40 Steps)
Tool-Calling Loop: list_models · search_models_rag · inspect_model · run_test_query
1. Context Injection: agent receives the original query, normalised intent, RAG context, and original SQL + error (if any)
2. Iterative Resolution: searches models, inspects schemas, tests queries — up to 40 tool calls
3. Exit Decision: finalize (working SQL found) or give_up (return original pipeline result)
All tool-call traces + reasoning captured as learning candidates
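The loop's control flow can be sketched as below. The `agent` callable and its `(action, payload)` protocol are assumptions made for illustration; only the tool names and the step cap come from the section above:

```python
MAX_STEPS = 40
TOOLS = {"list_models", "search_models_rag", "inspect_model", "run_test_query"}

def agentic_fallback(agent, context):
    """Tool-calling loop: up to MAX_STEPS tool calls, exiting on
    'finalize' (working SQL) or 'give_up' (original pipeline result).
    Every tool call is recorded in the trace as a learning candidate."""
    trace = []
    for _ in range(MAX_STEPS):
        action, payload = agent(context, trace)
        if action == "finalize":
            return {"sql": payload, "trace": trace}
        if action == "give_up":
            return {"sql": context.get("original_sql"), "trace": trace}
        assert action in TOOLS, f"unknown tool: {action}"
        trace.append((action, payload))
    # Step budget exhausted: fall back to the original pipeline result.
    return {"sql": context.get("original_sql"), "trace": trace}
```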

Resolved SQL Result

SemanticQuery · Compiled SQL · Query rows & metadata · Full confidence breakdown · Timing trace

05 · Learning Loop (↺ Async · Closed Loop)
1. Capture Failures: SQL errors, agent traces, tool-call sequences
2. Store Candidates: enrichment suggestions with source + confidence
3. Human Review: pending → approved gate before any enrichment
4. Model Enrichment: new measures, dimensions, joins, aliases
5. Vector Store Update: re-embed enriched models into the search namespace
6. Improve Retrieval: future queries benefit from richer models + more examples
↑ Approved enrichments feed back into Phase 02 Retrieval — directly improving model resolution and few-shot matching for all future queries
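The pending → approved gate in steps 2 and 3 is the key invariant: nothing enriches a model without human sign-off. A minimal sketch with assumed field names:

```python
from dataclasses import dataclass

@dataclass
class EnrichmentCandidate:
    """A learning candidate captured from a failure or agent trace
    (field names are illustrative, not the production schema)."""
    suggestion: str
    source: str          # e.g. "agent_trace", "sql_error"
    confidence: float
    status: str = "pending"

def review(candidate, approved):
    """Human review gate: only approved candidates may proceed to
    model enrichment and re-embedding."""
    candidate.status = "approved" if approved else "rejected"
    return candidate.status == "approved"
```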