projects

Production AI I've shipped.

What I can show publicly: BIFROST, ORVIAN, and Polaris in production, plus open-source tools for policy, memory, evidence, and harness control.

email →cv →full-time, contract, or fractional · remote EU

system inventory

current work

3

public repos

5

focus

production ai

base

madrid

production systems

public, non-confidential detail

state

system

since

type

what it does

current

BIFROST

Financial document workflows

Document intelligence for the documents finance runs on: ingestion quality gates, semantic chunking, multimodal retrieval, pgvector/HNSW search, caching, source-quality summaries, analytics, and honest no-answer behavior.

2024

production

Document intelligence for the documents finance runs on: ingestion quality gates, semantic chunking, multimodal retrieval, pgvector/HNSW search, caching, source-quality summaries, analytics, and honest no-answer behavior.

pythonfastapipostgresqlpgvectordoclingpytorchtransformers

current

ORVIAN

B2B collections

AI workflow runtime with protected multi-tenant APIs, context assembly, durable memory, deterministic/cached/full-LLM execution tiers, run events, idempotency, queue processing, and human-review metadata.

2024

production

AI workflow runtime with protected multi-tenant APIs, context assembly, durable memory, deterministic/cached/full-LLM execution tiers, run events, idempotency, queue processing, and human-review metadata.

typescripthonopostgresqldrizzlesupabasequeuesscheduled-jobs

current

Polaris

Support, sales, and product

Internal AI assistant product integrating BIFROST retrieval with MONARCH guardrails, cached safety-to-retrieval handoff, citations, streaming UX, analytics, and suggestion revalidation.

2024

production

Internal AI assistant product integrating BIFROST retrieval with MONARCH guardrails, cached safety-to-retrieval handoff, citations, streaming UX, analytics, and suggestion revalidation.

next.jsvercel-ai-sdkbifrostmonarchdrizzle-ormpostgresql

open source

github.com/Arakiss →

state

system

since

type

what it does

Deterministic policy engine for AI coding agents: maps tool calls to capabilities, evaluates YAML rules, and signs every decision in a verifiable audit log, with hard-stops that policy can't bypass.

Deterministic policy engine for AI coding agents: maps tool calls to capabilities, evaluates YAML rules, and signs every decision in a verifiable audit log, with hard-stops that policy can't bypass.

ai-agentspolicy-as-codeguardrailspermissionsharness-engineering

Self-inspecting, auditable memory for AI agents: surfaces the evidence, provenance, and health behind each recall so callers can see which memory to trust, with an optional Ed25519-signed tamper-evident ledger. Local-first, Rust.

Self-inspecting, auditable memory for AI agents: surfaces the evidence, provenance, and health behind each recall so callers can see which memory to trust, with an optional Ed25519-signed tamper-evident ledger. Local-first, Rust.

ai-agentsagent-memorytamper-evidenceed25519harness-engineering

Local-first trace recorder for AI agent runs: append-only, verifiable evidence of what the agent called, what it was allowed, and what failed, with hook ingestion for Codex/OMX harnesses.

Local-first trace recorder for AI agent runs: append-only, verifiable evidence of what the agent called, what it was allowed, and what failed, with hook ingestion for Codex/OMX harnesses.

ai-agentsagent-tracingharness-engineeringlocal-firstdeveloper-tools

Runtime-agnostic structured logging with automatic PII sanitization (GDPR/HIPAA/PCI-DSS) and native W3C tracing. Zero dependencies; runs on Node, Bun, Deno, Edge, and the browser.

Runtime-agnostic structured logging with automatic PII sanitization (GDPR/HIPAA/PCI-DSS) and native W3C tracing. Zero dependencies; runs on Node, Bun, Deno, Edge, and the browser.

loggingobservabilitypiigdprtypescript

Research harness exploring whether a coding-agent harness can measurably improve itself through typed, layered modifications validated against operator-defined evals within strict budgets.

Research harness exploring whether a coding-agent harness can measurably improve itself through typed, layered modifications validated against operator-defined evals within strict budgets.

ai-agentsharness-engineeringevalsself-improvementresearch