projects
Production AI I've shipped.
system inventory
current work
3
public repos
5
focus
production ai
base
madrid
production systems
public, non-confidential detailstate
system
since
type
what it does
current
BIFROST
Financial document workflows
Document intelligence for the documents finance runs on: ingestion quality gates, semantic chunking, multimodal retrieval, pgvector/HNSW search, caching, source-quality summaries, analytics, and honest no-answer behavior.
2024
production
Document intelligence for the documents finance runs on: ingestion quality gates, semantic chunking, multimodal retrieval, pgvector/HNSW search, caching, source-quality summaries, analytics, and honest no-answer behavior.
pythonfastapipostgresqlpgvectordoclingpytorchtransformers
current
ORVIAN
B2B collections
AI workflow runtime with protected multi-tenant APIs, context assembly, durable memory, deterministic/cached/full-LLM execution tiers, run events, idempotency, queue processing, and human-review metadata.
2024
production
AI workflow runtime with protected multi-tenant APIs, context assembly, durable memory, deterministic/cached/full-LLM execution tiers, run events, idempotency, queue processing, and human-review metadata.
typescripthonopostgresqldrizzlesupabasequeuesscheduled-jobs
current
Polaris
Support, sales, and product
Internal AI assistant product integrating BIFROST retrieval with MONARCH guardrails, cached safety-to-retrieval handoff, citations, streaming UX, analytics, and suggestion revalidation.
2024
production
Internal AI assistant product integrating BIFROST retrieval with MONARCH guardrails, cached safety-to-retrieval handoff, citations, streaming UX, analytics, and suggestion revalidation.
next.jsvercel-ai-sdkbifrostmonarchdrizzle-ormpostgresql
open source
github.com/Arakiss →state
system
since
type
what it does
public
gommage
Deterministic policy engine for AI coding agents: maps tool calls to capabilities, evaluates YAML rules, and signs every decision in a verifiable audit log, with hard-stops that policy can't bypass.
public
rust
Deterministic policy engine for AI coding agents: maps tool calls to capabilities, evaluates YAML rules, and signs every decision in a verifiable audit log, with hard-stops that policy can't bypass.
ai-agentspolicy-as-codeguardrailspermissionsharness-engineering
public
nahuali
Self-inspecting, auditable memory for AI agents: surfaces the evidence, provenance, and health behind each recall so callers can see which memory to trust, with an optional Ed25519-signed tamper-evident ledger. Local-first, Rust.
public
rust
Self-inspecting, auditable memory for AI agents: surfaces the evidence, provenance, and health behind each recall so callers can see which memory to trust, with an optional Ed25519-signed tamper-evident ledger. Local-first, Rust.
ai-agentsagent-memorytamper-evidenceed25519harness-engineering
public
traceframe
Local-first trace recorder for AI agent runs: append-only, verifiable evidence of what the agent called, what it was allowed, and what failed, with hook ingestion for Codex/OMX harnesses.
public
rust
Local-first trace recorder for AI agent runs: append-only, verifiable evidence of what the agent called, what it was allowed, and what failed, with hook ingestion for Codex/OMX harnesses.
ai-agentsagent-tracingharness-engineeringlocal-firstdeveloper-tools
public
vestig
Runtime-agnostic structured logging with automatic PII sanitization (GDPR/HIPAA/PCI-DSS) and native W3C tracing. Zero dependencies; runs on Node, Bun, Deno, Edge, and the browser.
public
typescript
Runtime-agnostic structured logging with automatic PII sanitization (GDPR/HIPAA/PCI-DSS) and native W3C tracing. Zero dependencies; runs on Node, Bun, Deno, Edge, and the browser.
loggingobservabilitypiigdprtypescript
public
greco
Research harness exploring whether a coding-agent harness can measurably improve itself through typed, layered modifications validated against operator-defined evals within strict budgets.
public
rust
Research harness exploring whether a coding-agent harness can measurably improve itself through typed, layered modifications validated against operator-defined evals within strict budgets.
ai-agentsharness-engineeringevalsself-improvementresearch