[ 01 ] · INDEX

PORTFOLIO · 2026

NYC · EST 2019 · SENIOR · PRODUCT · AI SYSTEMS

[ AVAILABLE · Q3 2026 ] / NYC / PRODUCT DESIGN · AI SYSTEMS

I design the decisions AI products make — not just the screens.

Product designer focused on the line where AI meets real consequences. What the model is allowed to decide. What has to be a rule. How the system shows its work. Currently shipping case studies on multi-agent systems, retrieval memory, and LLM policy architecture.

→ BASED IN New York City · working globally

→ FOCUS AI systems · product · UX architecture

→ LATEST Director · Supply Chain MAS · Apr 2026

→ BACKGROUND 5 yrs · B2B · social · finance · a11y

[ 02 ] · FEATURED WORK

2026 · IN DEPTH

FIVE CASE STUDIES · BESTBITE · NEWEST · 4 SHIPPED · 1 BETA

01 · FEATURED WORK

Five case studies about the hardest part of AI design — deciding what the model doesn't decide.

All five started from the same observation: LLM products fail in predictable ways. These are the design responses when you take that seriously — shipped, working code, not mockups.

/01 Bitez · iOS · Product AI / Solo · on-device LLM · graceful fallback / 11 MIN

/02 Director · Supply Chain MAS / Multi-agent · eval-driven / 10 MIN

/03 Visual RAG · Memory Map / AI memory · retrieval UX / 6 MIN

/04 Demon Rising · The Council / LLM in the loop · fallback design / 9 MIN

/05 The Translator Pattern / Design pattern · AI + policy / 8 MIN

[ 03 ] · FEATURED / 01

2026 · 11 MIN READ · TESTFLIGHT BETA

iOS · SwiftUI · APPLE FOUNDATION MODELS · 2 AI MOMENTS · ~80% API SAVINGS

FEATURED · Product AI · On-device LLM · Transparent reasoning

Bitez · the on-device AI that picks one.

You're hungry. You open Bitez. It picks one place — not fifty — and explains why in a sentence you'd want to hear from a friend. Apple's on-device AI does the language part: reading what you typed, writing the explanation. The actual reasoning behind the pick is laid out on screen — every fact sourced, never a black box. Currently in TestFlight beta.

ROLE · Founder · Product AI designer / YEAR · 2026 / READ · 11 min

Read the case study → Request TestFlight access →

[ 04 ] · FEATURED / 02

2026 · 10 MIN READ

F1 = 0.94 · 180,519 ROWS · 4 SUB-AGENTS · 3 ROUTER STRATEGIES

FEATURED · Multi-agent · Eval-driven

Director · Supply Chain MAS.

A multi-agent system that turns 180,519 raw orders into a one-paragraph answer — by routing through four specialised sub-agents, surfacing the raw rows behind every claim, and benchmarking three router strategies against each other. Built for legibility, not for autonomy.

ROLE · Solo design + engineering / YEAR · 2026 / READ · 10 min

Read the case study →

[ 05 ] · FEATURED / 03

2026 · 6 MIN READ

42 CARDS · 6 CLUSTERS · UMAP 2D · LOCAL-FIRST

FEATURED · AI Memory · Retrieval UX

Visual RAG · Memory Map.

A 2D coordinate-space view of an LLM's long-term memory — 42 chunked cards across 6 semantic clusters, projected with UMAP. The query lights up the cluster it lands in, draws rays to top-K retrieved chunks, and shows the match percentages live. Makes retrieval legible — not magic.
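The retrieval step behind a view like this can be sketched in a few lines. This is a minimal illustration, not the project's code: it assumes cards are ranked by cosine similarity in the original embedding space (with UMAP used only for the 2D layout), and that the match percentage is the similarity scaled to 0-100. The names `cosine`, `top_k`, and the toy vectors are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query, cards, k=3):
    """Return the k best-matching card ids with a 0-100 match percentage.

    `cards` maps a card id to its embedding vector (hypothetical shape).
    Negative similarities are clamped to 0 so the UI never shows "-12%".
    """
    scored = [(card_id, cosine(query, vec)) for card_id, vec in cards.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [(card_id, round(max(score, 0.0) * 100)) for card_id, score in scored[:k]]


# Toy 2-dimensional "embeddings", for illustration only.
cards = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
matches = top_k([1.0, 0.0], cards, k=2)
```

Each returned pair would drive one ray in the map: the card id picks the target, the percentage labels it.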

ROLE · Solo design + prototype / YEAR · 2026 / READ · 6 min

Read the case study →

[ 06 ] · FEATURED / 04

2026 · 9 MIN READ

≤ 8s LATENCY · 0 DEAD-ENDS · 5-WAY INTENT

FEATURED · LLM in the loop · Game

Demon Rising · The Council.

A browser-based narrative game where 5 council advisors react to the player's choices in natural language. The LLM classifies player intent into 5 channels and generates contextual responses, while the game state stays consistent. Latency budget ≤ 8s, 0 dead-ends, designed fallbacks for every failure mode.

ROLE · Design + LLM eng / YEAR · 2026 / READ · 9 min · external

Play on Steam ↗ Case study ↗

[ 07 ] · FEATURED / 05

2026 · 8 MIN READ

4 GATES · 6/6 ATTACKS BLOCKED · 0 HALLUCINATED APPROVALS

FEATURED · Pattern · AI + Policy

The Translator Pattern.

A design pattern for AI-policy systems: LLM translates, code decides. The model converts natural language into a typed, structured plan. Code runs a deterministic validator with 4 gates — schema, range, policy, sandbox. Tested with 6 adversarial prompts: 6/6 blocked, 0 hallucinated approvals.
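A minimal sketch of the pattern, under stated assumptions: the gate order (schema, range, policy, sandbox) comes from the description above, but what each gate checks here is a guess made for illustration. The `Plan` fields, the allowed actions, the amount cap, and the sandbox account list are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical policy values, chosen only for illustration.
ALLOWED_ACTIONS = {"refund", "discount"}
MAX_AMOUNT = 500.0
SANDBOX_ACCOUNTS = {"acct-test-1", "acct-test-2"}


@dataclass(frozen=True)
class Plan:
    action: str
    amount: float
    account: str


def gate_schema(raw):
    """Gate 1: exactly the expected fields, with the expected types."""
    if set(raw) != {"action", "amount", "account"}:
        raise ValueError("schema: unexpected or missing fields")
    if not isinstance(raw["action"], str) or not isinstance(raw["account"], str):
        raise ValueError("schema: action and account must be strings")
    if isinstance(raw["amount"], bool) or not isinstance(raw["amount"], (int, float)):
        raise ValueError("schema: amount must be a number")
    return Plan(raw["action"], float(raw["amount"]), raw["account"])


def gate_range(plan):
    """Gate 2: numeric values must sit inside hard bounds."""
    if not (0 < plan.amount <= MAX_AMOUNT):
        raise ValueError("range: amount outside allowed bounds")


def gate_policy(plan):
    """Gate 3: only allow-listed actions may proceed."""
    if plan.action not in ALLOWED_ACTIONS:
        raise ValueError("policy: action not permitted")


def gate_sandbox(plan):
    """Gate 4: the plan may only touch sandboxed resources."""
    if plan.account not in SANDBOX_ACCOUNTS:
        raise ValueError("sandbox: account is outside the sandbox")


def validate(raw):
    """Run the model's structured output through all four gates, in order."""
    try:
        plan = gate_schema(raw)
        gate_range(plan)
        gate_policy(plan)
        gate_sandbox(plan)
    except ValueError as err:
        return False, str(err)
    return True, "approved"
```

The model never sets an "approved" flag itself. An extra field such as `"approved": True` in its output fails the schema gate, so approval can only come from `validate` returning it.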

ROLE · Design + spec / YEAR · 2026 / READ · 8 min

Read the case study →

[ 08 ] · SELECTED

2019 — 2025

4 PROJECTS · E-COMM · AI · A11Y · SOCIAL

02 · SELECTED WORK

Earlier & client work — five years.

The boring, important parts of products — IA, data-heavy dashboards, permission-gated flows, AI-augmented client work.

2025 · CLIENT · E-COMM · LIVE

MechaPro · B2B equipment

A heavy-equipment e-commerce site where a single purchase decision can run $20K. Designed the IA, configurator, trust modules, and buyer flows. First 30 days: $38K+ revenue, $9.7K AOV.

CASE STUDY →

2026 · CLIENT · AI · LIVE

SAGE · AI receptionist

A multi-program NYC learning center site with Shirely — an AI assistant that classifies parent intent, holds boundaries on pricing & promises, and routes to branch staff. ~12% conversion (4× industry avg).

CASE STUDY →

2025 · B2B · AI · A11Y

A11y Copilot

An AI accessibility scanner that explains WCAG violations in plain English and proposes design-level fixes — not just color-contrast tickets.

CASE STUDY · SOON →

2021 · SOCIAL · MOBILE

Gather

A mobile app for strangers meeting IRL. Designed around the question: how does a social product create trust between people who haven't met yet?

OVERVIEW →

[ 09 ] · ABOUT

JESTAZ YAO · NYC

NYC · EST 2019 · SENIOR · OPEN · Q3 2026

03 · ABOUT

I design the parts of AI products where the decisions are actually hard.

Five years in product design — moving from B2B SaaS and accessibility tooling to AI-augmented client work (SAGE, MechaPro, A11y Copilot) to multi-agent systems and retrieval architecture (Director, RAG, Translator Pattern). The thread: I'm most useful where the design problem isn't "make this screen prettier" but "decide what the model gets to decide, and what stays a rule."

Born in Beijing, based in NYC. Senior product designer, comfortable in code (TS, Python, and Swift for wiring prototypes), and the person on the team who'll argue for the fallback flow before the happy path.

→ NOW Open to senior product / AI design roles · NYC or remote.

→ TOOLS Figma · Linear · Cursor · Python · TS · Claude · GPT.

→ FOCUS LLM products · multi-agent systems · retrieval UX · AI policy patterns.

→ INDUSTRIES AI tooling · B2B SaaS · e-commerce · creator finance · social.

/01

Design the failure mode first.

What happens when the model is wrong? What does the user see? Where does the system route to next? I draw the unhappy path before the happy one.

/02

LLM translates. Code decides.

I treat the model as a translator from messy human inputs to typed structured data. The structured data goes through deterministic gates. The model isn't allowed to be the final authority on anything irreversible.

/03

Show the work, not just the answer.

Confidence scores, citations to source rows, retrieval rays, traces. Users trust systems that let them check the work — and design has to make the checking cheap.

/04

Care more about the boring half.

Empty states, error states, permissions, ops dashboards, trust modules. The half that doesn't ship to the demo reel is the half that actually keeps users.

[ 10 ] · CONTACT

OPEN · Q3 2026 · TRANSMITTING

EMAIL · LINKEDIN · RESUME · RESPONSE WITHIN 48H

05 · GET IN TOUCH

Building something where the design decisions are actually hard?

AI products, multi-agent systems, retrieval, complex B2B flows — that's the work I want. Drop a note and tell me what you're building.

→ EMAIL yxhzdm@gmail.com

→ LINKEDIN linkedin.com/in/jestaz

→ RESUME PDF · 1 page