Liu Wei — AI Agent Builder & Embodied-AI Researcher

Education

Asia Pacific University of Technology & Innovation (APU), Malaysia

Nov 2024 – Sep 2027

Bachelor of Science (Honours) in Computer Science · 3-year full-time

Cumulative CGPA: 3.87 / 4.00 (Y2S1 GPA 3.88) — currently Year 2 (Level 2). Vice-Chancellor's List (Academic Year 2024/25). First-Class trajectory (≥ 3.70 threshold).
Programme Code R2/0613/6/0055 (06/29) MQA/FA4622 — approved by MoHE Malaysia, accredited by MQA. Medium of instruction: English. Programme completes before HKUST(GZ) Fall 2027 commencement.
Institutional AI infrastructure: APU provides student access to NVIDIA DGX Spark units (128 GB LPDDR5x unified memory) for LLM training.

Founding

DuoCode Technology · Founder & CEO · 50% beneficial ownership

Mar 2026 – Present

Klang, Selangor, Malaysia · SSM 202603062356 (LA0087244-A) · duocodetech.com

pipeline-infra (public, AGPL-3.0, TypeScript) — autonomous Discover (Google Maps Places API) → Prepare (brand-colour extraction with WCAG enforcement, scaffold) → Design (Claude Code orchestrator + 6 skills) → Finalize (Lighthouse a11y ≥ 95, Vercel REST v13 deploy). 30+ live customer landing pages in 3 months for Malaysian SMEs across food / beauty / dental / auto / sports / mobile-repair. 19-region locale support; 428 tests.
CS-EverySystem (private beta) — drop-in customer-service agent layer; 11 product pillars; 6 north-star metrics (resolution ≥ 40%, grounded ≥ 90%, p95 ≤ 3 s). Targeting Intercom Fin technical parity in 90 days. Hybrid RAG + multi-layer guardrails + trace-first observability (Langfuse). No LangChain, no provider-locked SDKs, no self-trained models.
Drove all SEO / GEO infrastructure personally: Cloudflare DNS+CDN, Vercel edge + ISR, schema.org JSON-LD, multi-locale canonical, sitemap + hreflang, Lighthouse-gated a11y / performance budgets.
Joint decision authority with co-founder Tang Chye Fong on contracts ≥ RM 5,000, hiring, IP licensing, and investment per signed founders' shareholding agreement (27 April 2026).

Public org: github.com/DuoCode2

Awards & Recognition

APICTA Awards (Malaysia Digital) 2025 — Tertiary Student Project Finalist. SynTour: Vertex AI Gemini fine-tuned Malaysia travel agent (LangChain/LangGraph · n8n). Presented to category judging panel 21 Oct 2025 (PIKOM, National Tech Association of Malaysia).

National

Deriv AI Hackathon Challenge 2026 — Certificate of Recognition · RM 2,000. 14–15 Feb 2026, TradeIQ (team lead). Five-agent LLM pipeline + Trading Twin behavioural simulation. Signed by Founder Jean-Yves Sireau and CEO Rakshit Choudhary.

International

iii

Amazon Nova AI Hackathon 2026 — Bonus Blog Post Prize · USD 200 AWS credits. NovaTour, voice-first travel planner on Amazon Nova Sonic/Lite/Act + Strands Agents SDK. Announced 10 Apr 2026 via Devpost.

International

MariaDB Hackathon MY 2026 — Proposal A+ rated, Finalist. seamless-rag: MariaDB-native vector + RAG toolkit using VECTOR(N) + HNSW + TOON v3 for token-efficient retrieval payloads. 23–39% token reduction vs JSON on real datasets (MovieLens, SF Restaurant Violations); 538 tests passing.

National

APU Vice-Chancellor's List. Excellent Academic Achievement, Academic Year 2024/25. Signed by Vice Chancellor Prof Dr Ho Chin Kuan.

University

Internship & Engagements

EBKernel · 上海具脑磐石科技有限公司 — Embodied Agent Algorithm Intern

19 Nov 2025 – Present

Embodied-AI / Human-Computer Interaction Researcher · Fully remote · 弹性工作制 (flexible hours)

RynnBrain VLM × OmniGibson eval pipeline (sub-project from 03/2026): lifted BDDL true success 0% → 80% by stripping six classes of privileged information (GT segmentation → GSAM open-vocabulary perception; container AABBs → pixel-level snap; teleport → CuRobo trajectory planning). 60+ seeds × 3 BEHAVIOR-1K tasks; documented 28 falsified hypotheses. Three-machine remote pipeline: Mac ↔ RTX 4090 ↔ H100 cluster (SSH tunnels + rsync; vLLM TP=1).
Robot voice interaction: co-deployed real-time stack on Lejun (乐聚) humanoid for water-fetch / hotel front-desk / reception scenarios. Full domestic localisation: Alibaba Cloud TTS/STT, Agora (声网) RTC, TEN Framework. Researched 3C noise-suppression and microphone directivity at the hardware/software boundary.
Multi-cloud GPU-infra collaboration: worked across Huawei Cloud and partnered with the infrastructure teams of Kuaishou (快手) and DataCanvas · 九章云极 (Alaya NeW Cloud) to debug GPU container scheduling, heterogeneous-compute cluster orchestration (mixed deployment of domestic chips and NVIDIA), and shared-compute inference pipelines.

iMean · 椰子宇宙（杭州）科技有限公司 — Algorithm Intern (Sole Engineer, Voice Agent)

15 Sep 2025 – 02 Feb 2026

Algorithm Department (算法部门) · Onsite Hangzhou Sep–Nov 2025 · Remote Dec 2025 – Feb 2026

Owner of the full Web + iOS voice-agent stack: cross-tested STT (Deepgram, ElevenLabs, Gemini), TTS (ElevenLabs, ChatGPT TTS), and LLM (GPT-4o, Gemini, Gemini Live cascaded vs. native multimodal).
Voice infrastructure across LiveKit, FastRTC, TEN Framework — WebSocket and WebRTC underneath. Abstracted OpenAI Agents SDK, Gemini ADK, and LangChain/LangGraph behind a single multi-provider interface so we could swap orchestration without touching business logic.
Engineered streamed long-context management, dual-tier (short + long) memory over vector recall, and per-session state machines that switched cadence and tone (podcast tempo, calm voice) based on the detected scene. Earned formal team-lead and company recognition under an ambiguous spec.

Daikin Malaysia (大金) — Team Leader, Enterprise Systems (4-Module Lead)

2025 – Ongoing

APU university–industry collaboration project · Ongoing delivery

Leading delivery across four enterprise modules: (1) IoT-device interconnection + sensor data ingest; (2) form-assistant + work-order ticketing system; (3) internal wiki knowledge base; (4) on-prem RAG question-answering assistant (locally deployed Ollama). Own architecture, integration, and cross-team handoff.

Open Source & Selected Works

Personal: github.com/SunflowersLwtech · 41 public repos | Company org: github.com/DuoCode2

DuoCode2 / pipeline-infra

AGPL-3.0 · TS

DuoCode growth engine: 4-phase Discover → Prepare → Design → Finalize pipeline. 30+ live SME sites; 428 tests; 19-region locale.

polanyi-design

Pinned · 5 ★

Claude Code Skill — frontend cognitive engine grounded in Michael Polanyi's tacit-knowledge framework.

search-harness

Pinned

Agent CLI + skills; 5-track roadmap: search-result persistence · agent orchestration · native multimodal pass-through · codebase context-search · CUA E2E testing.

seamless-rag

MariaDB Finalist

MariaDB-native RAG toolkit (VECTOR + HNSW + TOON v3, 23–39% token saving); 538 tests passing.

Council

Multi-agent

Local-first multi-agent + multi-user social-deduction game. Mistral AI cognition + PowerSync 6-stream isolated sync + 3D rendering.

SynCVE

Private · on request

Real-time emotion recognition: ensemble OpenCV + SSD fusion · FasNet anti-spoofing · EMA temporal smoothing (α = 0.2, ablation-tuned) · two-stage Gemini reporting · ~1–2 s e2e.

SightLine

iOS · Accessibility

iOS + Gemini Live accessibility platform for the blind. Apple Watch sensors · 18 function-calling tools · adaptive LOD engine.

TradeIQ

Deriv RM 2,000

AI-powered trading platform. 5-agent LLM pipeline · DeepSeek V3 streaming · "Trading Twin" dual equity curve simulation. Team lead.

NovaTour

AWS USD 200

Voice-first travel planner on Amazon Nova Sonic/Lite/Act + Strands Agents SDK + Google Maps tool orchestration.

arus — Banjir Drill

Google ADK · Cloud Run

Browser-playable Malaysian flood-coordination simulator. Three modes (PLAY / COACH / AUTO) on Gemini 2.5 Pro + Google ADK 1.27 + Veo 3 + Cloud Run.

Astrolabe

Auth0 · RFC 8693

Agent observability — RFC 8693 token exchange + Auth0 FGA. Post-authentication audit + scope-bound risk classification.

mcp_creator_growth

MCP · 6 ★

Context-aware MCP server: learning sidecar for AI coding assistants. Quizzes + persistent debugging memory.

Skills

ML / DL / RL

PyTorch TensorFlow JAX NumPy Pandas scikit-learn Hugging Face ONNX Runtime W&B RLHF / DPO Ablation Design

AI Agents

LangChain / LangGraph OpenAI Agents SDK Google ADK Anthropic Agent SDK Claude Code (skills + sub-agents) CrewAI AutoGen Pydantic AI Vercel AI SDK MCP browser-use Playwright (CUA) Vertex AI fine-tuning

Inference / Serving

vLLM (TP=1) SGLang Ollama TGI NVIDIA Triton H100 / RTX 4090 ops KV-cache & prefix-cache tuning

Voice / Real-time

LiveKit FastRTC TEN Framework WebRTC WebSocket Deepgram ElevenLabs Gemini Live Agora 声网 Alibaba TTS

Embodied AI

OmniGibson BEHAVIOR-1K GSAM (Grounded-SAM) CuRobo BDDL eval design IK branch analysis Lejun 乐聚 humanoid

Computer Vision

OpenCV DeepFace FasNet anti-spoofing RetinaFace MTCNN Ensemble Detector Fusion EMA Temporal Smoothing CUDA 11.8 / cuDNN

GPU Infra

Huawei Cloud Kuaishou 快手 infra DataCanvas 九章云极 (Alaya NeW) GPU container debug heterogeneous-compute scheduling

Google Cloud

Vertex AI Gemini 3.1 / Live Firebase (Firestore + Auth + Vector) Cloud Run Cloud Build Routes / Places / Maps Apigee

Edge / Web

Vercel (REST v13, ISR, edge) Cloudflare (DNS, CDN, Workers, Turnstile) Next.js 15/16 React 19 SwiftUI Three.js Tailwind CSS

Backend / Data

FastAPI Django Channels MariaDB (VECTOR/HNSW) pgvector Supabase + PowerSync Upstash Redis Docker n8n sqlglot

Observability / Eval

OpenTelemetry Langfuse Promptfoo Lighthouse CI Gemini-as-judge eval TOON v3 token benchmarking

SEO / GEO

Schema.org JSON-LD multi-locale canonical sitemap + hreflang Lighthouse-gated a11y / perf GEO (AI Overviews / ChatGPT Search / Perplexity)

Continuous Learning

Python for Everybody Specialization (5 courses) · University of Michigan — Coursera, May 2025. verify: coursera.org/verify/specialization/I2UVB701P0JF

C for Everyone: Programming Fundamentals · UC Santa Cruz — Coursera, Jul 2025.

Introduction to Networking · NVIDIA — Coursera, Jan 2025.

Writings & Thoughts

Source-level analyses of agent infrastructure and engineering practice notes — published on duocodetech.com/blog.

Multi-Agent · OSS Comparative 2026 · 04 · 02

The Agent Team Problem: Seven Open-Source Approaches to Orchestrating AI Coding Agents

Comparative analysis of OrbitDock · codex-autorunner · Squad · Open-Inspect · Multica · Symphony · AionUi (20.7K ★) on four axes (identity / platform / infrastructure / state). Surfaces Symphony's Erlang-style zero-persistence as fault tolerance for agent coordination.

Read →

Source-Level Study 2026 · 03 · 31

The Harness That Makes the Model Useful: A Source-Level Study of Claude Code

513,237 LOC TypeScript across 1,902 files; 40 tools / 9 categories; 101 commands; 20 service modules. Thesis: 95% harness, 5% model interaction. Custom Ink renderer; 4-layer permissions; git-worktree multi-agent isolation.

Read →

Architecture Anatomy 2026 · 03 · 31

Inside Claude Code: How a 513K-Line Codebase Turns an LLM Into a Coding Agent

Anatomy of a 1,902-file codebase: Yoga-WASM rendering pipeline (60 fps double-buffer, viewport culling), Mailbox-async multi-agent isolation, MCP defer-load (300+ integrations), 4-layer permission stack with audit trails.

Read →

Architecture Proposal 2026 · 03 · 30

What Kind of Context Search Engine Do We Actually Need?

Distinguishes search ("where is X?") from orientation ("what does this codebase look like?"). Proposes MCP-based, token-budget-constrained engine with 4 tools and ~1,500-LOC reference. Generic embeddings only score 0.42–0.49 on code retrieval.

Read →

Engineering Practice 2025

My AI-Assisted Development Policy

A personal framework for using AI coding tools responsibly — how I stay productive without losing engineering rigor, and the rules I hold myself to.

Read →