Skip to content
View Haihan-Jiang's full-sized avatar

Block or report Haihan-Jiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Haihan-Jiang/README.md

Haihan Jiang

Production / SRE / infrastructure engineer focused on reliable systems, Kubernetes/cloud operations, observability, and automation that is safe to run in production.

I like work where the result is reviewable: smaller diffs, clear failure modes, tests or gates that prove behavior, and evidence a future on-call engineer can trust.

Best fit: SRE, Production Engineering, Infrastructure, Platform, Cloud/DevOps, and backend infrastructure roles close to real operations.

Fast Proof

  • Verified upstream PRs merged in Google and Google Cloud Platform maintained repositories.
  • Recent upstream work across gVisor, syzkaller, KHI, go-containerregistry, google/benchmark, stellar-engine, and vertex-ai-creative-studio.
  • Built a GKE AI inference reliability lab with OpenTelemetry traces, Kubernetes manifests, incident replay, and SLO-style evidence gates.
  • Production context includes Meta monetization data infrastructure and SHEIN gateway infrastructure work.
  • Experience around production gateways, Kubernetes/AKS-style platforms, Kafka, ZooKeeper, Elasticsearch, Terraform, runbooks, dashboards, and operational automation.

Contributor Signals

Google upstream contributor Google Cloud upstream contributor gVisor contributor syzkaller contributor KHI contributor go-containerregistry contributor Google Benchmark contributor

Projects where my upstream PRs have been merged: google/gvisor, google/syzkaller, GoogleCloudPlatform/khi, google/go-containerregistry, google/benchmark, google/stellar-engine, and GoogleCloudPlatform/vertex-ai-creative-studio.

Selected Upstream Work

Area Evidence
Container/runtime reliability google/gvisor#13276 - set swap for precreated cgroups
Kernel fuzzing / report parsing google/syzkaller#7420, google/syzkaller#7376
Kubernetes troubleshooting GoogleCloudPlatform/khi#708, GoogleCloudPlatform/khi#692
Container image tooling google/go-containerregistry#2318
C++ build/test infrastructure google/benchmark#2198, #2199, #2204
Safer cloud defaults google/stellar-engine#68, GoogleCloudPlatform/vertex-ai-creative-studio#1445

Live searches: org:google merged PRs / org:GoogleCloudPlatform merged PRs

Featured Builds

A runnable infrastructure lab for AI inference reliability:

  • OpenTelemetry trace collection and Kubernetes resource context
  • incident replay for baseline traffic, cache-miss latency, dependency timeout, and rollout regression
  • SLO-style reliability gate with published evidence reports
  • GKE-shaped manifests for collector RBAC, PVC-backed queue storage, and sample workloads

What I Optimize For

  • Production changes that can be rolled out, observed, and rolled back.
  • Automation with explicit inputs, validation, state, side effects, and retry boundaries.
  • Reliability evidence: runbooks, dashboards, audit trails, tests, and incident reports.
  • Practical open-source changes that reduce ambiguity for maintainers and users.

Stack

Python Go C++ Java SQL Bash Linux Kubernetes AKS GKE OpenTelemetry Terraform Ansible Nginx/APISIX Kafka ZooKeeper Elasticsearch CMake pkg-config GitHub Actions

Contact

Merged PR status was verified from GitHub on 2026-06-14. I keep merged work separate from review-in-progress work.

Pinned Loading

  1. benchmark benchmark Public

    Forked from google/benchmark

    Merged upstream Google Benchmark fixes: CMake/pkg-config docs, perf-counter gating, repetition-stat handling

    C++

  2. go-containerregistry go-containerregistry Public

    Forked from google/go-containerregistry

    Merged upstream go-containerregistry fix: avoid creating crane export tar after pull failure

    Go

  3. Haihan-Jiang.github.io Haihan-Jiang.github.io Public

    Production/SRE profile: Kubernetes, observability, infra automation, and verified Google/GCP upstream work

    HTML

  4. kubernetes-engine-samples kubernetes-engine-samples Public

    Forked from GoogleCloudPlatform/kubernetes-engine-samples

    Under-review GKE sample/doc fixes tied to Kubernetes reliability and cloud platform work

    HCL

  5. opentelemetry-operator-sample opentelemetry-operator-sample Public

    Forked from GoogleCloudPlatform/opentelemetry-operator-sample

    Under-review OpenTelemetry/Kubernetes recipes: queue storage, resource detection, instrumentation, GCP ops

    Go

  6. google/gvisor google/gvisor Public

    Application Kernel for Containers

    Go 18.5k 1.6k