About Me

Hello! I'm Hao Feng, a software engineer and current Master's student in Computer Science at the University of Southern California (GPA 3.8/4.0). I have a strong passion for building scalable backend systems, developing intelligent applications, and advancing AI-driven solutions that connect research with real-world impact.

I’ve worked as a Software Engineer Intern at Alibaba, where I contributed to DaMai’s large-scale ticketing and movie pipeline, integrating GraphRAG with LLMs to improve retrieval precision, and automating fault diagnostics with AI-powered tools. At Unity, I focused on SDK engine development, maintaining a JavaScript runtime via JNI and Google V8 Engine, and decompiling WeChat MiniGames to ensure seamless execution across platforms. My research experience at USC involved extending SHAP to multimodal AI models for improved interpretability, while earlier at Wuhan University I applied graph algorithms (SimRank) to historical data analysis.

Beyond industry and research, I actively contribute to open source. In LightRAG, I implemented vector indexing (HNSW/IVFFLAT) and query optimization for pgvector, enabling large-scale recommendation use cases. I also enjoy designing systems that combine retrieval, knowledge graphs, and AI reasoning to create smarter, more explainable applications.

My expertise spans Java, Python, C++, distributed systems, and machine learning frameworks like PyTorch and TensorFlow. I thrive in fast-paced environments that require both deep technical skills and cross-functional collaboration. Always eager to explore cutting-edge technologies, I aim to bridge backend engineering with AI innovation to deliver products that matter.

Skills & Technologies

Programming & Languages

Java Python C++ JavaScript / TypeScript SQL

AI & Machine Learning

PyTorch TensorFlow scikit-learn HuggingFace SHAP LangGraph LLM RAG

Backend & Distributed Systems

Spring Boot FastAPI Redis MySQL PostgreSQL Kafka RocketMQ

Cloud & DevOps

Docker Kubernetes AWS Alibaba Cloud CI/CD Git

Experience

Software Engineer Intern

Alibaba May 2025 – Aug 2025

Migrated 10M+ user assets to DaMai’s independent movie pipeline, optimized AI assistant with GraphRAG and LLM-based retrieval (recall@20 +16%), and built automated diagnostics cutting RCA time from 30 min to under 5 min.

Software Engineer Intern

Unity Sep 2024 – Apr 2025

Maintained a JavaScript runtime using JNI and V8 engine, decompiled WeChat MiniGames with Node.js pipelines, and built a WebSocket-based auto-login system covering 85% of games, reducing manual effort by 80%.

Research Assistant

USC - Laboratory for Machine Learning, Health and Biomedicine Jan 2024 – Jul 2024

Developed multimodal SHAP interpretation for LXMERT, computing Shapley values across text and image modalities with mathematical correctness verification.

Research Assistant

Wuhan University May 2022 – Apr 2023

Constructed a Qin Dynasty Liye strips knowledge graph by applying optimized SimRank to compute pairwise similarities across thousands of historical strips.

Education

Master of Science in Computer Science

University of Southern California Aug 2023 – Dec 2025 (expected)

GPA: 3.8/4.0. Coursework: Deep Learning, Machine Learning, Geospatial Information Analysis, Algorithms.

Bachelor of Engineering in Software Engineering

Wuhan University Aug 2019 – Jun 2023

Overall GPA: 88.57/100. Research focus on knowledge graphs and data analysis.