Yuangang Li

Yuangang Li

PhD Student at UCI

Biography

Hi 👋! I am Yuangang Li, a PhD student at the Donald Bren School of Information and Computer Sciences, University of California, Irvine, supervised by Prof. Iftekhar Ahmed.

Before my PhD study at UCI, I spent wonderful years at the University of Southern California, completing my Master’s degree and working as a Research Assistant under Prof. Yue Zhao. During that time, I also had the pleasure of collaborating with Prof. Xiyang Hu and Prof. Jiechao Gao.

🌟 Looking for 2026 Summer internship. 🌟

Interests
  • 🤖 Trustworthy AI & Large Language Models (LLMs)
  • 💻 Software Engineering
  • 🌐 Decentralized Learning
  • ☁️ MLSys & Cloud Computing

News

  • [Sep 2025] Joined UC Irvine Donald Bren School as Research Assistant focusing on LLM transparency and code generation explainability.
  • [Aug 2025] Completed Research Assistant position at Stanford University on causal reasoning in LLMs.
  • [May 2025] Started as Research Assistant at Stanford University with LLM-guided semantic bootstrapping research.
  • [Dec 2024] Paper "NLP-ADBench: NLP Anomaly Detection Benchmark" published on arXiv (first author).
  • [Dec 2024] Paper "Ad-llm: Benchmarking Large Language Models for Anomaly Detection" published on arXiv.
  • [Dec 2024] Paper "A Large-scale Empirical Study on LLMs for Election Prediction" published on arXiv.
  • [Nov 2024] Paper "FedMetaMed: Federated Meta-Learning for Personalized Medical Treatment" accepted at IEEE BIBM 2024.
  • [Nov 2024] Paper "Fed-LDR: Federated Local Data-infused Graph Creation" accepted at IEEE ICDM SSTDM Workshop 2024.
  • [Sep 2024] Began Research Collaboration at University of Virginia with Dr. Yue Cheng on AutoML and LLM inference.
  • [Aug 2024] Paper "AI-Aided Digital Twin Design: A Systematic Review" published in Preprints (co-first author).
  • [Jul 2024] Paper "FedBCGD: Communication-efficient Federated Learning" accepted at ACM MM 2024.
  • [Jun 2024] Paper "H-FedSN: Hierarchical Federated Learning with Sparse Networks" published in IEEE IoT Journal.
  • [Dec 2023] Joined University of Virginia as Research Assistant with Dr. Jiechao Gao and Prof. Brad Campbell.
  • [Jul 2023] Joined USC FORTIS Lab as Research Assistant, advised by Prof. Yue Zhao on NLP anomaly detection.
  • [May 2023] Completed USC Games position as Game Analyst & Developer on "Hexagon Adventure".
  • [Mar 2023] Published "Ablator" framework on GitHub, open-source distributed ML framework used by 40+ researchers.
  • [Feb 2023] Joined USC as Research Engineer working on distributed AutoML with Dr. Iordanis Fostiropoulos.
  • [Jan 2023] Joined USC Games as Game Analyst & Developer.
  • [Dec 2022] Completed Infrastructure Engineer role at SenseTime.
  • [Jan 2022] Started M.S. in Computer Science at University of Southern California.
  • [Dec 2021] Joined SenseTime Research as Infrastructure Engineer developing "RocketMQ as a Service".
  • [Oct 2021] Completed Research Engineer role at Institute of Software, Chinese Academy of Sciences.
  • [Apr 2021] Joined Institute of Software, Chinese Academy of Sciences as Research Engineer.
  • [Apr 2021] Completed Back End Developer role at NiuTrans.
  • [Jan 2021] Joined NiuTrans as Back End Developer.
  • [Jun 2020] Completed Full-Stack Developer role at Beijing City University.
  • [May 2018] Joined Beijing City University as Full-Stack Developer & Team Leader.

Academia Experience

 
 
 
 
 
University of Southern California
Research Assistant
July 2023 – Present California

Advisor: Prof. Yue Zhao - NLP Anomaly Detection Toolkit and Benchmarking Development | LLM, AutoML, NLP, Anomaly Detection, PyTorch

  • Created a robust benchmarking system for evaluating algorithmic efficacy with over 40 PyOD library O.D. algorithms and 3 end2end algorithms across 20 datasets to facilitate comprehensive performance analysis and comparison.
  • Redesigned NLP datasets into a unified format for anomaly detection, addressing a field gap, and developed a system to automatically recommend optimal AD algorithms based on benchmarks.
  • Co-first authored a systematic review on AI-aided digital twin design, analyzing how machine learning enhances digital twins and their applications across multiple domains.
  • Constructed causal reasoning datasets, fine-tuned LLMs, and evaluated using hallucination benchmarks.
  • Investigated generative AI techniques to tackle Anomaly and Out-of-Distribution Detection problems and explored the capabilities of LLMs in politics.
 
 
 
 
 
University of Virginia
Researche Collaboration
University of Virginia
September 2024 – Present Remote

Advisor: Dr. Yue Cheng - Effective Machine Learning Systems Development | LLM Inference, LLM compression

  • Researched an automated method to generate the optimal LLM inference compression algorithm for user-specific tasks.
 
 
 
 
 
University of Virginia
Independent Researcher
University of Virginia
July 2024 – September 2024 Remote

Advisor: Dr. Jiechao Gao - Federated Learning System Development | Federated Learning, PyTorch, Python, Spatio-temporal data

  • Developed the FedMetaMed, integrating federated learning and meta-learning to enhance personalized medication strategies across distributed healthcare systems, improving model adaptability and privacy preservation.
  • Single-handedly developed FedLDR, a brand new federated learning algorithm that employs GCN to enhance spatio-temporal data analysis through local data integration and node-centric optimization. = FedMetaMed and FedLDR[were accepted by BIBM and ICDM respectively, and I gave talks on these research.
 
 
 
 
 
University of Virginia
Independent Researcher
University of Virginia
January 2024 – July 2024 Remote

Advisor: Dr. Jiechao Gao, Prof. Brad Campbell - Hierarchical Federated Learning System Development | Hierarchical Federated Learning, PyTorch, Python, Sparse Network

  • Independently developed H-FedSN pushes the boundaries of IoT with a unique approach that uses masking techniques to train a sparse network, enhancing personalization through client-based transfer learning. Applied to non-IID IoT datasets, it achieves high accuracy and boosts communication efficiency by at least 58x.
  • Solely developed and integrated innovative federated learning algorithms—FedAvg, FedCAMS, FedPer, PerFedAvg, and FedRS—into a hierarchical framework to optimally benchmark against H-FedSN.
 
 
 
 
 
University of Southern California
Research Assistant / Engineer
March 2023 – July 2023 California

Advisor: Dr. Iordanis Fostiropoulos - Distributed ML Execution Framework Development | MLSys, AutoML, Ray, Docker, Github Action, PyTorch, Pytest

  • Contributed to “Ablator”, an open-source Deep Learning framework used by 40+ USC researchers for horizontal scaling of ablation experiments and hyperparameter tuning, encompassing 70 pull requests.
  • Implemented distributed experiment execution with Ray, managed open-source projects, set up CI pipelines via GitHub Actions, oversaw release management and version control, and authored pytest unit tests with 97% coverage.
  • Solely launched ‘python-rclone’ on PyPI, a Python API for RClone that streamlines cloud data synchronization for ‘Ablator’, removing pre-installation requirements and enabling automatic binary selection (python-rclone).
 
 
 
 
 
Institute of Software, Chinese Academy of Sciences
Research Assistant / Engineer
Institute of Software, Chinese Academy of Sciences
May 2021 – October 2021 Beijing

Advisor: Prof. Guoquan Wu - Automated Testing Platform Development | Docker, Node.js, JSON, Vue.js, RobotFramework

  • Contributed to the R&D of a web-based automated testing tool using Record and Playback technology, significantly enhancing test case management by enabling streamlined recording, editing, execution, analysis, and result generation. This implementation boosted end-to-end testing efficiency by 300% and saved over 15 hours per week.
  • Independently developed a script parser using Node.js that converts user actions recorded in JSON format into executable Robot Framework and Selenium scripts, enabling the replay and repeated execution of these user actions.
  • Single-handedly created innovative UI components using Vue.js and AceEditor, orchestrated the optimal containerization of the program with Docker, and automated the DevOps pipeline to maximize development efficiency.

Industry Experience

 
 
 
 
 
SenseTime
Infrastructure Engineer & Researcher
SenseTime
January 2022 – January 2023 Bejing

SaaS Platform Development {Demo} | Kubernetes, Docker, Go, CRD, Operator-SDK, Helm3, Prometheus, Grafana

  • Developed “RocketMQ as a Service”, akin to “RabbitMQ as a Service” in AWS Marketplace, offering fully managed SaaS-based RocketMQ clusters, increasing 100% creation speed and saving 10+ hours/week in manual operations.
  • Utilized Operator SDK to build a Kubernetes-based RocketMQ Operator and CRD automating lifecycle management.
  • Employed Helm3 to package RocketMQ’s components into Helm charts, simplifying Kubernetes deployment.
  • Implemented Prometheus and Grafana for real-time monitoring of critical service metrics and node health.
  • Automated workflows, including unit tests, image builds, and Helm3 Chart updates, via GitLab CI/CD.
  • Researched and evaluated container runtimes (sysbox, crun, youki) for suitability as replacements for Docker in the SaaS platform, ensuring CRI-O compliance and robust community support.
  • Optimized a machine learning training pipeline using GPUs on Kubernetes for enhanced computational efficiency.
 
 
 
 
 
Full Stack Engineer
Xiaoniu Translations (Beijing) Technology Co., Ltd.
January 2021 – April 2021 Beijing

Text Translation Platform Development | Java, SpringBoot, Spring, Java Persistence API, Maven, Nginx, MySQL, Git

  • Developed an AI document translation system with Java/Spring/Maven, independently created a PDF/XML parsing module attracting 30,000 MAUs, used Nginx for reverse proxy, and managed version control with Git.

Projects

Apache/RocketMQ Operator

Apache/RocketMQ Operator

RocketMQ Operator is to manage RocketMQ service instances deployed on the Kubernetes cluster. It is built using the Operator SDK, which is part of the Operator Framework.

PyRCLONE

PyRCLONE

A Python wrapper for rclone, available on PyPI, conveniently includes the rclone binary (version v1.62.2) eliminating the need for pre-installation of rclone. It caters to various operating systems like Windows, Mac, and Linux, and supports both amd64 and x86_64 architectures. When a user downloads the package, the appropriate rclone binary file is installed based on their system type.

SLinux OS

SLinux OS

This project is a simple operating system implementation, designed to provide a hands-on learning experience for understanding the fundamentals of operating system development. It includes components such as a bootloader, kernel, and basic system utilities.

USC/ABLATOR

USC/ABLATOR

ABLATOR is a DISTRIBUTED EXECUTION FRAMEWORK designed to enhance ablation studies in complex machine learning models. It automates the process of configuration and conducts multiple experiments in parallel.

USC/Hexagon Adventure (2D Game)

USC/Hexagon Adventure (2D Game)

Hexagon Adventure is a 2D game developed using Unity3D. The game is designed to be a fun and challenging experience for players of all ages.

USC/EventMaster

USC/EventMaster

Event search platform

Transformer from Scratch

Transformer from Scratch

This project implements a Transformer model from scratch for a machine translation task. The goal is to build a functional Transformer model starting from the basic principles and gradually developing it into a full-fledged model capable of translating text between languages.

Services

Program Committee (PC) and/or Area Chair (AC) and/or Senior Area Chair(SAC) for Conferences