Skip to content
View AdvancedUno's full-sized avatar
😁
Happy Uno
😁
Happy Uno

Block or report AdvancedUno

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
AdvancedUno/README.md

Hi there πŸ‘‹, I'm Uno Lee (EunHo)

Software Engineer | Distributed Systems & AI Infrastructure

Email LinkedIn Portfolio

I'm a BS/MS Computer Science student at Washington University in St. Louis (GPA: 4.0), previously studying Mathematics at Denison University. I specialize in building highly scalable microservices, low-latency data pipelines, and optimized GPU kernels.

Currently working on: Distributed scheduling research (I-Cilk) and high-performance CUDA computing.


πŸš€ Experience Highlights

  • Microsoft | Software Engineer Intern (App Gateway) | May 2026 – Aug 2026
  • Microsoft | Software Engineer Intern (App Gateway) | May 2025 – Aug 2025
    • Led end-to-end onboarding of App Gateway managed identity certificate renewal, enhancing operational reliability.
    • Transitioned to IMDS-based token management, eliminating credential storage in Key Vault to meet strict compliance needs.
  • Palantir | Software Fellowship | Mar 2025 – Apr 2025
    • Designed a microservices-based platform integrating Palantir OSDK for semantic search and optimization.
  • Key Capture Energy | Software Engineer Intern | May 2024 – Apr 2025
    • Optimized microservice efficiency by 20% utilizing gRPC, C#, Docker, and Kubernetes.
  • INTELLIZ | Software Engineer Intern | Nov 2021 – Aug 2023
    • Engineered CV modules (OpenCV/PyTorch) and integrated TensorRT/ONNX models, slashing inference times by 23%.

πŸ› οΈ Tech Stack

Languages: C++, C, C#, CUDA, Go, Python, JavaScript, SQL, Java, R
Frameworks & Tools: PyTorch, Spark, Kafka, Flink, React.js, Node.js, .NET/Blazor, Django
Infrastructure: Docker, Kubernetes (AKS), Azure, AWS, GitHub Actions


πŸ’» Featured Projects

C++, CUDA, Nvidia RTX 4000 Ada

  • Engineered a highly optimized Flash Attention kernel from scratch, translating scaled dot-product attention from a CPU reference down to bare-metal parallel execution.
  • Eliminated VRAM bottlenecks using kernel fusion, shared memory tiling, and a custom online softmax algorithm.

Kafka, Apache Flink, AWS S3, Snowflake

  • Built a fault-tolerant streaming pipeline processing 100+ million daily market events with sub-second latency.
  • Synchronized and detected cross-exchange crypto arbitrage opportunities in real-time.

C#, Golang, Node.js, gRPC, Kubernetes, Azure

  • Designed a 5-microservice architecture communicating via gRPC, HTTP, and Kafka.
  • Fully containerized and deployed on Azure Kubernetes Service (AKS) with automated CI/CD pipelines via GitHub Actions.

🌍 Open Source Contributions

I actively contribute to the open-source community, particularly in data and machine learning ecosystems:


Let's connect! I'm always open to discussing distributed systems, infrastructure, or exciting software engineering opportunities.

Pinned Loading

  1. Deep-Learning Deep-Learning Public

    This repository is a treasure trove for those passionate about harnessing the power of artificial intelligence to unlock new potentials. Whether you're a student eager to learn, a researcher seekin…

    C++ 2

  2. Google-Universal-Image-Embedding-Competition Google-Universal-Image-Embedding-Competition Public

    Python

  3. OrderBook OrderBook Public

    C++

  4. gRpcService gRpcService Public

    This repository contains a gRPC service developed using C# and .NET. The service is designed to provide robust and scalable communication for various applications.

    C#

  5. microsoft/onnxruntime microsoft/onnxruntime Public

    ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

    C++ 21k 4k

  6. Flash_Attention_Using_CUDA Flash_Attention_Using_CUDA Public

    Cuda 4