Skip to content
View valk's full-sized avatar

Block or report valk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
valk/README.md

Val Kotlarov Hoffman

Principal Systems Architect | AI Performance Engineering

  • πŸ”­ Current Focus: Hardware-aware LLM optimization, GPU orchestration, and custom Triton kernels.
  • πŸš€ Nebius Academy: Mastering high-performance infrastructure for the Israel National AI Supercomputer.
  • πŸ› οΈ Experience: 25+ years in low-level R&D, from compilers to high-frequency trading engines.

⚑ AI Infrastructure & Systems Mastery

  • AI Compute: Triton, CUDA Kernels, GPU Memory Optimization (HBM3/SRAM), Inference Scaling.
  • Systems Architecture: High-Performance Computing (HPC), Distributed Clusters, C++, Go, Rust.
  • MLOps: GPU Orchestration, Docker/K8s for AI, Latency-Critical Backend Systems.
  • Legacy Expertise: Compiler Design (Lex/Yacc), R&D, Real-time Search, and IoT Data Streams.

πŸ§ͺ Performance Labs (Nebius Academy)

I am documenting my deep-dive into AI Performance Engineering here:


πŸ’Ό Strategic Engineering Leadership

Founder & CTO | vkhey! Mar 2025 – Present

  • Architecting high-performance multimodal RAG systems (text, video, audio) for enterprise scale.
  • Bridging 25 years of systems engineering into the "plumbing" of the agentic AI era.

Principal Developer | Labguru May 2023 – Mar 2025

  • Engineered a first-of-its-kind cross-server communication platform for distributed lab environments.
  • Led technical implementation for ISO 27001, GDPR, and SOC 2 compliance in high-security environments.

Principal Developer | Kando Jan 2021 – Mar 2023

  • Tech lead for the National COVID-19 monitoring project, managing massive IoT data streams.
  • Optimized data-processing algorithms and high-performance dashboards for national-scale infrastructure.

Principal R&D Dev | SeekingAlpha Jul 2015 – Jul 2020

  • Built multi-million dollar ad products and high-throughput analytics pipelines for 20M+ monthly users.

πŸ›  The Polyglot Foundation

Note: While I am a polyglot (Python, Ruby, Node.js, Go, C#, C++), I specialize in System Design where the choice of tool is dictated by hardware constraints and performance requirements. Currently spending my days in Triton and C++ to squeeze every TFLOP out of the H100.


πŸ“« Connect with Me


Credits

This profile and current R&D projects are powered by real-time web grounding via the Brave Search API.
Brave Logo

Popular repositories Loading

  1. jquery-ui-rails jquery-ui-rails Public

    Forked from jquery-ui-rails/jquery-ui-rails

    jQuery UI for the Rails 3.1+ asset pipeline

    Ruby 1

  2. web-developer-interview-task web-developer-interview-task Public

    Example mini web-framework written in pure PHP v5.3.

    1

  3. ai-performance-engineering ai-performance-engineering Public

    Lecture notes for AI Performance Engineering course

    1

  4. jquery-rails jquery-rails Public

    Forked from indirect/jquery-rails

    A gem to automate using jQuery with Rails 3

    Ruby

  5. rack-less rack-less Public

    Forked from kellyredding/rack-less

    LESS CSS preprocessing for Rack apps.

    Ruby

  6. edible-recipe edible-recipe Public

    An engine with food recipes in mind. Useful to embed into blogs or other food-related apps.

    Ruby 1