MLOps Engineer
Together AI
• www.linkedin.com/company/togethercomputer
• Full-time
Jul 2025 - Present
Amsterdam, Netherlands
MLOps/DevOps Engineer
Stealth Startup
• www.linkedin.com/company/stealth-startup-community
• Full-time
Jul 2023 - Jul 2025
San Francisco Bay Area
ML/MLOps Engineer
Stealth Startup
• www.linkedin.com/company/stealth-startup-community
• Full-time
Nov 2022 - Mar 2024
Tbilisi, Georgia
Data Scientist
Stealth Startup
• www.linkedin.com/company/stealth-startup-community
• Full-time
Sep 2021 - Nov 2022
San Francisco Bay Area
Teaching Assistant
Higher School of Economics
• www.linkedin.com/school/%D0%B2%D1%8B%D1%81%D1%88%D0%B0%D1%8F-%D1%88%D0%BA%D0%BE%D0%BB%D0%B0-%D1%8D%D0%BA%D0%BE%D0%BD%D0%BE%D0%BC%D0%B8%D0%BA%D0%B8
• Part-time
Jan 2020 - Dec 2021
Moscow, Russia

MLOps Engineer
Hi, I’m Gleb, an MLOps/DevOps engineer with 4+ years of experience building and scaling ML infrastructure.

Recent work:
• Built GPU-powered, multi-modal serving pipelines for 30k+ daily users.
• Deployed Kubernetes-native GPU clusters, cutting hosting costs by 50%.
• Set up Grafana Cloud dashboards and alerts, reducing on-call guesswork by over 40%.

What do I do?
• Bridge research and production environments.
• Build reproducible development setups with Nix and dev containers.
• Code in Python, Rust, Go, C++, and TypeScript.
• Keep codebases clean with pre-commit hooks and conventional commits.
• Package applications with Docker.
• Orchestrate deployments with Kubernetes, Kustomize, and Helm.
• Set up infrastructure using Terraform, Ansible, and GitOps via Argo CD.
• Automate testing and deployments with CI/CD pipelines.
• Train models with PyTorch and Lightning; track experiments using Weights & Biases.
• Deploy those models with NVIDIA Triton Inference Server.
• Manage data with Supabase and PostgreSQL.
• Build asynchronous pipelines with Celery and Kafka.
• Develop APIs with FastAPI and grpc-go.
• Monitor metrics, logs, and traces through Grafana Cloud.

What domains do I know?
• Speech processing and synthesis.
• Natural language processing.
• Finance and quantitative modeling.

How do I work?
I focus on measurable wins: lower latency, higher uptime, reduced costs, and better user retention.

Contact:
📧 khaykingleb@gmail.com
🌐 khaykingleb.com

Skills: Python, Rust, Go, SQL, TypeScript, Kubernetes, Docker, Terraform, AWS, GCP, NVIDIA Triton, PyTorch, Lightning, Grafana
hse.ru
together.xyz
together.ai
United States