Dear Hiring Team,
I am writing to express my strong interest in the Senior ML Engineer (Token Factory) position at Nebius. With a profound understanding of transformer architectures and extensive experience in GPU workload profiling using tools like Nsight and PyTorch profiler, I am excited about the opportunity to maximize inference throughput and minimize latency across tens of thousands of GPUs.
My background includes contributing to open-source inference engines such as vLLM and TensorRT-LLM, and I have successfully implemented low-precision training pipelines using FP8. I thrive in fast-paced startup environments, take full ownership of projects, and collaborate effectively with cross-functional teams.
I am eager to bring my expertise in LLM inference optimization and distributed systems to Nebius, and I look forward to the possibility of discussing how I can contribute to the Token Factory's mission. Thank you for considering my application.
Sincerely,
[Your Name]








