AI Engineer

Remote from
Europe flag
Europe
Annual salary
Undisclosed
Salary information is not provided for this position. Check our Salary Directory to estimate the average compensation for similar roles.
Employment type
Full Time,
Job posted
Apply before
10 Jul 2026
Experience level
Senior
Views / Applies
51 / 5

About Ruby Labs

Crafting Consumer Products of Tomorrow

Actively Hiring
Verified job posting
This job post has been manually reviewed for authenticity and compliance.

AI Summary

Ruby Labs is seeking a Senior AI Engineer to take high ownership of AI systems in production. The role involves end-to-end delivery of major AI features, designing agentic and tool-using AI workflows, and ensuring output quality across multiple LLM providers. You'll work in a modern stack with Next.js, TypeScript, Node.js, and Redis, collaborating with product, growth, and data teams. Candidates need 6+ years of backend engineering experience and 2+ years of production AI/LLM experience. This is a high-impact role driving the evolution of consumer products in health, education, and entertainment.

Role DNA

Job Complexity
Easy Hard
Pace & Pressure
Relaxed Fast-paced
Autonomy Level
Guided Full Ownership
Communication Load
Independent Highly Collaborative
AI Insight The role requires 6+ years of software engineering and 2+ years of production AI experience, along with deep expertise in LLM APIs, agentic systems, and orchestration frameworks, making it a demanding senior position.

Salary Analysis

Median Highly Competitive
$160,000
US Market
$120k – 200k
0 $220k
AI Insight No salary was specified in the listing. Based on the role's seniority and requirements, the offered salary is likely competitive with the US market median of approximately $160,000. Companies typically adjust based on location and experience, but this range reflects the high demand for Senior AI Engineers.

Key Skills

TypeScript Node.js Next.js LLM APIs LangChain Langfuse Agentic AI Production AI Systems Microservices Redis

I am writing to express my strong interest in the Senior AI Engineer position at Ruby Labs. With over 6 years of backend engineering experience and a proven track record of delivering production-grade AI systems, I am confident in my ability to drive the quality and evolution of your AI features.

My expertise includes designing agentic workflows, managing multiple LLM providers via OpenRouter, and building scalable systems with TypeScript, Next.js, and Node.js. I have successfully implemented multi-step LLM pipelines and used tools like Langfuse for evaluation and monitoring.

I am particularly drawn to the high-ownership culture at Ruby Labs and the opportunity to collaborate with cross-functional teams to shape consumer products. I thrive in fast-paced environments and enjoy solving complex challenges related to output reliability and cost optimization.

Thank you for considering my application. I look forward to discussing how my skills can contribute to Ruby Labs' innovative projects.

Can you describe a production AI system you built from scratch? What were the key challenges and how did you ensure reliability?
I led the development of a multi-agent system for customer support that integrated with multiple LLMs via OpenRouter. Key challenges included managing latency across providers and ensuring consistent output formats. I implemented fallback strategies and used Langfuse for monitoring, which reduced errors by 30%.
How do you approach designing tool-using AI workflows? Can you give an example of a tool contract you defined?
I design tool contracts by specifying function schemas with clear input/output types and error handling. For example, for a weather query tool, I defined an endpoint that accepts location and returns JSON, with retry logic for timeouts. This ensures reliable AI-to-service integration.
Explain how you would optimize cost and latency when using multiple LLM providers.
I use OpenRouter to route requests based on model performance and cost. For simple tasks, I route to cheaper models like Mistral, and for complex reasoning, I use GPT-4. I also implement caching for common prompts and use batching where possible.
Describe an instance where you had to debug a complex LLM hallucination issue. What steps did you take?
I used Langfuse traces to identify the step where the model generated incorrect data. I added output scoring with regex validation and forced the model to output structured JSON. I also improved prompting with explicit constraints and few-shot examples, reducing hallucination by 50%.
How do you ensure code quality and maintainability in a fast-paced AI engineering environment?
I enforce code reviews, unit tests, and integration tests for all AI-related code. I advocate for modular design, separating prompt logic from business logic. I also use feature flags to roll out AI changes gradually and monitor production metrics before full release.

About us

Ruby Labs is a leading tech company that creates and operates innovative consumer products. We offer a diverse range of opportunities across the health, education, and entertainment industries. Our innovative teams are driving the future of consumer-led products, and we’re always looking for passionate individuals to join us. Learn more about our story at: https://rubylabs.com/about-us/

About the role

At Ruby Labs we are looking for a Senior AI Engineer to own and drive the quality, reliability, and evolution of our AI systems in production.

This is a high-ownership role. You will be responsible for end-to-end delivery of major AI features, production stability of AI systems, and data-driven experimentation using tools like Langfuse, Mixpanel and OpenRouter. You’ll work in a modern stack built on Next.js, TypeScript, Node.js, and Redis, collaborating closely with product, growth, data, and billing teams. Increasingly, this includes building agentic, tool-using AI systems — defining clean tool contracts (including MCP-based tools) and orchestrating how AI interacts with internal services and business systems.

Our engineering organization uses a squad-based structure. You will operate within an AI engineering squad, contributing as a senior technical voice and driving engineering quality within your area of the product.

Key Responsibilities

  • Take complete ownership and deliver major AI engineering features within agreed timelines.

  • Own AI output quality, structure, and predictability across all user-facing AI interactions.

  • Design, implement, and maintain output-type-based AI systems, including segmentation, routing, and enforcement.

  • Ensure consistent output structure and formatting across different LLMs for the same request type.

  • Integrate and orchestrate multiple LLM providers via OpenRouter, managing model selection, fallback strategies, and cost optimisations.

  • Design and orchestrate tool-using and agentic AI workflows, defining clean tool contracts (including MCP-based tools), function-calling interfaces, and reliable AI-to-system integrations.

  • Build and maintain complex, multi-step LLM workflows, including with orchestration frameworks such as LangChain or LlamaIndex, for advanced reasoning, context reuse, and retrieval.

  • Design and manage production prompt systems with dynamic prompting, context injection, and conditional logic.

  • Own the deployment and release of LLM experiments, prompt management, and Langfuse-based evaluation pipelines.

  • Run A/B tests across models, analyse results, and present data-driven impact assessments of AI features and experiments.

  • Monitor AI system metrics, quality signals, latency, and release health using Langfuse and other observability tools.

  • Deep-debug complex LLM chains using Langfuse traces, identifying bottlenecks and optimising for cost, latency, and context-window usage, and build output-scoring systems to root-cause hallucinations and logic errors.

  • Write clean, scalable, and maintainable TypeScript code across the Next.js and Node.js stack.

  • Build reliable backend logic for AI systems, with strong error handling, request validation, fallback flows, and predictable behaviour in production, including reliable tool execution and AI-to-service integrations.

  • Ensure high code quality through testing, code reviews, and clear engineering standards.

  • Monitor, troubleshoot, and improve production performance, reliability, and system health.

  • Drive maintainability and technical quality through solid architecture, refactoring, and disciplined release practices.

Qualifications

  • 6+ years of backend/full-stack software engineering experience, including production-grade TypeScript/Node.js. Experience with Next.js and/or Python is a plus.

  • 2+ years of experience building AI/LLM systems in production. Less experience may be considered for exceptional candidates.

  • Deep hands-on experience working with LLM APIs (OpenAI, Anthropic, or similar) in production environments.

  • Experience with Agentic AI, multi-agent orchestration, tool-based workflows (function calling/tool execution), and/or RAG pipelines, including indexing, retrieval, and re-ranking.

  • Experience with LLM observability tools such as Langfuse, LangSmith, or similar platforms.

  • Experience with AI gateways and model routing solutions, such as OpenRouter or equivalent technologies.

  • Solid understanding of Redis and relational databases, such as PostgreSQL.

  • Exceptional ownership mindset and personal responsibility for engineering quality and delivery.

Nice to have

  • Experience with AI-centered development tools such as Cursor, Claude Code, Windsurf, or similar platforms.

  • Familiarity with evaluation frameworks, including LLM-as-a-judge, RAGAS, or similar approaches.

  • Experience working in high-pressure startup environments with rapid product iteration cycles.

  • Experience with MCP (Model Context Protocol), including building MCP servers/clients or designing tool contracts for AI agents.

  • Experience with edge and serverless runtimes, such as Cloudflare Workers, and supporting services including KV, Durable Objects, Queues, R2, and D1.

  • Experience with payments, billing and checkout flows, or orchestration platforms.

  • Practical experience fine-tuning models for domain-specific tasks or achieving strict JSON/schema compliance.

  • Working proficiency in Python for data science, evaluation scripts, or AI tooling.

Location

Ruby Labs operates within the CET (Central European Time) zone. Applicants from any country are welcome to apply for the position as long as they are located within approximately ± 4 hours of CET. This ensures optimal collaboration and communication during working hours.

Benefits

Discover the perks of being part of our vibrant team! We offer:

  • Remote Work Environment: Embrace the freedom to work from anywhere, anytime, promoting a healthy work-life balance.

  • Unlimited PTO: Enjoy unlimited paid time off to recharge and prioritize your well-being, without counting days.

  • Paid National Holidays: Celebrate and relax on national holidays with paid time off to unwind and recharge.

  • Company-provided MacBook: Experience seamless productivity with top-notch Apple MacBooks provided to all employees who need them.

  • Flexible Independent Contractor Agreement: Unlock the benefits of flexibility, autonomy, and entrepreneurial opportunities. Benefit from tax advantages, networking opportunities, reduced employment obligations, and the freedom to work from anywhere. Read more about it here: https://docs.google.com/document/d/1tzxGX4Uu7Ts_HCLFXESKLnKaaBfVCPf1f9AYZPrkjJM/preview?tab=t.0

Be part of our fast-growing team and seize this excellent opportunity for personal and professional growth!

Interview Process

After submitting your application, we conduct a thorough review which typically takes 3 to 5 days, but may occasionally take longer due to the volume of applications received. If we see a potential fit, we proceed with the following steps:

  • Recruiter Screening (40 minutes)

  • Technical Interview (60 minutes)

  • Second Interview (30 minutes)

  • Final Interview (20 minutes)

Life at Ruby Labs

At Ruby Labs, we move fast, aim high, and expect the same from our team. We’re not here to play small—we’re here to build, grow, and win. That means we look for people who are ambitious, driven, and ready to give their best every single day.

This is a place for individuals who thrive under pressure, embrace challenges, and see opportunity in every obstacle. If you’re hungry to achieve, motivated by impact, and want to grow at the speed of your own ambition, Ruby Labs offers the platform to make it happen.

Here, effort is matched with reward. We recognize those who go all in and deliver results, and we create space for people who want more—more responsibility, more growth, and more success.

#LI-Remote

Apply now >

Annual salary information is not provided for this position. Explore salary ranges for similar roles in our Salary Directory ›

This job listing has been manually reviewed by the Jobicy Trust & Safety Team for compliance with our posting guidelines, including verification of the company's legitimacy, accuracy of job details, clarity of remote work policy, and absence of misleading or fraudulent content.

How to apply

Did you apply? Let us know, and we’ll help you track your application.

See a few more

Similar Software Engineering remote jobs

Job Search Safety Tips

Here are some tips to help you search and apply for jobs safely:
Watch out for suspicious jobs Don't apply for jobs that offer high pay for little work or offer to hire you without an interview. Read more ›
Check the employer's profile Make sure you're applying for a trustworthy job by visiting the employer's profile and learning more about them. Read more ›
Protect your information Don't share personal details like your bank account or government-issued ID on suspicious websites or messengers. Read more ›
Report jobs that feel unsafe If you see a job that seems misleading, inappropriate or discriminatory, report it for going against our policies and we'll review it.

Share this job

Jobicy+ Subscription

Jobicy

614 professionals pay to access exclusive and experimental features on Jobicy

Free

USD $0/month

For people just getting started

  • • Unlimited applies and searches
  • • Access on web and mobile apps
  • • Weekly job alerts and digest
  • • Access to additional tools like Bookmarks, Applications, and more

Plus

USD $8/month

Everything in Free, and:

  • • Ad-free experience
  • • Daily job alerts and digest
  • • Personal career consultant
  • • AI-powered job advice
Go to account ›