Remote Mid/Senior Data Engineer @ CodiLime

Get to know us better

CodiLime is a software and network engineering industry expert and the first-choice service partner for top global networking hardware providers, software providers, and telecoms. We create proofs-of-concept, help our clients build new products, nurture existing ones, and provide services in production environments. Our clients include both tech startups and big players in various industries and geographic locations (US, Japan, Israel, and Europe).

While no longer a startup, we have 250+ people on board and have been operating since 2011. We’ve kept our people-oriented culture. Our values are simple:

The project and the team

 The project is divided into two main parts:

You will spend approximately 70% of your time on data processing activities, contributing to the continuous improvement of the large dataset. The remaining 30% will focus on maintaining the platform, working with the API, and ensuring proper integration with the latest version of the dataset.

The goal of the project is to build a centralized, large-scale business data platform for one of the biggest global consulting firms. The final dataset must be enterprise-grade, providing consultants with reliable, easily accessible information to help them quickly and effectively analyze company profiles during Mergers & Acquisitions (M&A) projects.

You will contribute to building data pipelines that ingest, clean, transform, and integrate large datasets from more than 10 different data sources, resulting in a unified database containing over 300 million company records. The data must be accurate, well-structured, and optimized for low-latency querying. The dataset will power several internal applications, enabling a robust search experience across massive datasets and making your work directly impactful across the organization.

The data will provide firm-level and site-level information, including firmographics, technographics, and hierarchical relationships (e.g., GU, DU, subsidiary, site). This platform will serve as a key data backbone for consultants, delivering critical metrics such as revenue, CAGR, EBITDA, number of employees, acquisitions, divestitures, competitors, industry classification, web traffic, related brands, and more.

Technology stack:

What else you should know:

We work on multiple interesting projects at the same time, so it may happen that we’ll invite you to the interview for another project if we see that your competencies and profile are well suited for it.

Your role

As a part of the project team, you will be responsible for:

Do we have a match?

As a Data Engineer, you must meet the following criteria:

Beyond the criteria above, we would appreciate the nice-to-haves:

More reasons to join us