Senior Data Engineer (Autonomous Vehicles Data)
Join our remote, long-term AV data team to turn massive sensor data into simulation-ready pipelines and ML-ready datasets for safer autonomous driving.
Senior Data Engineer (Autonomous Vehicles Data)
At RSB Automotive Consulting, we work with engineers and technology teams developing advanced mobility solutions. We are currently looking for a Senior Data Engineer to join a long-term project focused on data processing and analytics for Autonomous Vehicle (AV) development.
In this role, you will work with large-scale sensor datasets collected from test vehicles and support teams building simulation environments and machine learning pipelines used to validate safety-critical AV systems.
Project context: data pipelines and analytics supporting Autonomous Vehicle (AV) development
Tech stack: Python, SQL, Spark / PySpark, Databricks
Data scale: up to ~1 TB of sensor data per hour (cameras, LiDAR, radar)
Focus: time-series data, advanced analytics, data preparation for simulation and ML training
Work mode: remote (any location), with daily overlap with the US team
Start: ASAP
Project duration: long-term collaboration with a global automotive OEM
Your responsibilities
Analyze large volumes of sensor and time-series data from autonomous vehicle test fleets
Develop advanced SQL, Python, and PySpark queries to filter, transform, and aggregate datasets
Design and maintain ETL pipelines processing large-scale structured and semi-structured data
Identify and extract data suitable for AV simulation scenarios and ML training pipelines
Support the discovery of rare or complex driving situations (e.g. unusual traffic scenarios, hard braking events, edge cases)
Develop scripts and internal tools supporting data mining and analytics workflows
Collaborate with engineers working across the autonomous driving technology stack
What we’re looking for
Strong software engineering background
Advanced SQL skills with experience writing complex queries
Advanced Python programming
Hands-on experience with Spark / PySpark
Experience working with Databricks
Experience in advanced data analytics and time-series analysis
Understanding of data preparation for machine learning workflows
Project context
You will work with sensor data generated by autonomous vehicle test fleets equipped with multiple cameras, LiDAR, and radar. These systems generate extremely large datasets used to build simulation environments that help validate autonomous driving algorithms.
The role focuses on transforming raw sensor data into structured, simulation-ready datasets used by engineering and research teams.
If you are interested in working with large-scale data systems and real-world autonomous driving datasets, we would be happy to connect.
- Department
- Automotive
- Role
- Software Engineer
- Remote status
- Fully Remote
- Job-ID:
- JB-129
Kraków
About RSB Automotive Consulting
RSB Automotive Consulting specialises in providing automotive competencies. We have a strong focus on all candidate’s profile aspects:
- Knowledge
- Experience
- Communication