140 Data Preprocessing jobs in Singapore
Data Engineering Specialist
Posted today
Job Description
We are seeking a skilled Data Engineer with experience or interest in IoT technologies and cloud-based data engineering to fill this key role.
Key Responsibilities:
- Design, develop, and maintain scalable and efficient data pipelines and ETL processes for IoT and enterprise data systems.
- Implement data ingestion workflows from IoT devices and integrate with enterprise platforms using Azure Data Factory or similar tools.
- Ensure data quality through validation, cleansing, and monitoring processes to address issues such as missing data, duplicates, and inconsistencies.
- Define data attributes and formats for IoT device and network data to support seamless integration with existing systems and standards.
- Optimise data storage solutions in Azure Data Lake Storage (or equivalent) for structured and unstructured data.
- Develop APIs and data interfaces for real-time or near-real-time data transfer between IoT components and enterprise platforms.
- Apply advanced analytics techniques to IoT data for performance monitoring, usage profiling, and network management.
- Leverage BI tools (e.g., Power BI) to enable business intelligence and operational insights.
- Implement robust data security and privacy measures, ensuring compliance with relevant regulations.
- Collaborate with cross-functional teams to gather requirements and deliver high-quality, documented solutions.
This position requires expertise in IoT data systems and modern cloud engineering to drive innovation and excellence across industries.
Bonus Points:
- Familiarity with machine learning algorithms and frameworks.
- Experience with containerization using Docker and Kubernetes.
- Knowledge of data governance and compliance best practices.
As a Data Engineer, you will have the opportunity to work on cutting-edge projects, driving business growth and improving operational efficiency.
About the Role:
- Type: Full-time.
- Location: Remote.
Data Engineering Specialist
Posted today
Job Description
We are seeking skilled developers with a strong background in coding and test case writing to join our data engineering team.
Candidates should have experience with SQL or Informatica, but applicants whose background is limited primarily to these areas will not be considered.
The ideal candidate will possess excellent coding skills and be able to write effective test cases.
Required Skills and Qualifications:
- Strong coding skills
- Ability to write test cases
- Experience with SQL or Informatica
Benefits:
- Global diversity: Be part of an international team celebrating diverse perspectives and collaboration
- Trust and growth: Nurture your talent and empower yourself to reach new heights
- Continuous learning: Unlock your full potential with over 250 training modules
- Vibrant culture: Enjoy a workplace where energy, fun, and camaraderie come together
- Meaningful impact: Join us in making a difference through CSR initiatives
Data Engineering Manager
Posted today
Job Description
We're looking for a Lead Data Engineer / Data Engineering Manager to join our client and lead the design and delivery of scalable, cloud-based data solutions. In this role, you'll work closely with cross-functional teams to solve complex data challenges, modernize infrastructure, and shape strategic data platforms from the ground up.
You'll drive technical direction, design reusable solutions, and influence best practices in data engineering and governance across high-impact projects.
What You'll Do
- Advise on data strategy, architecture, and implementation.
- Build and optimize data pipelines across cloud and on-prem environments.
- Develop secure, scalable infrastructure for structured and unstructured data.
- Design reusable data models and maintain metadata and lineage.
- Ensure governance, data quality, and access controls are in place.
- Support both greenfield builds and legacy modernization efforts.
- Mentor teams and contribute to internal capability development.
What You Bring
- Over 10 years of experience in data engineering, platform or cloud infrastructure.
- Expertise in cloud platforms (AWS, Azure, GCP) and distributed systems (Spark, Hadoop).
- Proficient in Python, SQL, Java, Scala.
- Experience with orchestration tools (e.g. Airflow, ADF) and DevOps (Docker, Git, Terraform).
- Familiarity with Databricks and real-time/batch data pipelines.
- Strong grasp of data governance, security, and compliance practices.
- Clear communicator with strong stakeholder management skills.
- Proven ability to lead, mentor, and drive technical alignment across teams.
For more information you can contact Norean Tan at
We regret to inform you that only shortlisted candidates will be notified / contacted.
EA Registration No.: R - Tan Lee Ying, Norean
iKas International (Asia) Pte Ltd
ROC No.: E | EA License No.: 16S8086
Management Skills
Airflow
Azure
Pipelines
Hadoop
Technical Direction
Data Quality
Data Design
Data Engineering
SQL
Python
Docker
Cloud
Java
Orchestration
Data Strategy
Head of Data Engineering
Posted today
Job Description
We're looking for a Lead Data Engineer / Data Engineering Manager to join our client and lead the design and delivery of scalable, cloud-based data solutions. In this role, you’ll work closely with cross-functional teams to solve complex data challenges, modernize infrastructure, and shape strategic data platforms from the ground up.
You’ll drive technical direction, design reusable solutions, and influence best practices in data engineering and governance across high-impact projects.
What You’ll Do
- Advise on data strategy, architecture, and implementation.
- Build and optimize data pipelines across cloud and on-prem environments.
- Develop secure, scalable infrastructure for structured and unstructured data.
- Design reusable data models and maintain metadata and lineage.
- Ensure governance, data quality, and access controls are in place.
- Support both greenfield builds and legacy modernization efforts.
- Mentor teams and contribute to internal capability development.
What You Bring
- Over 10 years of experience in data engineering, platform or cloud infrastructure.
- Expertise in cloud platforms (AWS, Azure, GCP) and distributed systems (Spark, Hadoop).
- Proficient in Python, SQL, Java, Scala.
- Experience with orchestration tools (e.g. Airflow, ADF) and DevOps (Docker, Git, Terraform).
- Familiarity with Databricks and real-time/batch data pipelines.
- Strong grasp of data governance, security, and compliance practices.
- Clear communicator with strong stakeholder management skills.
- Proven ability to lead, mentor, and drive technical alignment across teams.
For more information you can contact Norean Tan at
We regret to inform you that only shortlisted candidates will be notified / contacted.
EA Registration No.: R - Tan Lee Ying, Norean
iKas International (Asia) Pte Ltd
ROC No.: E | EA License No.: 16S8086
AI / ML / Data Engineering
Posted today
Job Description
Overview
We are a technology venture backed by Shell, a multinational energy major, dedicated to tackling the most complex issues in the energy sector through groundbreaking research and development in AI and Data Science. Founded in Singapore, our ultimate goal is to establish ourselves as the leading hub for AI and Data Science R&D in Southeast Asia.
The Opportunity
We are looking for sensible and inquisitive engineers and scientists with a strong background in the fields of artificial intelligence, deep learning, data science/engineering, software development, high-performance computing, applied mathematics and/or physics.
As a member of our R&D team, you will have the opportunity to work with some of the brightest and most passionate people in the industry, collaborating on innovative solutions. Your work will have an immediate impact on the energy sector. You will be part of a collaborative and inclusive work environment that values authenticity, integrity, technical mastery, collaboration and impact.
Prepare yourself for an intellectually stimulating journey where each day brings fresh challenges and opportunities for growth. You will immerse yourself in continuous learning experiences, honing your technical skills across various subfields such as artificial intelligence, machine learning, HPC, and geoscience. Furthermore, your work holds the potential to be published in esteemed scientific conferences and journals, offering you a platform to share your insights and discoveries with the global community.
Responsibilities
- Conduct applied research and development focused on the core technologies of our company.
- Design and develop software libraries tailored to run efficiently on our high-performance computing infrastructure.
- Apply your domain knowledge and expertise to contribute to high-stakes engineering projects that have a tangible impact in the real world.
In line with our performance-driven culture, you will be rewarded with a competitive package with unparalleled performance incentives.
Who You Are
You have exceptional analytical and problem-solving skills. You are intellectually curious, constantly seeking knowledge and growth. You take strong ownership of your work, proactively ensuring its quality and timely delivery.
Requirements
- Bachelor's or Master's degree in Computing, Science or Engineering with 2+ years of software development experience
- Strong programming skills in Python, specifically in NumPy
- Experience with development tools such as git, Linux shell scripting, vim/emacs, etc.
Preferred Qualifications
- Programming skills in C, C++
- Experience with parallel computing, system programming
Benefits
Why Join LRD?
LRD offers a conducive work environment in a company culture that values authenticity, integrity, technical skills, teamwork and results. Here are some of the opportunities offered by the position:
- You are constantly presented with new, intellectually stimulating problems to solve. The work never gets stale, mundane or boring.
- Continuous learning opportunities to develop your technical expertise in many subfields of AI, ML, HPC, geoscience, engineering, etc.
- Have your work recognized with publication opportunities in top scientific conferences and journals.
- Be rewarded for your contributions with high performance bonuses well above market rate.
Seniority level: Associate
Employment type: Full-time
Job function: Information Technology
Industries: IT Services and IT Consulting
Head of PT Data Engineering
Posted 6 days ago
Job Description
**The Position**
The Pharma Technical Operations (PT) department is establishing the One PT Data Office to serve as the strategic center for data governance, strategy, and enablement across the entire global PT network. This team is at the heart of our digital transformation, responsible for architecting and leading a central data office to unlock the full potential of PT's data assets.
The Head of PT Data Engineering will be instrumental in building the robust data backbone that powers PT's digital transformation and data-driven decision making. Reporting into the One PT Data Office, this critical role is accountable for leading a cutting-edge internal and external global data engineering team. You will define the strategy, evolve the data platforms and processes, and oversee the delivery of scalable, high-quality data products to enable advanced analytics, AI initiatives, and critical business processes across Pharma Technical Operations. You will lead a critical team of internal and external data engineers, fostering a culture of technical excellence, innovation, and continuous delivery. This pivotal role requires a visionary leader to build and manage the foundational data infrastructure, pipelines, and platforms that enable the seamless flow of high-quality, FAIR data from diverse sources to data consumers, ensuring compliance, scalability, and future readiness for PT's ambitious digital agenda.
**The Opportunity**
+ Provide strategic leadership and vision for PT's global data engineering capabilities, defining the roadmap for data ingestion, transformation, storage, and consumption architectures.
+ Accountable for the design, development, and evolution of scalable, robust, and cost-effective data platforms (e.g., data lakes, data warehouses, streaming platforms) that support PT's advanced analytics, AI/ML, and data product needs.
+ Define and implement best practices, standards, and guidelines for data modeling, ETL/ELT processes, data quality, and data pipeline orchestration across the PT landscape.
+ Actively monitor and integrate cutting-edge industry trends, emerging data engineering technologies, and cloud-native solutions to continually optimize PT's data infrastructure in close collaboration with IT.
+ Build, mentor, mobilize, and empower a high-performing, global team of internal and external data engineers, fostering a culture of technical excellence, innovation, and agile delivery.
+ Accountable for the end-to-end delivery and operational excellence of critical data pipelines, ensuring timely, accurate, and reliable data availability for PT's business processes and analytical use cases.
+ Ensure data infrastructure and pipelines adhere to strict quality, security, and compliance standards (e.g., GxP, data integrity, data privacy), collaborating closely with Data Governance and Cybersecurity teams.
+ Drive the automation and optimization of data engineering workflows to enhance efficiency, reduce manual effort, and improve data freshness.
**Who You Are**
+ 12+ years of progressive experience in data engineering, data platform architecture, or related roles within a complex, global enterprise, preferably in life sciences/pharma, and 7+ years of senior leadership experience, specifically building, developing, and leading large, global teams of data engineers.
+ Proven track record of successfully designing, implementing, and scaling robust data pipelines and cloud-based data platforms (AWS, Azure, GCP data services) for advanced analytics and AI/ML.
+ Expert-level knowledge of modern data architectures, ETL/ELT, data orchestration, and data quality management.
+ Strong understanding of GxP, data integrity, and data privacy regulations in a manufacturing context.
+ Exceptional strategic thinking, communication, and influencing skills to lead and align diverse stakeholders globally.
+ Bachelor's degree in a relevant technical field required; Master's or advanced certifications are highly advantageous.
Ready for the next step? We look forward to hearing from you. Apply now to discover this exciting opportunity!
**Who we are**
A healthier future drives us to innovate. Together, more than 100'000 employees across the globe are dedicated to advancing science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.
Let's build a healthier future, together.
**Roche is an Equal Opportunity Employer.**
Managing Architect of Data Engineering
Posted today
Job Description
About Bitdeer:
Bitdeer Technologies Group (Nasdaq: BTDR) is a leader in the blockchain and high-performance computing industry. It is one of the world’s largest holders of proprietary hash rate and suppliers of hash rate. Bitdeer is committed to providing comprehensive computing solutions for its customers.
The company was founded by Jihan Wu, an early advocate and pioneer in cryptocurrency who cofounded multiple leading companies serving the blockchain economy. Headquartered in Singapore, Bitdeer has deployed mining datacenters in the United States, Norway, and Bhutan. It offers specialized mining infrastructure, high-quality hash rate sharing products, and reliable hosting services to global users. The company also offers advanced cloud capabilities for customers with high demands for artificial intelligence.
Dedication, authenticity, and trustworthiness are foundational to our mission of becoming the world’s most reliable provider of full-spectrum blockchain and high-performance computing solutions. We welcome global talent to join us in shaping the future.
What you will be responsible for:
We are seeking a highly experienced and technically profound Managing Architect of Data Engineering to lead the strategic design and architectural oversight of our enterprise data platforms. This senior leadership role will be responsible for defining our data architecture vision, setting technical standards, guiding complex architectural decisions, and mentoring a team of data architects and principal engineers. The ideal candidate will possess deep technical expertise in modern data stacks, a strong track record of building scalable data solutions, and exceptional leadership skills to drive architectural excellence across the organization.
How you will stand out:
Education:
Bachelor's or Master's degree in Computer Science, Data Engineering, Information Systems, or a related quantitative field.
Experience:
10+ years of progressive experience in data engineering, software engineering, or related fields, with at least 5 years in a lead architect or principal architect role for large-scale data platforms.
Proven experience as a Managing Architect, Chief Architect, or equivalent senior architectural leadership position, with a track record of successfully defining and implementing enterprise-level data architectures.
Extensive experience designing, building, and operating highly scalable, resilient, and secure data solutions in a cloud-native environment (AWS, Azure, or GCP).
Demonstrated ability to lead and mentor senior technical individual contributors.
Technical Expertise (Deep Mastery Required):
Cloud Platforms: Expert-level proficiency and hands-on experience with core data services across at least one major cloud provider (e.g., AWS: S3, Redshift, Glue, EMR, Kinesis, Athena, MSK, Lake Formation; Azure: Data Lake, Synapse Analytics, Data Factory, Databricks, Event Hubs; GCP: BigQuery, Dataflow, Cloud Storage, Pub/Sub).
Big Data Technologies: Mastery of distributed processing frameworks (e.g., Apache Spark) and real-time streaming technologies (e.g., Apache Kafka, Flink).
Data Warehousing/Lakes/Lakehouse: In-depth expertise in designing and optimizing modern data warehousing, data lake, and data lakehouse architectures (e.g., Snowflake, Databricks Lakehouse Platform, Redshift, Synapse Analytics).
Data Modeling: Expert knowledge of various data modeling techniques (dimensional modeling, Data Vault, 3NF, star/snowflake schemas) for analytical and operational systems.
ETL/ELT & Orchestration: Deep experience with advanced ETL/ELT patterns and orchestration tools.
Data Governance & Security: Proven experience in implementing robust data governance, data quality, data lineage, metadata management, and data security frameworks.
DevOps & MLOps: Strong understanding of CI/CD, IaC (Terraform, CloudFormation), and MLOps principles as applied to data platforms.
Leadership & Soft Skills:
Exceptional strategic thinking, problem-solving, and decision-making abilities in complex, ambiguous environments.
Outstanding communication, presentation, and interpersonal skills, with the ability to influence and engage executive leadership and diverse technical teams.
Proven ability to lead and inspire highly skilled technical professionals, fostering innovation and a culture of continuous improvement.
Demonstrated ability to manage technical debt while driving new architectural initiatives.
Strong business acumen and ability to align technical solutions with business goals.
What you will experience working with us:
A culture that values authenticity and diversity of thoughts and backgrounds;
An inclusive and respectful environment with open workspaces and exciting start-up spirit;
Fast-growing company with the chance to network with industrial pioneers and enthusiasts;
Ability to contribute directly and make an impact on the future of the digital asset industry;
Involvement in new projects, developing processes/systems;
Personal accountability, autonomy, fast growth, and learning opportunities;
Attractive welfare benefits and developmental opportunities such as training and mentoring.
---
Bitdeer is committed to providing equal employment opportunities in accordance with country, state, and local laws. Bitdeer does not discriminate against employees or applicants based on conditions such as race, colour, gender identity and/or expression, sexual orientation, marital and/or parental status, religion, political opinion, nationality, ethnic background or social origin, social status, disability, age, indigenous status, and union.
Data Engineering Traineeship [GRIT@Gov]
Posted today
Job Description
Data Engineering Traineeship at the Civil Aviation Authority of Singapore.
What The Role Is
This traineeship provides hands‐on experience in building and maintaining data pipelines, automating data workflows, and integrating datasets within CAAS’s enterprise data environment. Trainees will learn to apply data engineering principles, including data modeling, transformation, and quality management, to support analytics, AI applications, and data‐driven decision‐making in the aviation domain.
What You Will Be Working On
- Develop and maintain data pipelines to automate data extraction, transformation, and loading into CAAS’s Enterprise Data Management System (EDMS).
- Create scripts and automation tools for data processing, integration, and validation.
- Build dashboards and visualisations to communicate insights and support operational decision‐making.
- Support AI‐driven data use cases, including prompt design, data transformation workflows, and chatbot setup.
What We Are Looking For
We are seeking an entry‐level candidate who is passionate about data engineering and applied AI, and eager to contribute to CAAS’s data transformation efforts. The ideal candidate enjoys problem‐solving, working with data systems, and developing practical solutions to derive actionable insights.
Preferred Qualifications And Experience
- Background in Computer Science, Data Science, Engineering, Information Systems, or a related discipline.
- Basic understanding of data engineering concepts, including ETL processes, data pipelines, and APIs.
- Familiarity with Python and SQL for data extraction, transformation, and automation.
- Exposure to data visualisation tools (e.g. Power BI, Tableau) for building dashboards and reporting insights.
- Interest in AI and data applications, such as prompt design and chatbot development.
- Strong analytical and communication skills, with the ability to work both independently and collaboratively in a team environment.
AI and Data Engineering Lead
Posted 16 days ago
Job Description
The day-to-day activities (Duties and Responsibilities):
- Grow, mentor and lead H3’s AI, data science and data engineering team.
- Ensure that team members are clear about expected standards of performance, and are motivated and developed to provide effective and efficient services.
- Building on H3’s AI roadmap, lead the development of H3’s AI and data science vision, strategy and capability plan.
- Continue to develop H3’s AI data microservices and infrastructure, and integrate the latest data models and data pipelines, establishing a model management approach that is best in class for responsible data use.
- Provide expertise in statistics and probability, and in the evaluation of data use, algorithms and models for internal development.
Work experience/Skills required:
- A Bachelor’s degree in a relevant field (computer science, software engineering, etc.); higher qualifications in computer vision, machine learning, statistics and/or technology are also preferred.
- Significant (5+ years) experience in AI, data science and computer vision.
- Significant experience of recruiting, establishing and managing a data science / AI team.
- Significant experience of developing a data strategy and data governance approach.
- Significant experience of designing, developing, deploying and managing algorithms, models, and model management approaches.
- Significant experience of delivering machine learning and data analysis projects and generating data insights.
- Industry-specific knowledge (Facilities Management, Built Environment, Civil Engineering) is a huge bonus.