Talent.com
Lead Data Engineer

Lead Data Engineer

North Eastern ServicesQuetta City Tehsil, Balochistan, Pakistan
30+ days ago
Job description

About Fusemachines

Fusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Neb, United States, Canada, and Dominican Republic and more than 450 employees). Fusemachines seeks to bring its global expertise in AI to transform companies around the world.

Location : Remote (Full-time)

About the role

This is a remote full-time position, responsible for designing, building, testing, optimizing and maintaining the infrastructure and code required for data integration, storage, processing, pipelines and analytics (BI, visualization and Advanced Analytics) from ingestion to consumption, implementing data flow controls, and ensuring high data quality and accessibility for analytics and business intelligence purposes. This role requires a strong foundation in programming, and a keen understanding of how to integrate and manage data effectively across various storage systems and technologies.

We’re looking for someone who can quickly ramp up, contribute right away and lead the work in Data & Analytics, helping from backlog definition, to architecture decisions, and lead technical the rest of the team with minimal oversight.

We are looking for a skilled Sr. Data Engineer / Technical Lead with a strong background in Python, SQL, Pyspark, Redshift and AWS cloud‑based large‑scale data solutions with a passion for data quality, performance and cost optimization. The ideal candidate will develop in an Agile environment, and would have GCP experience too, to contribute to the migration from AWS to GCP.

This role is perfect for an individual passionate about leading, leveraging data to drive insights, improve decision‑making, and support the strategic goals of the organization through innovative data engineering solutions.

Qualification / Skill Set Requirement :

  • Must have a full‑time Bachelor's degree in Computer Science, Information Systems, Engineering, or a related field.
  • 5+ years of real‑world data engineering development experience in AWS and GCP (certifications preferred). Strong expertise in Python, SQL, PySpark and AWS in an Agile environment, with a proven track record of building and optimizing data pipelines, architectures, and datasets, and proven experience in data storage, modeling, management, lake, warehousing, processing / transformation, integration, cleansing, validation and analytics.
  • Senior person who can understand requirements and design end‑to‑end solutions with minimal oversight.
  • Strong programming skills in one or more languages such as Python, Scala, and proficient in writing efficient and optimized code for data integration, storage, processing and manipulation.
  • Strong knowledge of SDLC tools and technologies, including project management software (Jira or similar), source code management (GitHub or similar), CI / CD system (GitHub Actions, AWS CodeBuild or similar) and binary repository manager (AWS CodeArtifact or similar).
  • Good understanding of Data Modeling and Database Design Principles, being able to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions.
  • Strong SQL skills and experience working with complex data sets, Enterprise Data Warehouse and writing advanced SQL queries. Proficient with Relational Databases (RDS, MySQL, Postgres, or similar) and Non‑SQL Databases (Cassandra, MongoDB, Neo4j, etc.).
  • Skilled in Data Integration from different sources such as APIs, databases, flat files, event streaming.
  • Strong experience in implementing data pipelines and efficient ELT / ETL processes, batch and real‑time, in AWS and using open source solutions, being able to develop custom integration solutions as needed, including Data Integration from different sources such as APIs (PoS integrations is a plus), ERP (Oracle and Allegra are a plus), databases, flat files, Apache Parquet, event streaming, including cleansing, transformation and validation of the data.
  • Strong experience with scalable and distributed Data Technologies such as Spark / PySpark, DBT and Kafka, to be able to handle large volumes of data.
  • Experience with stream‑processing systems : Storm, Spark‑Streaming, etc. is a plus.
  • Strong experience in designing and implementing Data Warehousing solutions in AWS with Redshift. Demonstrated experience in designing and implementing efficient ELT / ETL processes that extract data from source systems, transform it (DBT), and load it into the data warehouse.
  • Strong experience in orchestration using Apache Airflow.
  • Expert in Cloud Computing in AWS, including deep knowledge of a variety of AWS services like Lambda, Kinesis, S3, Lake Formation, EC2, EMR, ECS / ECR, IAM, CloudWatch, etc.
  • Good understanding of Data Quality and Governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent.
  • Good understanding of BI solutions including Looker and LookML (Looker Modeling Language).
  • Strong knowledge and hands‑on experience of DevOps principles, tools and technologies (GitHub and AWS DevOps) including continuous integration, continuous delivery (CI / CD), infrastructure as code (IaC – Terraform), configuration management, automated testing, performance tuning and cost management and optimization.
  • Good problem‑solving skills : being able to troubleshoot data processing pipelines and identify performance bottlenecks and other issues.
  • Possesses strong leadership skills with a willingness to lead, create ideas, and be assertive.
  • Strong project management and organizational skills.
  • Excellent communication skills to collaborate with cross‑functional teams, including business users, data architects, DevOps / DataOps / MLOps engineers, data analysts, data scientists, developers, and operations teams. Essential to convey complex technical concepts and insights to non‑technical stakeholders effectively.
  • Ability to document processes, procedures, and deployment configurations.

Responsibilities :

  • Design, implement, deploy, test and maintain highly scalable and efficient data architectures, defining and maintaining standards and best practices for data management independently with minimal guidance.
  • Ensuring the scalability, reliability, quality and performance of data systems.
  • Mentoring and guiding junior / mid‑level data engineers.
  • Collaborating with Product, Engineering, Data Scientists and Analysts to understand data requirements and develop data solutions, including reusable components.
  • Evaluating and implementing new technologies and tools to improve data integration, data processing and analysis.
  • Design architecture, observability and testing strategies, and building reliable infrastructure and data pipelines.
  • Takes ownership of storage layer, data management tasks, including schema design, indexing, and performance tuning.
  • Swiftly address and resolve complex data engineering issues, incidents and resolve bottlenecks in SQL queries and database operations.
  • Conduct Discovery on existing Data Infrastructure and Proposed Architecture.
  • Evaluate and implement cutting‑edge technologies and methodologies and continue learning and expanding skills in data engineering and cloud platforms, to improve and modernize existing data systems.
  • Evaluate, design, and implement data governance solutions : cataloging, lineage, quality and data governance frameworks that are suitable for a modern analytics solution, considering industry‑standard best practices and patterns.
  • Define and document data engineering architectures, processes and data flows.
  • Assess best practices and design schemas that match business needs for delivering a modern analytics solution (descriptive, diagnostic, predictive, prescriptive).
  • Be an active member of our Agile team, participating in all ceremonies and continuous improvement activities.
  • Equal Opportunity Employer : Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.

    #J-18808-Ljbffr

    Create a job alert for this search

    Data Engineer • Quetta City Tehsil, Balochistan, Pakistan

    Related jobs
    • Promoted
    People Analytics Data Scientist, AI & Dashboards

    People Analytics Data Scientist, AI & Dashboards

    AccentureQuetta City Tehsil, Balochistan, Pakistan
    A leading consulting firm is seeking a Data Science Consultant to enhance HR analytics through advanced statistical models and data visualization. The ideal candidate will possess strong skills in P...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Remote E-commerce CRO Specialist - Data-Driven Growth

    Remote E-commerce CRO Specialist - Data-Driven Growth

    TalentPop AppQuetta City Tehsil, Balochistan, Pakistan
    A fast-growing e-commerce solutions provider in Pakistan seeks an E-commerce Conversion Specialist to optimize customer journeys from first click to final purchase. With a focus on Conversion Rate O...Show moreLast updated: 12 hours ago
    • Promoted
    Backend Engineer (Go)

    Backend Engineer (Go)

    TarakiQuetta City Tehsil, Balochistan, Pakistan
    The role is fully remote, with UK working hours (i.Join a hyper-growth fintech company that provides cutting‑edge digital asset trading infrastructure for institutional investors.Their unified plat...Show moreLast updated: 23 days ago
    • Promoted
    Remote ML / AI Software Engineer — Build LLM Workflows

    Remote ML / AI Software Engineer — Build LLM Workflows

    Leverify LLCQuetta City Tehsil, Balochistan, Pakistan
    A modern technology company seeks a Software Developer for Machine Learning / AI.The role involves deploying and integrating LLM-powered tools and developing Python-based systems.Ideal candidates sho...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Senior Python Backend Engineer — Remote ML / NLP Pipelines

    Senior Python Backend Engineer — Remote ML / NLP Pipelines

    TarakiQuetta City Tehsil, Balochistan, Pakistan
    A leading technology recruitment service is looking for a Backend Engineer (Python) for a full-time remote position, available during US EST hours. The role involves implementing Python services, ar...Show moreLast updated: 10 hours ago
    • Promoted
    • New!
    Amazon PPC Team Lead – Razab Inc

    Amazon PPC Team Lead – Razab Inc

    RazabQuetta City Tehsil, Balochistan, Pakistan
    Get AI-powered advice on this job and more exclusive features.We are currently seeking an experienced and results-driven. Amazon PPC campaigns across multiple accounts, drive revenue growth, and gui...Show moreLast updated: 12 hours ago
    • Promoted
    • New!
    Systems Integration Lead

    Systems Integration Lead

    DatamaticstechnologiesQuetta City Tehsil, Balochistan, Pakistan
    Revenue Cycle Management (RCM) software solutions into clients’ existing.Enterprise Resource Planning (ERP).Electronic Health Records (EHR). This role demands strong technical expertise, project lea...Show moreLast updated: 10 hours ago
    • Promoted
    Growth Strategy & Ops Analyst (Data-Driven)

    Growth Strategy & Ops Analyst (Data-Driven)

    MotiveQuetta City Tehsil, Balochistan, Pakistan
    A leading operational technology firm in Pakistan is seeking a Sales Operations professional with 1-3 years of relevant experience. The role involves maintaining data accuracy, building dashboards, ...Show moreLast updated: 1 day ago
    • Promoted
    Remote Full Stack Engineer - Python, Kubernetes, AWS (EST)

    Remote Full Stack Engineer - Python, Kubernetes, AWS (EST)

    TarakiQuetta City Tehsil, Balochistan, Pakistan
    A funded US startup is hiring a remote Full Stack Engineer (Python, Kubernetes, AWS) to lead backend service migration to a microservices architecture. Responsibilities include architecting scalable...Show moreLast updated: 1 day ago
    • Promoted
    Remote DevOps & QA Engineer : Build Reliable Cloud Pipelines

    Remote DevOps & QA Engineer : Build Reliable Cloud Pipelines

    AHOYQuetta City Tehsil, Balochistan, Pakistan
    A leading technology firm is seeking a Remote DevOps and QA Engineer to manage cloud infrastructure and ensure software quality. The ideal candidate has at least three years of experience in DevOps,...Show moreLast updated: 1 day ago
    • Promoted
    Remote Social Media Lead Gen Manager

    Remote Social Media Lead Gen Manager

    Leading EdgeQuetta City Tehsil, Balochistan, Pakistan
    A dynamic marketing agency is seeking a talented individual to manage and respond to leads from social media platforms such as Instagram and Facebook. The ideal candidate will implement strategies t...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Remote Senior Software Engineer : Spec‑Driven & AI‑Led

    Remote Senior Software Engineer : Spec‑Driven & AI‑Led

    CrossoverQuetta City Tehsil, Balochistan, Pakistan
    A leading software firm is seeking a Senior Software Engineer who will lead system design and implementation using AI coding tools. This remote position offers a competitive salary of $100,000 per y...Show moreLast updated: 12 hours ago
    • Promoted
    Remote Data Engineer : Build Scalable Data Platform

    Remote Data Engineer : Build Scalable Data Platform

    Enabling Qapital LtdQuetta City Tehsil, Balochistan, Pakistan
    A leading impact asset manager is seeking a Data Engineer to manage data acquisition, governance, and analytics.This remote role requires strong software engineering and SQL skills, along with expe...Show moreLast updated: 1 day ago
    • Promoted
    Senior Database Engineer : HA / DR & Cloud Solutions

    Senior Database Engineer : HA / DR & Cloud Solutions

    NXT LABSQuetta City Tehsil, Balochistan, Pakistan
    A technology firm in Pakistan is seeking an experienced Database Administrator to manage and optimize their database systems. The role includes responsibilities such as configuring MySQL and Postgre...Show moreLast updated: 1 day ago
    • Promoted
    Full Stack Engineer, LearnWith.AI (Remote) - $100,000 / year USD

    Full Stack Engineer, LearnWith.AI (Remote) - $100,000 / year USD

    CrossoverQuetta City Tehsil, Balochistan, Pakistan
    Full Stack Engineer, LearnWith.AI (Remote) - $100,000 / year USD.This range is provided by Crossover.Your actual pay will be based on your skills and experience — talk with your recruiter to learn mo...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer

    Data Engineer

    Enabling Qapital LtdQuetta City Tehsil, Balochistan, Pakistan
    EQ) is a FINMA‑regulated, leading Swiss impact asset manager dedicated to a world where investments generate financial, social, environmental, and economic returns. EQ currently manages USD 750 mill...Show moreLast updated: 1 day ago
    • Promoted
    Senior C# AI-Driven Cloud-Native Engineer (Remote)

    Senior C# AI-Driven Cloud-Native Engineer (Remote)

    CrossoverQuetta City Tehsil, Balochistan, Pakistan
    A global technology firm is seeking a Senior C# Developer to spearhead the development of AI-powered systems and cloud-native solutions. In this role, you will leverage modern software practices to ...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    AI & Data Platform Product Marketing Lead

    AI & Data Platform Product Marketing Lead

    VIDIZMO LLCQuetta City Tehsil, Balochistan, Pakistan
    A leading technology firm is seeking a Product Marketing Specialist to drive effective product positioning and go-to-market strategies. The ideal candidate will have a Bachelor's degree and 4–7 year...Show moreLast updated: 12 hours ago