Talent.com
Lead Data Engineer

Lead Data Engineer

FusemachinesGujrat Division, Punjab, Pakistan
30+ days ago
Job description

About Fusemachines

Fusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic) and more than 450 employees, Fusemachines seeks to bring its global expertise in AI to transform companies around the world.

Location

Remote (Full-time)

Role Overview

This is a remote full-time position, responsible for designing, building, testing, optimizing and maintaining the infrastructure and code required for data integration, storage, processing, pipelines and analytics (BI, visualization and Advanced Analytics) from ingestion to consumption, implementing data flow controls, and ensuring high data quality and accessibility for analytics and business intelligence purposes. This role requires a strong foundation in programming, and a keen understanding of how to integrate and manage data effectively across various storage systems and technologies.

We\'re looking for someone who can quickly ramp up, contribute right away and lead the work in Data & Analytics, helping from backlog definition, to architecture decisions, and lead technical the rest of the team with minimal oversight.

Responsibilities

  • Design, implement, deploy, test and maintain highly scalable and efficient data architectures, defining and maintaining standards and best practices for data management independently with minimal guidance
  • Ensuring the scalability, reliability, quality and performance of data systems
  • Mentoring and guiding junior / mid-level data engineers
  • Collaborating with Product, Engineering, Data Scientists and Analysts to understand data requirements and develop data solutions, including reusable components
  • Evaluating and implementing new technologies and tools to improve data integration, data processing and analysis
  • Design architecture, observability and testing strategies, and building reliable infrastructure and data pipelines
  • Takes ownership of storage layer, data management tasks, including schema design, indexing, and performance tuning
  • Swiftly address and resolve complex data engineering issues, incidents and resolve bottlenecks in SQL queries and database operations
  • Conduct Discovery on existing Data Infrastructure and Proposed Architecture
  • Evaluate and implement cutting-edge technologies and methodologies and continue learning and expanding skills in data engineering and cloud platforms, to improve and modernize existing data systems
  • Evaluate, design, and implement data governance solutions : cataloging, lineage, quality and data governance frameworks that are suitable for a modern analytics solution, considering industry-standard best practices and patterns.
  • Define and document data engineering architectures, processes and data flows
  • Assess best practices and design schemas that match business needs for delivering a modern analytics solution (descriptive, diagnostic, predictive, prescriptive)
  • Be an active member of our Agile team, participating in all ceremonies and continuous improvement activities

Qualifications

  • Must have a full-time Bachelor's degree in Computer Science Information Systems, Engineering, or a related field
  • 5+ years of real-world data engineering development experience in AWS and GCP (certifications preferred). Strong expertise in Python, SQL, PySpark and AWS in an Agile environment, with a proven track record of building and optimizing data pipelines, architectures, and datasets, and proven experience in data storage, modeling, management, lake, warehousing, processing / transformation, integration, cleansing, validation and analytics
  • Senior person who can understand requirements and design end to end solutions with minimal oversight
  • Strong programming Skills in one or more languages such as Python, Scala, and proficient in writing efficient and optimized code for data integration, storage, processing and manipulation
  • Strong knowledge SDLC tools and technologies, including project management software (Jira or similar), source code management (GitHub or similar), CI / CD system (GitHub actions, AWS CodeBuild or similar) and binary repository manager (AWS CodeArtifact or similar)
  • Good understanding of Data Modeling and Database Design Principles. Being able to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions
  • Strong SQL skills and experience working with complex data sets, Enterprise Data Warehouse and writing advanced SQL queries. Proficient with Relational Databases (RDS, MySQL, Postgres, or similar) and NonSQL Databases (Cassandra, MongoDB, Neo4j, etc.)
  • Skilled in Data Integration from different sources such as APIs, databases, flat files, event streaming.
  • Strong experience in implementing data pipelines and efficient ELT / ETL processes, batch and real-time, in AWS and using open source solutions, being able to develop custom integration solutions as needed, including Data Integration from different sources such as APIs (PoS integrations is a plus), ERP (Oracle and Allegra are a plus), databases, flat files, Apache Parquet, event streaming, including cleansing, transformation and validation of the data
  • Strong experience with scalable and distributed Data Technologies such as Spark / PySpark, DBT and Kafka, to be able to handle large volumes of data
  • Experience with stream-processing systems : Storm, Spark-Streaming, etc. is a plus
  • Strong experience in designing and implementing Data Warehousing solutions in AWS with Redshift. Demonstrated experience in designing and implementing efficient ELT / ETL processes that extract data from source systems, transform it (DBT), and load it into the data warehouse
  • Strong experience in Orchestration using Apache Airflow
  • Expert in Cloud Computing in AWS, including deep knowledge of a variety of AWS services like Lambda, Kinesis, S3, Lake Formation, EC2, EMR, ECS / ECR, IAM, CloudWatch, etc
  • Good understanding of Data Quality and Governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent
  • Good understanding of BI solutions including Looker and LookML (Looker Modeling Language)
  • Strong knowledge and hands-on experience of DevOps principles, tools and technologies (GitHub and AWS DevOps) including continuous integration, continuous delivery (CI / CD), infrastructure as code (IaC – Terraform), configuration management, automated testing, performance tuning and cost management and optimization
  • Good Problem-Solving skills : being able to troubleshoot data processing pipelines and identify performance bottlenecks and other issues
  • Possesses strong leadership skills with a willingness to lead, create Ideas, and be assertive
  • Strong project management and organizational skills
  • Excellent communication skills to collaborate with cross-functional teams, including business users, data architects, DevOps / DataOps / MLOps engineers, data analyst, data scientists, developers, and operations teams. Essential to convey complex technical concepts and insights to non-technical stakeholders effectively
  • Equal Opportunity Employer

    Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.

    #J-18808-Ljbffr

    Create a job alert for this search

    Data Engineer • Gujrat Division, Punjab, Pakistan

    Related jobs
    • Promoted
    Data Engineer

    Data Engineer

    Creative ChaosSialkot, Punjab, Pakistan
    We are seeking a highly skilled Data Engineer with extensive experience in Azure Data Lake to contribute to our dynamic team at Creative Chaos. The ideal candidate will be responsible for architecti...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    CreativechaosGujrat, Punjab, Pakistan
    We are seeking a highly skilled Data Engineer with extensive experience in Azure Data Lake to contribute to our dynamic team at Creative Chaos. The ideal candidate will be responsible for architecti...Show moreLast updated: 30+ days ago
    • Promoted
    Software Engineer, Data Infrastructure & Acquisition - Asia

    Software Engineer, Data Infrastructure & Acquisition - Asia

    SpeechifySialkot, Punjab, Pakistan
    Software Engineer, Data Infrastructure & Acquisition - Asia.Software Engineer, Data Infrastructure & Acquisition - Asia.Software Engineer, Data Infrastructure & Acquisition - Asia.Software Engineer...Show moreLast updated: 30+ days ago
    • Promoted
    Databricks Engineer - Remote - Pakistan

    Databricks Engineer - Remote - Pakistan

    FullStack LabsSialkot, Punjab, Pakistan
    FullStack is the most transparent IT talent network, connecting highly skilled individuals with top global companies and Silicon Valley startups for remote, on‑demand projects.We focus on building ...Show moreLast updated: 10 days ago
    • Promoted
    Data Engineer - M&A (Remote)

    Data Engineer - M&A (Remote)

    360trainingGujrat Division, Punjab, Pakistan
    Over the years, we have continued to grow our expansive library of regulatory‑approved training courses with new content suited for today’s modern workforce. By offering these courses online, all 36...Show moreLast updated: 19 days ago
    • Promoted
    Lead Generation Specialist

    Lead Generation Specialist

    Mad Tech HeadsGujrat, Pakistan
    Job Description : Madtechheads is a UK-based remote company established in 2022, specializing in web development and accounting solutions. Committed to quality work and a positive work environment, ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Data Engineer

    Senior Data Engineer

    traxccelSialkot, Punjab, Pakistan
    We are looking for a highly experienced and motivated Azure Data Engineer to join our team.The successful candidate will have a minimum of 5 years of hands‑on experience in Azure and Databricks arc...Show moreLast updated: 20 hours ago
    • Promoted
    Technical Architect – Data & Analytics Platform

    Technical Architect – Data & Analytics Platform

    veraqorGujrat Division, Punjab, Pakistan
    This role is a technical, customer facing role that is accountable for the end-to-end customer.When you are working with us, you will : . Data and BI strategy workshops, solution briefings, technical ...Show moreLast updated: 14 days ago
    • Promoted
    • New!
    Senior Data Engineer - 100% Remote

    Senior Data Engineer - 100% Remote

    Hyly.AIGujrat Division, Punjab, Pakistan
    AI is the multifamily industry’s premier Intelligence Fabric™, combining Artificial, Business, and Human intelligence into one unified operating system for growth. The platform transforms raw data i...Show moreLast updated: 20 hours ago
    • Promoted
    • New!
    AI Engineer

    AI Engineer

    Weel Technologies IncGujrat Division, Punjab, Pakistan
    Get AI-powered advice on this job and more exclusive features.We are seeking a highly skilled.API architecture, and applied data science. This role requires a deep understanding of.As part of our ad...Show moreLast updated: 20 hours ago
    • Promoted
    Lead Generation Specialist

    Lead Generation Specialist

    TryonnixSialkot, Punjab, Pakistan
    English and a proven background in software sales.This role is pivotal in expanding our client base through lead finding, lead generation, and cold calling. The ideal candidate will have a strong sa...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Burq, Inc.Sialkot, Punjab, Pakistan
    Burq started with an ambitious mission : how can we turn the complex process of offering delivery into a simple turnkey solution. It’s a big mission and now we want you to join us to make it even big...Show moreLast updated: 30+ days ago
    • Promoted
    Azure Data Integration Engineer (DP600 / 700)

    Azure Data Integration Engineer (DP600 / 700)

    ITC WorldwideGujrat, Pakistan
    Azure Data Integration Engineer (DP600 / 700).Overview The role involves building and managing data pipelines, troubleshooting issues, and ensuring data accuracy across platforms such as Azure Synaps...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer (Amazon Redshift) - Remote

    Senior Data Engineer (Amazon Redshift) - Remote

    HR WaysGujranwala Division, Punjab, Pakistan
    Senior Data Engineer (Amazon Redshift) - Remote.Remote from anywhere in Pakistan.Our Client company is Saudi based Leading E-Commerce shopping distance, where fashion meets beauty, offering a uniqu...Show moreLast updated: 22 days ago
    • Promoted
    Senior Lead Engineer

    Senior Lead Engineer

    Solace Ltd.Gujrat, Punjab, Pakistan
    Medimap is on a mission to make healthcare more convenient and accessible for all Canadians.Our platform serves as a vital bridge between patients seeking care and the clinics providing it.By impro...Show moreLast updated: 30+ days ago
    • Promoted
    Data Architect

    Data Architect

    Creative ChaosSialkot, Punjab, Pakistan
    As a Data Architect at Creative Chaos, you will be responsible for designing, building, and managing the data architecture of our organization. You will work closely with stakeholders, data scientis...Show moreLast updated: 30+ days ago
    • Promoted
    Microsoft Azure and AI Practice Lead

    Microsoft Azure and AI Practice Lead

    Global ITSGujrat Division, Punjab, Pakistan
    As a Microsoft Azure and AI Practice Lead at Global iTS, a Microsoft Inner Circle partner for Business Applications, you will spearhead the strategy, growth, and delivery of Azure cloud and AI solu...Show moreLast updated: 6 days ago
    • Promoted
    Lead Data Engineer

    Lead Data Engineer

    FusemachinesSialkot, Punjab, Pakistan
    Fusemachines is a leading AI strategy, talent, and education services provider.Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI.With a presenc...Show moreLast updated: 30+ days ago