We are seeking a skilled and detail-oriented Data Scientist to join our team. In this role, you will be responsible for the full data lifecycle — from collection and cleaning to analysis and modeling. You will play a critical role in building robust datasets, designing performance metrics, and supporting the development and evaluation of AI models and intelligent agents.
Key Responsibilities :
- Data Collection & Preprocessing :
- Collect, clean, and preprocess structured and unstructured data from multiple sources to ensure data quality and integrity.
- Dataset Development :
- Design and build high-quality datasets for training, validation, and testing of AI / ML models and agents.
- Model Evaluation : Develop and implement performance metrics to assess and benchmark the effectiveness of AI models and intelligent systems.
- Data Analysis & Insights : Perform exploratory data analysis (EDA) and statistical analysis to extract meaningful insights that guide modeling efforts.
- Collaboration : Work closely with machine learning engineers, product teams, and researchers to iterate on data needs and optimize model performance.
Qualifications :
Bachelor's or Master’s degree in Data Science, Computer Science, Statistics, or a related field.Proven experience in data cleaning, preprocessing, and analysis.Proficiency in Python and relevant data science libraries (e.g., Pandas, NumPy, Scikit-learn).Experience with building and managing datasets for machine learning.Strong understanding of evaluation metrics for classification, regression, and generative models.Familiarity with database systems and data querying (e.g., SQL).Excellent problem-solving skills and attention to detail.Preferred Qualifications :
Experience working with AI agents or LLM-based systems.Knowledge of data versioning tools and ML pipeline frameworks.Prior experience in research environments or cross-functional product teams.#J-18808-Ljbffr