Lead Data Scientist
Data Science, Python, NLP Preprocessing
Hyderabad, Bangalore, Pune, Chennai, Gurgaon
We are seeking a highly skilled Lead Data Scientist to join our team and lead a project focused on developing novel data science models and machine learning algorithms to solve complex problems.
In this role, you will be responsible for designing, developing, and implementing data science models and machine learning algorithms, including data pre-processing, training, optimization, and evaluation of models. You will work collaboratively with data engineers and data scientists to collect, pre-process, and curate datasets suitable for training, and train and fine-tune machine learning/deep learning models using large-scale datasets and distributed computing frameworks.
Responsibilities
- Design, develop, and implement novel data science models and machine learning algorithms that solve complex problems
- Collaborate with data engineers and data scientists to collect, pre-process, and curate datasets suitable for training
- Train and fine-tune machine learning/deep learning models using large-scale datasets and distributed computing frameworks. Optimize models for performance, efficiency, and scalability
- Design and conduct experiments to evaluate the performance, robustness, and generalization of machine learning/deep learning models. Use appropriate metrics and statistical analysis to measure and interpret results
- Prepare technical documentation, including model architecture, implementation details, and experimental results. Communicate findings, insights, and recommendations to both technical and non-technical stakeholders
Requirements
- Bachelor’s or master’s degree in Computer Science, Data Science, Statistics, or a related field
- 8 to 12 years of experience required
- Solid foundation in Machine Learning, Deep Learning, Computer Vision, NLP
- Proficiency in Python
- Experience with deep learning frameworks like TensorFlow, PyTorch, Keras, JAX, etc.
- General understanding of data structures, algorithms, multi-threaded programming, and distributed computing concepts
- Experience with pandas, scikit-learn, matplotlib, spaCy, statsmodels, etc.
- Knowledge of statistical and algorithmic models as well as of fundamental mathematical concepts, such as linear algebra and probability
- Familiarity with cloud services (AWS, Google Cloud, Azure)
- Experience in Natural Language Processing (NLP) preprocessing techniques
- Excellent written and verbal communication skills
- English proficiency at B2 level or higher