Data Scientist - NLP
Data Scientist
At Sogeti, we believe the best is inside every one of us. Whether you are early in your career or at the top of your game, we’ll encourage you to fulfill your potential to be better. Through our shared passion for technology, our entrepreneurial culture, and our focus on continuous learning, we’ll provide everything you need to become the best you can be.
YOUR ROLE
- Design, develop and monitor NLP-based data pipelines for large-scale textual data (web data, job advertisements, unstructured text), from data acquisition to statistical outputs, using Python and R.
- Develop, train and evaluate NLP and Machine Learning models (information extraction, text classification, topic modeling, skill extraction) to transform unstructured text into actionable insights.
- Perform advanced statistical analysis and data exploration on large textual datasets using R (RStudio) and Python, ensuring data quality, robustness and reproducibility.
- Contribute to the improvement and optimisation of data production workflows, including model performance monitoring, documentation and auditability.
- Collaborate with multidisciplinary teams (data engineers, statisticians, domain experts) and contribute to technical documentation, knowledge sharing and user communication.
- Support the deployment, validation and continuous improvement of NLP services within modular, scalable and production-ready architectures.
YOUR PROFILE
- Master’s degree in Data Science, Computer Science or Statistics, with a minimum of 7 years of professional experience.
- Advanced programming skills in Python and R, including data processing, statistical analysis, machine learning and model evaluation.
- Solid knowledge of NLP techniques and frameworks (text parsing, classification, information extraction, language models) and their application at scale.
- Good understanding of Machine Learning concepts, including training, testing, performance assessment and model monitoring.
- Experience working with large and complex datasets, ensuring data quality, traceability and reproducibility of analyses.
- Excellent communication skills, able to explain complex data science and NLP concepts to both technical and non-technical stakeholders, in an international environment.
- Fluent in English and in French
WHAT YOU’LL LOVE ABOUT WORKING HERE?
At Sogeti, we have a modern, human centric approach to promoting, assessing and ensuring future high performance. We support and guide you at every step of your career with continuous feedback and conversations. We’ll always make space to check in and discuss your growth and readiness for promotion.
We encourage flexibility when it comes to when and where people get their work done, allowing a better work-life balance and greater empowerment. Employees work with their managers to determine an arrangement that works best for their role and personal circumstances.
We offer access to one of the best digital learning packages in the market to improve both your soft and hard skills. Encouraging you to be curious and accountable for your skills development. Our learning offers are world class and free to every employee across the globe. Access all learning assets and curated programs from business experts, plus content from leading partners including Harvard Business Review, Coursera, Pluralsight, Udemy, Microsoft, AWS, Google and many more.
Bertrange, LU