SGS is looking for an experienced and innovative Sr. Data Scientist to join our team to help create the next generation of artificial intelligence solutions that will help our customers make well-informed decisions and support critical missions. At SGS, you will immerse yourself in cutting-edge research and work with the latest technologies to deliver value in the Industrial IoT and Defense spaces.
Responsibilities:
- Building models to solve specific problems
- Processing, cleansing, and verifying the integrity of data used for analysis
- Feature engineering using various techniques for the enhancement of data
- Performing feature selection on original and generated dat
- Using machine learning tools to develop and train models
- Performing efficacy testing of the models
- Building automated tools that enable the data scientist to more effectively perform tasks such as data cleaning, feature generation, feature selection, or model building
- Performing ad-hoc analysis and presenting results in a clear manner
- Working with a team to help solve new, never-before-solved challenges across multiple industries
- Presenting concepts and findings to non-technical audiences, such as company leadership or our customers
Required Qualifications:
- US Citizenship
- Understanding and experience using machine learning techniques and algorithms, including but not limited to: transformers, clustering, tree-based methods, neural networks, anomaly detection and more
- Hands-on experience with data wrangling, feature engineering, model building, evaluation, and visualization.
- Fluency in Python programming and related machine learning tools (such as NumPy, Pandas, scikit-learn, NLTK, and SpaCy), deep learning frameworks (such as PyTorch), and SQL-like query languages
- Good applied statistics skills, such as distributions, statistical testing, etc.
- Experience with both graph and vector databases. ArangoDB and Elastic or similar
- Familiarity with advanced NLP methods, such as large language models (LLMs), retrieval augmented generation (RAG), fine-tuning or domain-adaptation of NLP models using pre-trained LLMs (such as Hugging Face).
- Familiarity with document parsing techniques and optical character recognition tools (Tesseract).
- Graduate degree (or equivalent industry experience), in Computer Science, Statistics, Physics, Mathematics, Neuroscience, Linguistics, Electrical Engineering, Economics, or a related scientific discipline
Preferred Qualifications:
- Masters’ degree
- 5+ years’ experience with AI/ML solutions
- Experience working on supply chain related use cases to help support SGS’s readiness campaigns.
- TS/SCI security clearance
Clearance:
Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information.