[Remote] Data Scientist-Python Libraries
Note: The job is a remote job and is open to candidates in USA. Mastech Digital is a provider of digital and mainstream technology staff and services for American Corporations. They are currently seeking a Data Scientist-Python Libraries to develop and maintain Python modules for text parsing and implement NLP techniques to process unstructured data.
Responsibilities
• Develop and maintain Python modules for text parsing, cleaning, and extraction.
• Implement NLP and text analytics techniques to process unstructured data into structured outputs.
• Integrate external APIs, open-source libraries, and cloud services into data workflows.
• Write robust code with error handling and exception management for data pipelines.
• Build utilities for rule-based text extraction, normalization, and transformation.
• Document workflows, experiments, and code in a structured manner.
Skills
• 2-5 years of experience in Python-based development.
• Strong knowledge of NLP libraries and text analytics (spaCy, NLTK, regex, transformers).
• Familiarity with data parsing, unstructured data processing, and extraction frameworks.
• Experience with external APIs and JSON/structured data handling.
• Solid understanding of error handling and debugging practices in Python.
• Strong analytical skills with ability to work on unstructured datasets.
• Minimum 7+ years of experience.
• Local Preferred: Yes
Education Requirements
• Bachelor's degree in Computer Science, Data Science, Engineering, or related field.
Benefits
• Medical, Dental (Including Ortho) & Vision Insurance (Option to Enroll)
• Paid Leaves (Wherever applicable)
• Life & Disability Coverage (Upon eligibility)
• 401K Option, Education Assistance Program and more
Company Overview
• Welcome to Jobs via Dice, the go-to destination for discovering the tech jobs you want. It was founded in undefined, and is headquartered in , with a workforce of 0-1 employees. Its website is https://www.dice.com.
Apply tot his job
Apply To this Job