Job Description: Data Scientist
We are looking for a self-motivated Data Scientist to work as part of the Document Automation Team. The ideal candidate has experience in natural language processing, information retrieval, question answering, machine learning, deep learning technologies, and predictive and prescriptive analytics.
Duties and Responsibilities:
• Must be comfortable in a fast-paced, constantly evolving and sometimes ambiguous environment working with current and emerging AI technologies
• Deep knowledge and understanding of common methods in information extraction and various cognitive patterns
• Design, train, and implement NLP and ML models for document classification and information extraction using ML libraries
• Analyze large sets of documents; create new algorithmic approaches and enhance existing ones to extract information from documents with a high level of accuracy
• Train and optimize predictive models
• Strong research and problem-solving skills
• Collaborate and work closely with business partners across various business units
• Work as part of a large team comprising employees and offshore contractors
• Ability to work with offshore data scientist contractors on developing ML algorithms as part of a bigger platform
• Educate and mentor fellow team members
Required Competencies
• Experience with cloud-based computing tools (AWS Comprehend, AWS Textract)
• 5+ years of solid hands-on experience developing document-processing algorithms using ML and NLP libraries such as spaCy, NLTK, and Stanford CoreNLP
• 3+ years of strong experience working with structured and unstructured documents
• 5+ years of strong programming experience with languages and tools such as Python, R, JavaScript, Jupyter Notebook, and PyTorch
• Solid understanding of data science fundamentals: NLP, information extraction, reinforcement learning
• Excellent written and verbal communication skills for coordinating across teams, creating PowerPoint presentations, and presenting to a technical business audience
• Experience working with OCR technologies such as ABBYY and Tesseract
• Strong experience working with relational and NoSQL databases
Desired Competencies
• Prior experience working in the mortgage industry is a plus
• Prior experience working with Java is a plus
Educational Requirement
• BS or higher in Computer Science or related technology field
• Relevant professional certifications will be considered evidence that the candidate is on a path of continuous learning.
Mandatory Skillsets: Python, R, AWS Comprehend, AWS Textract, NLP/NLTK, OCR technologies such as ABBYY and Tesseract
General Idea about the requirement:
A data scientist with experience in OCR (Optical Character Recognition) tools and NLP technologies.
The candidate must also have strong experience in data extraction.
At least one recent major project should have involved OCR or NLP.
Notice period: Immediate to 30 days maximum
Shift: Regular day shift (10 AM – 7 PM); may be extended based on requirements.