Job Description
- The Python Data Scientist will write scripts to query and manipulate large data sets. S/he will support the data team in the automation of the collection and analysis of data. S/he will extract data from different sources, analyze it in the best way possible and draw inferences from them. S/he may build specific AI tools for them to automate certain processes in the company.
Responsibilities:
- Defines product objectives by gathering and analyzing user requirements; envisioning system features and functionality.
- Recommends data solutions by exploring the advantages and benefits of existing development against alternative development.
- Be involved in regular testing of data scraping scripts and software products; unit testing and integration testing.
- Writes and updates software test processes and procedures.
- Coordinates (or partakes in) the coding/programming of software products and solutions with version control systems such as git.
- Develop and maintain Python scripts or applications to perform data scraping using the best available libraries in Python.
- Development and maintenance of data pipeline processes.
- Perform Data Analysis, mining and visualization using relevant Python libraries.
- Developing data models and algorithms best suited to a particular scenario.
- Employ sophisticated analytics programs, machine learning and statistical methods to prepare data for use in predictive and prescriptive modeling.
- Develop and maintain Natural Language processing applications using python’s NLP libraries like Spacy, NLTK, HuggingFace, LangChain etc.
- Write and implement test scripts for python programs.
If the lead developer for a product or solution, he/she will do the following;
- Breaks down tasks into executable units,
- Assigns tasks to team members and allocates time,
- Manages product features – backlog manager,
- Responsible for managing and enforcing coding style and collaboration standards,
- Supports and coordinates team members with appropriate review and retrospectives.
- Involves in code review and optimization, and implements reviews made on products.
- Write reusable and testable codes.
Qualification and Skills:
- A degree in Computer Science, Information Science, Mathematics or any closely related field is required.
- Three (3) years of Sound knowledge of python libraries like Pandas, Request, NumpPy, MatPlotLib etc.
- Good knowledge of relational and structural databases and how to create database schemas that represent and support business processes.
- Understands the design principles behind a scalable application.
- Sound knowledge of version control tools such as git.
- Strong knowledge of web server exploits and their solutions.
- Passion for best design and coding practices, and a desire to develop new interesting ideas.
- Top-notch programming skills and in-depth knowledge of modern data science/machine learning and natural language processing technologies.
- Teamwork.
- Good communication skills.
- Highly organized and pays extraordinary attention to detail.
- Must have the ability to act quickly and pragmatically under pressure, prioritize and resolve issues.
- Enthusiasm and high level of motivation.