PYGIO’s aim is to build the best Data Science team in South Africa. We work with some of the largest enterprises in Africa, as well as a diverse range of other SMEs and startups. This individual is inspired to work with the PYGIO founding team, and wants to carve their own path, in an environment that enables high-growth and learning. It is important that this person has a strong sense of urgency about their work, a willingness to learn and takes initiative. If you are enterprising and have an entrepreneurial spirit, don’t waste time – apply now!
Required skills:
- Excellent communication, analytical skills, and decision-making ability in collaborative environments
- Work with stakeholders throughout the organization to identify opportunities for leveraging company data to drive business solutions
- Mine and analyse data from company databases to drive optimization and improvement of product development, marketing techniques, and business strategies.
- Assess the effectiveness and accuracy of new data sources and data-gathering techniques.
- Develop custom data models and algorithms to apply to data sets.
- Use predictive modeling to increase and optimize customer experiences, revenue generation, ad targeting, and other business outcomes.
Must-have skills:
- Experience using Python & SQL (optional) to manipulate data and draw insights from large data sets
- Open source data science libraries and packages (Pandas, PySpark, Dask, Numpy, Scikit-learn, Tensorflow/PyTorch, Huggingface Transformers, OpenCV)
- Experience creating and using advanced machine learning algorithms and statistics using traditional ML (regression, simulation, scenario analysis, time series forecasting, clustering, decision trees) and DL (neural networks) methods
- Strong knowledge and experience in data cleaning, transformation and standardisation techniques (text mining, database record linkage, log data etc).
- Strong understanding of version control and related concepts and techniques (e.g. Git)
- Strong understanding of containerization technologies (e.g. Docker)
Nice-to-have skills:
- Experience setting up and maintaining production Data and MLOps pipelines (training, inference, model monitoring etc).
- Experience using AWS services: Redshift, S3, Lambda functions, Kinesis, Glue, SageMaker, etc.
- Excellent experience with the open-source relational database management system, eg PostgreSQL, MySQL, MS SQL Server.
- Bonus: Understanding of Python web app frameworks (FastAPI).