Tanzania Communications Regulatory Authority (TCRA)
Dar es Salaam, Dodoma
POST: ICT OFFICER II (DATA SCIENTIST) – 3 POSTS
EMPLOYER: Tanzania Communications Regulatory Authority (TCRA)
APPLICATION TIMELINE: 2024-06-07 to 2024-06-20
DUTIES AND RESPONSIBILITIES
- Designing, implementing, and managing the collection and preprocessing of structured and unstructured big data from various sources such as databases, APIs, streaming platforms, and files.
- Analyzing and handling large volumes of data using frameworks like Apache Hadoop and Apache Spark to distribute data processing tasks across multiple nodes.
- Designing and maintaining robust Extract, Transform, Load (ETL) pipelines to ensure smooth data flow and integration from various sources.
- Optimizing data processing pipelines for performance and cost-effectiveness, utilizing technologies such as Hadoop, Spark, and other open-source technologies.
- Integrating disparate datasets from different sources, formats, and schemas, maintaining data lineage and metadata management.
- Applying appropriate machine learning algorithms and models to extract useful information from large datasets and identify patterns, trends, and relationships.
- Collaborating with cross-functional teams including data analysts and business stakeholders to understand data requirements and ensure data accessibility and usability.
- Designing and implementing scalable data architectures and storage solutions to accommodate the volume, variety, and velocity of big data, leveraging technologies such as HDFS and OLAP (Online Analytical Processing) databases.
- Defining data partitioning, indexing, and compression strategies to optimize storage efficiency and query performance.
- Establishing and enforcing data governance policies, standards, and best practices to ensure data privacy, security, and compliance with laws and regulations.
- Implementing access controls, encryption, and auditing mechanisms to protect sensitive data and mitigate risks of data breaches or unauthorized access.
- Monitoring data pipelines and systems for performance, availability, and reliability, proactively identifying and resolving issues to minimize downtime and data loss.
- Conducting regular maintenance tasks such as data backups, system upgrades, and capacity planning to ensure the stability and scalability of the infrastructure.
- Assisting in developing and updating technical documentation.
- Performing other related duties as may be assigned by the Supervisor.
QUALIFICATION AND EXPERIENCE
- Holder of a Bachelor’s Degree in one of the following fields: Computer Science, Electronic Science, Computer Engineering, Information Technology, Information Systems, Data Science, or equivalent qualifications from a recognized institution.
REMUNERATION: TCRAS 6