We are looking for an enthusiastic Data Analyst to work on projects for the Analytics and Innovation team within a large financial organization.
Mandatory Skill(s)
- Degree in Computer Science, Information Systems, Data Science or equivalent;
- Proficiency in writing complex queries for data extraction, transformation, and manipulation;
- Strong knowledge of Python for data processing, including libraries like Pandas, NumPy, and others;
- Experience with PySpark for distributed data processing, especially when dealing with large datasets;
- Experience with Apache Airflow, Dagster, or similar tools for orchestrating and scheduling data workflows;
- Working knowledge of cloud platforms such as AWS, Azure, or Google Cloud;
- Expertise in building and maintaining ETL (Extract, Transform, Load) processes that are scalable, efficient, and error-free;
- Experience deploying batch processes for data extraction, pipelines, and transformation;
- Experience with data visualization tools such as Tableau, Power BI, or libraries in Python (e.g., Matplotlib, Seaborn);
- Strong written and verbal communication skills.
Desirable Skill(s)
- Experience with agile methodologies or similar frameworks for managing project tasks and deliverables;
- Familiarity with containerization technologies like Docker or orchestration platforms like Kubernetes;
- Prior experience with banking clients.
Responsibilities
- Design, build, and maintain robust data pipelines that efficiently transport data from various sources to storage systems;
- Improve data collection processes to capture more relevant, clean, and useful data for analysis;
- Cleanse and preprocess raw data to ensure it is accurate, complete, and in the right format for analysis;
- Perform data validation to ensure the integrity of the dataset and handle any inconsistencies;
- Support the operationalization of data systems, ensuring that data flows seamlessly from collection through processing and storage;
- Perform testing and validation of the systems and pipelines before deploying them into production;
- Work closely with data scientists, software engineers, and business analysts to understand data needs and deliver suitable solutions;
- Provide support in troubleshooting issues, optimizing processes, and continuously improving system performance.
If you are interested in this role, click the “Apply to this job” button below, or send your CV to Sakshi Awasthi at sakshi.a@sciente.com, quoting the job title.