Analyst – Data Scientist

November 10, 2022

Job Overview

Job Description

ABOUT UPL:

UPL is focused on emerging as a premier global provider of total crop solutions designed to secure the world’s long-term food supply. Winning farmers’ hearts across the globe, while leading the way with innovative products and services that make agriculture sustainable, UPL is the fastest-growing company in the industry. Our successes in the field add up to powerful financials. UPL delivers results from protecting crops that translate into attractive investor value. Based on the recognition that humankind is one community, UPL’s overarching commitment is to improve areas of its presence, workplace, and customer engagement.

 

Our purpose is ‘OpenAg’: an open agriculture network that feeds sustainable growth for all. No limits, no borders.

JOB PURPOSE: The Data Scientist is an emerging role in UPL’s Digital team and will play a pivotal part in operationalizing the most critical data and analytics initiatives for UPL’s digital business. The Data Scientist will work with the Global Digital team to build, maintain, and optimize data pipelines for key data consumers, including business / data analysts and data scientists, covering our digital and physical channels and value chain. This role requires working both creatively and collaboratively with Digital / IT and the wider business. It will involve evangelizing effective data management practices and promoting a better understanding of data and analytics. The Data Scientist will also be tasked with working with key business stakeholders, Digital experts, and subject-matter experts to plan and deliver optimal enterprise data assets.

 

Job Responsibilities:

  • Architect, build, and maintain data pipelines that provision high-quality data ready for analysis. This includes ingestion, exploration, modelling, and curation of high-value data.
  • Use innovative and modern tools, techniques, and architectures to partially or completely automate the most common, repeatable, and tedious data preparation and integration tasks, in order to minimize manual and error-prone processes and improve productivity with the following processes:
  • The data scientist should be curious and knowledgeable about new data initiatives and how to address them. This includes applying their data and/or domain understanding in addressing new data requirements, and establishing efficient design and programming patterns for scientists as well as for non-technical partners.
  • Participate in ensuring compliance and governance during data use.
  • Build data expertise, act as a data owner for the company, and manage complex data systems for a product or a group of products. He / She will perform all of the necessary data transformations to serve products that empower data-driven decision-making. He / She needs to understand the analytical objectives in order to make logical recommendations and drive informed actions.
  • Work with a team of high-performing analytics and data science professionals, and with cross-functional teams, to identify business opportunities and optimize product performance or go-to-market strategy. He / She will engage with internal platform teams to prototype and validate tools developed in-house to derive insight from very large datasets or automate complex algorithms. The data scientist contributes to innovations that fuel UPL’s vision and mission.

 

 

REQUIRED EDUCATION AND EXPERIENCE:

  • Education and Experience

A bachelor's or master's degree in computer science, statistics, applied mathematics, data management, information systems, information science or a related quantitative field is required.

 

  • Technical Knowledge/Skills
  • At least 2-3 years of experience with advanced analytics tools for object-oriented/object function scripting using languages such as Python, Scala, or similar.
  • Proficiency with Python and basic machine learning libraries such as scikit-learn and pandas, as well as NLP and deep learning frameworks such as TensorFlow or Keras.
  • Strong ability to design, build and manage data pipelines in PySpark and related technologies for data structures encompassing data transformation, data models, schemas, metadata, and workload management.
  • The ability to work with both Digital and business teams in integrating analytics and data science output into business processes and workflows.
  • Exposure to machine learning with both supervised and unsupervised models, experience with AWS ML platforms, and the ability to build, train, and deploy models using Amazon SageMaker.
  • Experience with distributed data systems such as Hadoop and related technologies (Spark, Presto, Pig, Hive, etc.).
  • 2+ years' experience with database programming in relational and nonrelational environments, including AWS Redshift, AWS Aurora, SQL Server, and similar platforms.

 

Location: Mumbai, India

We are one team, for maximum impact. One team with shared goals. We all play for the team, and no one plays against the team. We have a laser-like focus on what our customers need and want, on anticipating their future needs, and on how we can create innovative solutions and experiences for them.