Clients is global leader in professional information services. Professionals in the areas of healthcare, legal, business, tax, accounting, finance, audit, risk, and compliance rely on Wolters Kluwer's market-leading information-enabled tools and software solutions to manage their business efficiently, deliver results to their clients, and succeed in an ever more dynamic world. Wolters Kluwer combines deep domain knowledge with specialized technology.
Responsibilities:
- Work side-by-side with machine learning engineers to solve AI problems
- Provide expertise in ETL, data wrangling, data analysis and feature engineering using complex big data from a wide variety of sources
- Create and maintain an optimal, scalable, and automated big data pipeline from inception to delivery, augmenting a broader machine learning pipeline
- Implement feature engineering as part of a machine learning model training workflow
- Support the development of proofs of concept to demonstrate the application of AI/ML capabilities in solving customer problems in collaboration with product and other
Must-Have Skills:
- Data Engineering
- Python (NumPy, SciPy, scikit-learn, pandas, matplotlib, anaconda, Jupyter, etc).
- Experience building, optimizing, and scaling data pipelines, architectures, and data sets in cloud-based (AWS or Azure) environments
- Advanced working SQL and NoSQL knowledge, experience working with relational databases (e.g., RDS, Oracle, Postgres) and document databases (e.g., DynamoDB, Cosmos DB)
- Experience with cloud data warehouses (e.g., Redshift, Azure SQL Data Warehouse) and search engines (e.g., Solr, Elasticsearch)
- Working knowledge of message queuing, stream processing, and highly scalable data stores
Nice-to-Have Skills:
- Experience with tools such as Kafka, Spark, or Hadoop is a plus
- Apart from Python, working knowledge of languages such as Java, C++, or C#
- Experience working with agile and Software Development Lifecycle tools (e.g., JIRA, Confluence, Git)
Assessment Path: Data Engineering; MSA signed: Yes; Max All-in rate: $3500 /Month; Location: India; Working hours: 6-8 hours overlap with EST. Must-Haves: Python, Data Engineering