Vacancy expired!
Hi All,Greetings!PFB urgent contract requirement with Direct client and revert with resume if interested to rpradhan(at)softpathtech(dot)com.
Job Title: Big Data Engineer Location: Hybrid Onsite (Mclean, VA)Duration: 6-12 months ContractExp: 10+yrsResponsibilities include:- Cleanse, manipulate and analyze large datasets (Semi-Structured and Unstructured data – XMLs, JSONs, CSVs, PDFs) using python and Snowflake database.
- Develop Python scripts to filter/cleanse/map/aggregate data.
- Manage and implement data processes (Data Quality reports)
- Develop data profiling, deduping logic, matching logic for analysis
- Programming Languages experience in Python, PySpark and SQL for data ingestion
- Present ideas and recommendations on data handling and data parsing technologies to management
- 5+ years of experience in processing large volumes and variety of data (Structured and semi-structured data, writing code for parallel processing, shredding XMLS, JSONs and reading PDFs) - Mandatory
- 3+ years of programming experience in Python for data processing and analysis – Mandatory
- 2+ years of experience with Snowflake, preferable parsing JSON and XML files- Desirable
- Strong SQL experience is a must - Mandatory
- 3+ years of experience – using Hadoop platform and performing analysis. Familiarity with Hadoop cluster environment and configurations for resource management for analysis work - Optional
- 2+ years of programming experience in PySpark for data processing and analysis - Optional
- Detail oriented. Excellent communication skills (verbal and written)
- Must be able to manage multiple priorities and meet deadlines
- Degree in Computer Science , Statistics, Mathematics, or related field
Vacancy expired!