NAVEEN KUMAR M
Data Engineer
Specializing in Data Pipelines & ML Infrastructure
Data Engineering • ETL Pipelines • Cloud Architecture
Building scalable data solutions for business growth
Summary
Data Engineer with 5+ years of experience specializing in Extract, Transform, Load (ETL) processes, data pipeline development, and analytics solutions. Proven expertise in Amazon Web Services (AWS), Snowflake, PostgreSQL, and Python for building scalable data infrastructure. Demonstrated success in delivering data-driven insights and optimizing business operations through efficient data engineering solutions.
Key Skills
- Data Engineering: ETL Pipeline Development, Data Warehousing, Data Modeling, Big Data Processing, Data Migration
- Cloud Platforms: Amazon Web Services (AWS), Snowflake Data Cloud
- Databases: PostgreSQL, MySQL, Data Lakes, Data Warehouses
- Programming Languages: Python, SQL, JavaScript
- Machine Learning: PyTorch, Scikit-learn, Neural Networks, Deep Learning (currently learning)
- Business Intelligence: Microsoft Power BI, Looker Studio, Metabase, Data Visualization
- Automation Tools: n8n Workflow Automation, Python Scripting, Google Apps Script, RESTful APIs
- Version Control: Git, GitHub
- Project Management: Agile Methodology, Jira
Experience
Assistant Manager – Operations
Indian Education Service | Nov 2024 – Present
- Implemented ETL pipelines using AWS and Snowflake, improving data processing efficiency by 40%
- Developed automated tracking systems integrating n8n workflows with Python scripts
- Created real-time dashboards in Microsoft Power BI for recruitment and performance monitoring
- Optimized database queries in PostgreSQL, reducing response time by 50%
- Established data quality frameworks ensuring 99.9% data accuracy
Assistant Manager – MIS
GetMyUni | Jul 2020 – Nov 2024
- Architected data pipelines processing 1M+ daily records using Python and SQL
- Built automated reporting systems using Power BI and Looker Studio, reducing manual effort by 60%
- Integrated multiple data sources using RESTful APIs and custom ETL processes
- Implemented data validation checks, reducing error rates by 85%
Associate CS
Sutherland Global Services | Apr 2021 – Mar 2022
- Analyzed customer data using SQL queries to identify trends and patterns
- Generated automated reports for customer satisfaction metrics using Excel
- Maintained data quality in Customer Relationship Management (CRM) systems
Projects
Enterprise Data Pipeline Development
Built a scalable ETL pipeline using AWS services and Snowflake. Implemented automated data quality checks using Python. Reduced data processing time by 65%.
Tech: AWS, Snowflake, Python, SQL
Advanced Analytics Dashboard
Developed a real-time analytics platform using Power BI and SQL. Created custom DAX measures and automated refresh mechanisms.
Tech: Power BI, SQL, Python, DAX
Machine Learning Pipeline (In Progress)
Building an end-to-end ML pipeline that integrates PyTorch models with data engineering workflows.
Tech: Python, PyTorch, AWS, MLflow
Education
- Bachelor of Engineering, Anna University, Chennai, Tamil Nadu | Jul 2016 – Dec 2020
- Higher Secondary School, Christian Matriculation, Oddanchatram, Tamil Nadu (TNBSE) | Oct 2014 – May 2016
- Schooling, Kendriya Vidyalaya, Karaikudi, Tamil Nadu (CBSE) | Jun 2004 – Sep 2014
Certifications
- Amazon Web Services (AWS) EMR & Glue
- Google Data Science Foundations
- HackerRank SQL Advanced Certification
- Newton School Database Management
- Crio.Do QA Automation Engineer
Achievements
- Processed and analyzed 10+ TB of data through optimized pipelines
- Completed 135+ advanced SQL challenges on LeetCode and HackerRank
- Reduced system downtime by 75% through improved monitoring and maintenance
- Led 5 successful data migration projects with zero data loss