I am a Data Analyst and Data Scientist specializing in scalable data pipelines, big data, real-time processing, and cloud storage.
Built a robust pipeline for replicating MongoDB data into Apache Iceberg for scalable storage and analytics.
✅ Real-time Sync: Kafka + Change Data Capture (CDC) ✅ Data Transformation: Apache Spark ✅ Efficient Querying: Trino ✅ Cloud Storage: AWS S3 ✅ Containerization: Docker
🔗 GitHub: Nareshtiwari74 🔗 LinkedIn: Naresh Tiwari