Description:
Proven experience working with Python for data analysis and programming. Expertise in Apache Spark and PySpark for distributed data processing. Experience with Databricks notebooks, clusters, and workspace management on either AWS or Azure. Strong understanding of data warehousing, data lakes, and data engineering concepts. Experience with data visualization libraries (e.g., Matplotlib, Seaborn) is a plus. Experience building and deploying machine learning models is a plus. Excellent communicati
Apr 24, 2024;
from:
dice.com