What are the responsibilities and job description for the "Databricks Certified Architect" position at Kastech Software Solutions Group?
Urgent Role: Databricks Certified Architect
Location: Washington, DC
Experience: 12 years
Responsibilities
- Design and implement enterprise-scale solutions on the Databricks Lakehouse Platform.
- Architect end-to-end data pipelines for batch and real-time processing using Apache Spark and PySpark.
- Develop scalable data ingestion, transformation, and data quality frameworks.
- Design and implement Medallion Architecture (Bronze, Silver, Gold) using Delta Lake.
- Build and optimize data warehouses, data marts, and analytical solutions.
- Implement data governance, security, lineage, and access controls using Unity Catalog.
- Develop and support AI/BI dashboards, semantic models, and self-service analytics solutions.
- Configure and optimize Genie Spaces to enable natural language business queries and conversational analytics.
- Design and deploy Generative AI and RAG-based solutions using Databricks Mosaic AI and Vector Search.
- Collaborate with business users to translate requirements into scalable data and AI solutions.
- Optimize Databricks workloads for performance, scalability, reliability, and cost efficiency.
- Lead cloud-native implementations across Azure environments.
- Define architecture standards, best practices, and reusable design patterns.
- Mentor data engineers, analysts, and architects on Databricks technologies and platform adoption.
- Lead migration initiatives from legacy data warehouses and analytics platforms to Databricks.
- Build and maintain Genie Spaces for business self-service analytics.
- Create semantic models, metrics, and trusted data assets for AI-driven reporting.
- Develop natural language-to-SQL analytics solutions using Databricks Genie.
- Implement RAG solutions using enterprise data and Vector Search.
- Optimize AI/BI dashboards and conversational analytics experiences.
- Troubleshoot Spark performance, query optimization, and workload management issues.
- Automate data validation, monitoring, and governance controls.
- Support AI use cases using Mosaic AI model serving and inference endpoints.
Technical Skills
- Databricks Lakehouse Platform, Apache Spark, PySpark, Spark SQL
- Python, SQL, Delta Lake, Delta Live Tables, Lakeflow
- Unity Catalog, Databricks AI/BI and Genie
- Mosaic AI, Vector Search, RAG
- Data Modeling (Dimensional & Data Vault)
- Structured Streaming
- Data Quality and Data Governance
- Azure, Terraform, Git, Azure DevOps, Jenkins
- REST APIs and Data Integration
- Performance Tuning and Cost Optimization
Certification
- Architect: Databricks Certified Data Engineer Professional (Azure Databricks).