What are the responsibilities and job description for the Senior Data Manager/ Data architect position at PSRTEK?
Position:- Senior Data Manager/ Data architect / Data Engg LeadLocation: (Onsite from Day 1): -San Francisco, CA ( 4 days from the Office). Job Description:Imaging Data Pipeline delivery: Design, implement, and maintain automated pipelines for onboarding, verifying, transforming, and curating biomedical imaging data from clinical trials and real-world data sources, for therapeutic areas, Oncology, Neurology, Ophthalmology, covering all image file formats.Data Quality and Integrity: Develop and implement solutions to detect and correct anomalies and inconsistencies, to achieve the highest data quality of the imaging data set, per industry standards DICOM, and internal specifications like FFS, RTS, GDSR, etc. Ensure de-identification, PHI/PII controls, and image-specific QC checks are implemented at scale.Data Analysis and integration: Integrate ML and AI-assisted tools in the pipelines for inline image analysis, classifications, segmentations, to extract and enrich metadata for various analyses, optimize performance etc..Image Data Management : Build and maintain large-scale catalogs of curated imaging data sets, enhancing FAIR principles, and providing easy-to-discover and access imaging data assets. Compliance and Controls : Ensure applicable compliance and privacy controls are followed as required, GXP, CSV validations. Collaboration: Working closely with Image scientists, data scientists, clinicians, and biomarker research teams, supporting data needs for various primary and secondary endpoint analyses. External collaboration: Work with external partners, e.g., CROs, in ensuring the imaging data received conforms to the established agreements and quality standards and is complete. Lead to the delivery team to ensure timely delivery of product backlog/featuresParticipate with the team and lead various agile ceremonies throughout the planning and execution. An ideal candidate would have (multiple competencies from list below); Worked with medical imaging data and platforms, PACS, VNAs, etc. Worked with imaging Radiology data, e.g., CT, PET, MRI, nifti, and Ophthalmic imaging, OCT, FA, CFP, etc. Good understanding of DICOM standard, structure, metadata parsing, tags, and multi-frame imagesWorked with Clinical information data standards like SDTM, ADaM Data integration across diverse data sources, e.g., imaging data with tabular clinical dataDe-identification methodologies, PHI/PII detection and privacy controls Good understanding of GXP and CSV validation frameworksProficient in Python, its libraries: pandas, pydicom, SimpleITK, dicom-numpy, dcm2niix Hands-on experience with ETL/ELT involving large medical imaging datasets Apache Airflow, Spark, Talend or similar orchestration of complex workflows. 1Proficiency with SQL and NoSQL, and image metadata stores. (PostgreSQL, Mongo, Etc)Practical experience with AWS infrastructure Data services, e.g., RDS, Athena, Glue, EC2, Lambda, S3 …. Familiar with EKS, Docker, and HPC Experience in data analysis and report generation, e.g, Tibco, Tableau, AWS Quicksite etc Good knowledge of Git, Gitlab, and DevOps tools like Jenkins, Terraform Familiar with using ML workflows for Computer Vision tasks like segmentation, classification, etc.. Nice to have: implemented solutions on NLP and GenAI 18. Worked with cross-functional global teams in a dynamic Agile environment Lead and mentor agile team members. Has 10 years of experience with data platforms, analysis, and insightsEducational Qualifications: -Engineering Degree – BE/ME/BTech/MTech/BSc/MSc.Technical certification in multiple technologies is desirable. PSRTEK is a reputed technology recruitment and IT staffing brand with a global footprint and an admired client base. As an ideas and innovation powerhouse with a culture of excellence, we bring remarkable expertise and deliver powerfully transformative results.
Salary : $79 - $106