What are the responsibilities and job description for the Senior Data Scientist II position at LexisNexis?

Are you looking to develop your Data Scientist career?

Would you enjoy training more junior team members?

About The Team

LexisNexis Legal & Professional, which serves customers in more than 150 countries with 11,800 employees worldwide, is part of RELX (www.relx.com), a global provider of information-based analytics and decision tools for professional and business customers. Our company has been a long-time leader in deploying AI and advanced technologies to the legal market to improve productivity and transform the overall business and practice of law, deploying ethical and powerful generative AI solutions with a flexible, multi-model approach that prioritizes using the best model from today’s top model creators for each individual legal use case. The company employs over 2,000 technologists, data scientists, and experts to develop, test, and validate solutions in line with RELX Responsible AI Principles (https://stories.relx.com/responsible-ai-principles/index.html).

About The Role

Join our team to help build state-of-the-art research tools. Our Data Science teams focus on extracting key information such as entities mentioned, sentiment analysis, data enrichments, predictive insights, and more to build best in class data and news streams relied on by our global customer base. Responsible for the end‑to‑end design and continuous evolution of a multimodal document understanding and structured data extraction platform: complex PDF / scanned page layout analysis, semantic extraction, structural reconstruction, quality validation, and business integration. Leads multimodal model strategy (vision language layout) and multi‑agent collaboration (task decomposition, verification, conflict reconciliation, feedback loops) and plans future customized training and ongoing optimization of models.

Responsibilities

Design and iterate the multimodal document parsing pipeline: layout / structural modeling, semantic extraction, cross‑modal alignment, structural reconstruction.
Build and optimize a multi‑agent collaboration mechanism: task splitting, parallel / sequential scheduling, peer review, iterative quality improvement loops.
Define model selection / composition / routing strategies (dynamic dispatch by document type, structural patterns, quality signals).
Plan and execute model fine‑tuning, domain adaptation, continual learning, active learning, and data feedback loops.
Establish end‑to‑end metrics: extraction accuracy, structural consistency, agent collaboration effectiveness, latency, stability, and cost.
Build quality assurance and risk controls: drift & anomaly monitoring, confidence estimation, fallback strategies, alignment / compliance checks.
Drive mapping and consistency between agent / model outputs and business knowledge field standards.

Requirements

Education: Master’s degree or above in a quantitative or technical field (Statistics, Computer Science, Mathematics, Data Science, etc.).
Experience: 5 years of hands‑on machine learning / data science experience. Proven delivery experience in multimodal (vision text) or complex document understanding. Practical cases of orchestrating agents (or modular processing logic) in production workflows.
Capabilities: Solid foundation in machine learning / deep learning fundamentals, multimodal representations, and cross‑modal alignment concepts. Deep understanding of core principles and common algorithms for multimodal large models: cross‑modal attention & representation alignment, vision/text embedding fusion, hierarchical & layout structure modeling, instruction & contrastive paradigms, long‑context and retrieval‑augmented mechanisms, evaluation and failure mode dissection. Familiar with classic image and signal processing methods: edge & contour detection, filtering & denoising, morphological operations, segmentation & key point feature extraction, frequency / time‑frequency analysis, image enhancement & quality assessment; understands trade‑offs and complementarity with deep features. Knowledge of multi‑agent collaboration patterns: role assignment, task routing, feedback loops, redundancy & cross‑checks. Strong in statistical analysis & experimental design: hypothesis testing, factorial design, power analysis, A/B and multivariate evaluation. Able to decompose complex problems and build metric‑driven optimization paths. Rigorous in data quality & error analysis; rapid bottleneck identification. Ability to translate research pseudo‑code into maintainable, testable Python modules with benchmarking & regression harnesses.

Preferred Experience

Designed customization / fine‑tuning of multimodal foundation models, representation learning, or structural understanding subsystems.
Built an agent orchestration platform: task decomposition, iterative self‑checks, consensus or voting mechanisms.
Experience solving robustness & generalization challenges in large‑scale long documents / heterogeneous layouts.
Demonstrated results in cost optimization (model pruning, parameter‑efficient tuning, inference acceleration) or adaptive load scheduling.
Publications / patents or open‑source contributions.
Demonstrated Python systems optimization (e.g., custom Cython / CUDA kernels, vectorization replacing Python loops, latency reductions in inference pipelines).

U.S. National Base Pay Range: $102,800 - $171,300. Geographic differentials may apply in some locations to better reflect local market rates. This job is eligible for an annual incentive bonus. We know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer to our US full- and part-time employees working at least 20 hours or more per week:

Health Benefits: Comprehensive, multi-carrier program for medical, dental and vision benefits
Retirement Benefits: 401(k) with match and an Employee Share Purchase Plan
Wellbeing: Wellness platform with incentives, Headspace app subscription, Employee Assistance and Time-off Programs
Short-and-Long Term Disability, Life and Accidental Death Insurance, Critical Illness, and Hospital Indemnity
Family Benefits, including bonding and family care leaves, adoption and surrogacy benefits
Health Savings, Health Care, Dependent Care and Commuter Spending Accounts
In addition to annual Paid Time Off, we offer up to two days of paid leave each to participate in Employee Resource Groups and to volunteer with your charity of choice

Salary : $102,800 - $171,300

Apply for this job

Receive alerts for other Senior Data Scientist II job openings

Senior Data Scientist II

What are the responsibilities and job description for the Senior Data Scientist II position at LexisNexis?

What is the career path for a Senior Data Scientist II?

Job openings at LexisNexis

Not the job you're looking for? Here are some other Senior Data Scientist II jobs in the Virginia, VA area that may be a better fit.

We don't have any other Senior Data Scientist II jobs in the Virginia, VA area right now.

AI Assistant is available now!