
LLM Evaluation Expert

Belcan Corporation
Seattle, WA Full Time
POSTED ON 4/29/2025 CLOSED ON 4/30/2025

What are the responsibilities and job description for the LLM Evaluation Expert position at Belcan Corporation?

Details:

Job Title: LLM Evaluation Expert
Location: Remote
Start Date: Right Away
Keywords: #RemoteJobs; #LLMEvaluationExpertJobs;
Compensation: This position offers an hourly rate of $57 - $59.

An LLM Evaluation Expert job (remote) is currently available through Belcan. In this role, you will assess and help improve our language models' coding capabilities. If you are interested in this role, apply today!

Job Description:
Actual Job Title: LLM Evaluation Expert - Domain: Coding
The role will follow the qualifications and job duties below. Only 10 hours per week, no wiggle room on this.

Company Overview: Artificial General Intelligence (AGI) Data Services is at the forefront of AI innovation, specializing in the development and refinement of large language models (LLMs). Our mission is to create AI systems that can understand and generate high-quality code, revolutionizing the software development landscape.

Job Description: As an LLM Evaluation Expert specializing in Coding, you will play a crucial role in assessing and improving our language models' coding capabilities. Your expertise will be instrumental in evaluating LLM-generated code responses, making high-level judgments, and setting the standard for what constitutes excellent AI-assisted coding.

Key Responsibilities:
* Critically analyze and evaluate code responses generated by our LLMs across various programming languages and paradigms
* Exercise expert judgment to select the most appropriate and efficient code solutions from multiple LLM-generated options
* Make informed decisions on behalf of our customers, ensuring that selected code meets industry standards, best practices, and specific client needs
* Develop and write coding demonstrations to illustrate "what good looks like" in AI-generated code, setting benchmarks for quality and efficiency
* Provide detailed feedback and explanations for your evaluations, helping to refine and improve the LLM's understanding and output
* Collaborate with the AI research team to identify areas for improvement in the LLM's coding capabilities
* Stay abreast of the latest developments in software engineering, coding standards, and AI to ensure our evaluations remain cutting-edge

Required Qualifications:
* Advanced degree in Computer Science, Software Engineering, or a related field
* Extensive experience (5 years) in software development across multiple programming languages and paradigms
* Demonstrated ability to critically evaluate code quality, efficiency, and adherence to best practices
* Strong analytical and decision-making skills, with the ability to make complex judgments under ambiguous circumstances
* Excellent written and verbal communication skills, with the ability to explain technical concepts clearly
* Experience in technical writing, particularly in creating coding examples or tutorials

Preferred Qualifications:
* Previous experience working with or evaluating AI systems, particularly in the context of code generation
* Familiarity with a wide range of software development methodologies and architectural patterns
* Understanding of machine learning concepts, particularly as they apply to natural language processing and code generation
* Experience in creating or contributing to coding standards or style guides
This role requires a unique blend of technical expertise, critical thinking, and communication skills. You will be the bridge between advanced AI technology and practical, real-world coding applications. Your work will directly influence the development of next-generation AI coding assistants, shaping the future of software development.
If you're passionate about code quality, have a keen eye for detail, and are excited about the potential of AI in software engineering, we encourage you to apply for this pivotal role at AGI Data Services.

Desired coding languages for this role:
Python, PHP, Java, Ruby, JavaScript, TypeScript, C , Go, Cypher, SQL - must have experience with several to be considered for the role

Please ensure that you are giving them the information included in the intake form attached in the request, including the actual job title listed in the intake form in the job description section.
Candidates selected for an interview will be tested on their ability to execute the duties in the job description, including through practical exercises based on the job description.

In the interviews and job role, candidates will be presented with 2 to 3 examples of user prompts to an LLM and the model's response. All prompts will be coding-related questions. Candidates will need to understand the user's request (e.g., generate code, evaluate code, or explain code) and evaluate the model's response against that request on dimensions such as correctness, logic, coherence, and applicability to the user's question. After taking time to evaluate each example, candidates must be able to summarize their findings in a few sentences, explaining any issues found and how they would adjust the model's answer to improve it.
In the interviews and job role, candidates will not be designing, writing, or developing code from scratch. They will only be evaluating model responses as described above.


If you are interested in this role, please apply via the apply now link provided. Our overriding goal is to provide quality staffing solutions that help people, organizations, and communities succeed. Belcan is a leading provider of qualified personnel to many of the world's most respected enterprises. We offer excellent opportunities for contract, temporary, temp-to-hire, and direct assignments. We are the employer of choice for thousands worldwide. For more information, please visit our website at Belcan.com

Belcan is an equal opportunity employer. EOE/M/F/D/V

 



This job has expired.