This role is for one of our clients.
Compensation: $150-$190 per hour
We partner with leading AI research teams to enhance the quality, reliability, and real-world usefulness of general-purpose conversational AI systems. These systems are increasingly used in healthcare-related contexts, where accuracy, clarity, and responsible reasoning are critical.
This project focuses on evaluating and improving how conversational AI systems respond to medical and healthcare topics. Your clinical expertise will help ensure that responses are factually accurate, clearly explained, clinically sound, and aligned with real-world healthcare standards.
Role Overview
As a Healthcare Expert, you will apply your professional experience to evaluate and refine AI-generated responses related to medicine, patient care, and healthcare systems. You will help identify inaccuracies, strengthen clinical reasoning, and ensure safe and appropriate communication for diverse healthcare scenarios.
Requirements
Key Responsibilities
- Write and refine prompts to guide AI behavior in healthcare-related scenarios.
- Evaluate large language model (LLM) responses for medical accuracy, reasoning quality, clarity, and completeness.
- Fact-check medical and healthcare claims using trusted public and authoritative sources.
- Annotate responses by identifying strengths, weaknesses, factual inaccuracies, and conceptual gaps.
- Assess tone, safety, and appropriateness for real-world healthcare communication, including patient-facing contexts.
- Ensure responses align with expected conversational standards and project guidelines.
- Apply consistent evaluation frameworks, taxonomies, and benchmarking standards.
Required Qualifications
- Minimum of 5 years of professional experience in healthcare.
- Relevant advanced degree or professional credential (e.g., MD, DO, RN, NP, PA, PharmD, MPH, or equivalent).
- Experience in one or more of the following areas:
  - General Clinical Care
  - Specialty Medicine or Surgery
  - Diagnostics, Imaging, or Laboratory Medicine
  - Public Health, Healthcare Systems, or Administration
- Strong familiarity with LLMs and a practical understanding of how they are used in healthcare and professional contexts.
- Excellent written communication skills for complex medical and clinical topics.
- Exceptional attention to detail, with the ability to detect subtle inaccuracies or gaps in clinical reasoning.
Preferred Qualifications
- Prior experience with reinforcement learning from human feedback (RLHF), model evaluation, or data annotation.
- Experience writing or editing high-quality medical or healthcare content.
- Background in clinical documentation, charting, or patient communication.
- Familiarity with structured evaluation rubrics, benchmarks, or quality scoring systems.
What Success Looks Like
- You consistently identify medical inaccuracies, unsafe reasoning patterns, and unclear explanations.
- Your feedback measurably improves the clarity, safety, and reliability of healthcare-related AI responses.
- You deliver structured, reproducible evaluation artifacts that strengthen model performance.
- AI systems evaluated under your guidance demonstrate greater trustworthiness in healthcare scenarios.
Engagement Details
- This is an independent contractor engagement.
- The role is fully remote and can be completed on a flexible schedule.
- Project duration may vary depending on performance and evolving needs.
- The work does not involve access to confidential or proprietary information from any employer, client, or institution.
- Payments are issued weekly via Stripe or Wise based on services rendered.
- Please note: Visa sponsorship, including H-1B or STEM OPT support, is not available for this role.