Tendril — AI Lessons for Real Life

Tendril

The premise

AI can turn the organization's responsible AI principles into a scored rubric procurement teams use to compare third-party models on the same axes.

What AI does well here

Translate principles into observable rubric criteria

Suggest evidence sources for each criterion (model card, audit, contract)

Format for spreadsheet-style scoring across vendors

What AI cannot do

Score the vendors on its own

Approve a vendor for use

Replace the procurement team's vendor interviews

End-of-lesson check

15 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-ethics-ai-third-party-model-evaluation-rubric-creators

What is the primary purpose of an AI evaluation rubric designed for procurement teams comparing third-party models?

To let AI systems automatically select vendors based on scores
To replace the need for security reviews entirely
To compare third-party models consistently using defined criteria
To eliminate vendor interviews from the procurement process

Which of the following tasks can AI appropriately perform when helping build an evaluation rubric?

Translate organizational principles into observable criteria
Score vendors based on submitted evidence
Approve a vendor for organizational use
Conduct interviews with vendor representatives

When procurement teams use the evaluation rubric to assess vendors, what must they provide for each criterion?

Automatic approval decisions
Predetermined vendor rankings
Actual evidence from model cards, audits, or contracts
AI-generated recommendations

In which category would a criterion about 'documented bias testing results' most appropriately belong?

Safety
Fairness
Transparency
Operational

Why cannot AI fully replace vendor interviews even when using a comprehensive evaluation rubric?

Vendor interviews are legally prohibited
AI systems cannot read any documents
AI has already conducted all necessary interviews
Interviews reveal qualitative insights that documents alone cannot provide

What role does a model card play in the evaluation rubric process?

It automatically generates scores for all criteria
It serves as the final approval document for vendor selection
It provides evidence for evaluating specific criteria
It replaces the need for human review

Which group or groups are responsible for actually scoring vendors against the rubric criteria?

Only senior leadership
Procurement teams, security teams, and the responsible AI team
Only the AI system that created the rubric
Only external third-party auditors

What does a 1-5 scoring scale represent on the evaluation rubric?

The age of the AI model being evaluated
The price tier of the vendor's services
The degree of compliance or achievement for each criterion
The number of employees at the vendor company

Which of the following would best represent an 'operational' category criterion?

Whether the model can explain its reasoning process
Whether the model produces toxic or harmful content
Whether the model demonstrates demographic parity across outputs
Whether the vendor provides uptime guarantees and support SLAs

What is a fundamental limitation of using AI when building and applying the evaluation rubric?

AI cannot understand organizational principles
AI cannot suggest evidence sources for criteria
AI cannot independently score vendors or approve vendors for use
AI cannot format rubrics into spreadsheet layouts

When translating organizational responsible AI principles into rubric criteria, what characteristics should each criterion have?

They should focus exclusively on cost factors
They should be vague to allow flexibility
They should be observable and tied to specific evidence sources
They should be based solely on vendor marketing claims

Why is it important for each rubric criterion to have identified evidence sources?

To make the rubric document longer and more impressive
To eliminate the need for any human judgment
To ensure scoring is based on verifiable information rather than assumptions
To automatically generate final vendor rankings

What is the key advantage of using a structured rubric approach over ad-hoc vendor evaluation?

It uses AI to make all final decisions
It removes security teams from the evaluation process
It eliminates the need for any documentation
It enables consistent criteria for comparing different vendors

Which of the following would NOT be an appropriate task for AI in the rubric process?

Translating principles into observable criteria
Formatting the rubric for spreadsheet-style scoring
Suggesting evidence sources for each criterion
Approving a vendor based on their rubric scores

Which of the following best describes an appropriate 'transparency' category criterion?

Whether the vendor guarantees 99.9% uptime
Whether the model has been tested for demographic bias
Whether the model filters harmful content
Whether the model provides explanations for its outputs

The premise

AI can turn the organization's responsible AI principles into a scored rubric procurement teams use to compare third-party models on the same axes.

What AI does well here

Translate principles into observable rubric criteria

Suggest evidence sources for each criterion (model card, audit, contract)

Format for spreadsheet-style scoring across vendors

What AI cannot do

Score the vendors on its own

Approve a vendor for use

Replace the procurement team's vendor interviews

End-of-lesson check

15 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-ethics-ai-third-party-model-evaluation-rubric-creators

What is the primary purpose of an AI evaluation rubric designed for procurement teams comparing third-party models?

To let AI systems automatically select vendors based on scores
To replace the need for security reviews entirely
To compare third-party models consistently using defined criteria
To eliminate vendor interviews from the procurement process

Which of the following tasks can AI appropriately perform when helping build an evaluation rubric?

Translate organizational principles into observable criteria
Score vendors based on submitted evidence
Approve a vendor for organizational use
Conduct interviews with vendor representatives

When procurement teams use the evaluation rubric to assess vendors, what must they provide for each criterion?

Automatic approval decisions
Predetermined vendor rankings
Actual evidence from model cards, audits, or contracts
AI-generated recommendations

In which category would a criterion about 'documented bias testing results' most appropriately belong?

Safety
Fairness
Transparency
Operational

Why cannot AI fully replace vendor interviews even when using a comprehensive evaluation rubric?

Vendor interviews are legally prohibited
AI systems cannot read any documents
AI has already conducted all necessary interviews
Interviews reveal qualitative insights that documents alone cannot provide

What role does a model card play in the evaluation rubric process?

It automatically generates scores for all criteria
It serves as the final approval document for vendor selection
It provides evidence for evaluating specific criteria
It replaces the need for human review

Which group or groups are responsible for actually scoring vendors against the rubric criteria?

Only senior leadership
Procurement teams, security teams, and the responsible AI team
Only the AI system that created the rubric
Only external third-party auditors

What does a 1-5 scoring scale represent on the evaluation rubric?

The age of the AI model being evaluated
The price tier of the vendor's services
The degree of compliance or achievement for each criterion
The number of employees at the vendor company

Which of the following would best represent an 'operational' category criterion?

Whether the model can explain its reasoning process
Whether the model produces toxic or harmful content
Whether the model demonstrates demographic parity across outputs
Whether the vendor provides uptime guarantees and support SLAs

What is a fundamental limitation of using AI when building and applying the evaluation rubric?

AI cannot understand organizational principles
AI cannot suggest evidence sources for criteria
AI cannot independently score vendors or approve vendors for use
AI cannot format rubrics into spreadsheet layouts

When translating organizational responsible AI principles into rubric criteria, what characteristics should each criterion have?

They should focus exclusively on cost factors
They should be vague to allow flexibility
They should be observable and tied to specific evidence sources
They should be based solely on vendor marketing claims

Why is it important for each rubric criterion to have identified evidence sources?

To make the rubric document longer and more impressive
To eliminate the need for any human judgment
To ensure scoring is based on verifiable information rather than assumptions
To automatically generate final vendor rankings

What is the key advantage of using a structured rubric approach over ad-hoc vendor evaluation?

It uses AI to make all final decisions
It removes security teams from the evaluation process
It eliminates the need for any documentation
It enables consistent criteria for comparing different vendors

Which of the following would NOT be an appropriate task for AI in the rubric process?

Translating principles into observable criteria
Formatting the rubric for spreadsheet-style scoring
Suggesting evidence sources for each criterion
Approving a vendor based on their rubric scores

Which of the following best describes an appropriate 'transparency' category criterion?

Whether the vendor guarantees 99.9% uptime
Whether the model has been tested for demographic bias
Whether the model filters harmful content
Whether the model provides explanations for its outputs

AI third-party model evaluation rubric for procurement teams

The premise

What AI does well here

What AI cannot do

End-of-lesson check

AI third-party model evaluation rubric for procurement teams

The premise

What AI does well here

What AI cannot do

End-of-lesson check