| name | dismech-terms |
| description | Skill for adding and validating ontology term references in the dismech knowledge base. This skill should be used when working with disorder YAML files that need ontology term annotations (HPO for phenotypes, CL for cell types, GO for biological processes, MONDO for diseases, UBERON for anatomical entities). Use this skill when adding phenotype_term, cell_types term, biological_processes term, or other ontology-bound fields to disorder files. |
Dismech Ontology Terms Skill
Overview
Add and validate ontology term references in the dismech disorder knowledge base. This ensures phenotypes, cell types, biological processes, and other entities are properly linked to authoritative ontology terms with correct IDs and labels.
When to Use
- Adding
phenotype_termto phenotype entries (uses HP - Human Phenotype Ontology) - Adding
termtocell_typesentries (uses CL - Cell Ontology) - Adding
termtobiological_processesentries (uses GO - Gene Ontology) - Adding
disease_termto disease entries (uses MONDO) - Validating existing ontology term references
- Fixing label mismatches between preferred_term and ontology labels
Term Object Structure
All term references follow this YAML structure:
# For phenotypes:
phenotype_term:
preferred_term: <Human readable name>
term:
id: HP:XXXXXXX
label: <Exact HP label from ontology>
# For cell types:
cell_types:
- preferred_term: <Human readable name>
term:
id: CL:XXXXXXX
label: <Exact CL label from ontology>
# For biological processes:
biological_processes:
- preferred_term: <Human readable name>
term:
id: GO:XXXXXXX
label: <Exact GO label from ontology>
Ontology Lookup with OAK
Use the Ontology Access Kit (OAK) to look up terms:
Exact Match
uv run runoak -i sqlite:obo:hp info "seizure"
# Returns: HP:0001250 ! Seizure
Fuzzy Search
uv run runoak -i sqlite:obo:hp info "l~cognitive impairment"
# Returns multiple matches - select the most appropriate
Get Full Term Details
uv run runoak -i sqlite:obo:cl info CL:0000540 -O obo
# Returns complete term information including definition
Common Ontology Prefixes
| Ontology | Prefix | Adapter | Use For |
|---|---|---|---|
| Human Phenotype | HP | sqlite:obo:hp | phenotype_term |
| Cell Ontology | CL | sqlite:obo:cl | cell_types |
| Gene Ontology | GO | sqlite:obo:go | biological_processes |
| MONDO Disease | MONDO | sqlite:obo:mondo | disease_term |
| Uberon Anatomy | UBERON | sqlite:obo:uberon | anatomical locations |
Specificity Guidelines
Critical: Always use the most specific term that accurately describes the entity:
| Incorrect (too general) | Correct (specific) |
|---|---|
| CL:0000066 epithelial cell | CL:0002202 epithelial cell of tracheobronchial tree |
| HP:0000001 All | HP:0001250 Seizure |
| CL:0000000 cell | CL:0000540 neuron |
When a fuzzy search returns multiple results:
- Review all candidates
- Check term definitions with
-O oboflag - Select the term that most precisely matches the biological context
- If no specific term exists, use the closest parent but note the limitation
Validation
After adding terms, validate with:
just validate-terms
This checks:
- Term IDs exist in the ontology
- Labels match the canonical ontology labels exactly
- Required fields are present
Fixing Label Mismatches
If validation reports a label mismatch:
LABEL MISMATCH: Cholera.yaml
Term: HP:0003394
Expected: Muscle cramps
Actual: Muscle spasm
Update the label field to match the ontology's canonical label exactly.
Batch Processing
To find entries missing term annotations:
import yaml
import glob
for f in glob.glob("kb/disorders/*.yaml"):
with open(f) as file:
data = yaml.safe_load(file)
for pheno in data.get('phenotypes', []):
if 'phenotype_term' not in pheno:
print(f"{f}: {pheno.get('name')} - missing phenotype_term")
Common Patterns
Adding HPO to a Phenotype
- Look up term:
uv run runoak -i sqlite:obo:hp info "l~<phenotype name>" - Verify specificity:
uv run runoak -i sqlite:obo:hp info <HP:ID> -O obo - Add to YAML:
phenotype_term: preferred_term: <Original Name> term: id: <HP:ID> label: <Exact label from OAK> - Validate:
just validate-terms
Adding CL to Cell Types
- Look up term:
uv run runoak -i sqlite:obo:cl info "l~<cell type>" - Verify specificity
- Add
term:block under the cell_type entry - Validate