Do images labeled "Hippo" actually match Hippo pathways best?
What this shows: We compare each image labeled "Hippo" against all 239 KEGG pathways and all 984 WikiPathways pathways to verify the labeling.
Key question: If an image is labeled "Hippo signaling", does it have the highest gene overlap with the Hippo pathways, or does it actually match PI3K-AKT or some other pathway better?
Method: Pure Jaccard similarity on Entrez gene IDs, i.e. |A ∩ B| / |A ∪ B| for an image gene set A and a pathway gene set B.
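Example: Image has genes {YAP1, LATS1, MST1} and KEGG Hippo has {YAP1, LATS1, LATS2, MST1, MST2, SAV1, ...157 total}. All 3 image genes are in the pathway, so the intersection is 3, the union is 157, and Jaccard = 3/157 ≈ 0.019.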
Note: Low Jaccard is expected, because images show partial pathway views, not the full reference pathway.
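The comparison described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the function names `jaccard` and `rank_pathways`, the toy pathway names, and the integer IDs are all hypothetical placeholders rather than real Entrez IDs or real pathway contents.

```python
def jaccard(a, b):
    """Jaccard similarity |A ∩ B| / |A ∪ B|; defined as 0.0 when both sets are empty."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rank_pathways(image_genes, pathways):
    """Score one image against every reference pathway, best match first."""
    scores = {name: jaccard(image_genes, genes) for name, genes in pathways.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data: placeholder integers stand in for Entrez gene IDs (assumption).
image_genes = {1, 2, 3}
pathways = {
    "Hippo (toy)": {1, 2, 3, 4, 5, 6},
    "PI3K-AKT (toy)": {3, 7, 8, 9},
}

ranking = rank_pathways(image_genes, pathways)
best_name, best_score = ranking[0]
print(best_name, round(best_score, 3))
```

In a real run, `pathways` would hold all KEGG and WikiPathways gene sets, and the label is supported only if a Hippo pathway appears at or near the top of `ranking`.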
29% of gene labels (482/1648) lack Entrez IDs. Many are Hippo-related family names:
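Labels without Entrez IDs are dropped from the image gene sets, so they cannot contribute to the Jaccard intersection. Where they correspond to genuine pathway members, the true scores are likely higher than reported.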
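One way to quantify the unmapped share is sketched below. The helper name `unmapped_fraction`, the label strings, and the mapping dictionary are hypothetical; only the 482/1648 figure comes from the text above.

```python
def unmapped_fraction(labels, symbol_to_entrez):
    """Return (fraction of labels with no Entrez ID, list of those labels)."""
    missing = [label for label in labels if label not in symbol_to_entrez]
    fraction = len(missing) / len(labels) if labels else 0.0
    return fraction, missing


# Toy data: placeholder labels and IDs, not real gene symbols or Entrez IDs.
labels = ["GENE_A", "GENE_B", "FAMILY_X", "FAMILY_Y"]
symbol_to_entrez = {"GENE_A": 101, "GENE_B": 102}

fraction, missing = unmapped_fraction(labels, symbol_to_entrez)
print(fraction, missing)

# Sanity check on the figure reported above: 482 of 1648 labels.
reported_percent = round(482 / 1648 * 100)
print(reported_percent)
```

Listing `missing` per image makes it easier to see which family-level labels could be expanded to member genes before rescoring.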