Connected Components

Connected components measure whether all nodes in the knowledge graph can reach each other through paths. For drug repurposing, this matters directly: a drug can only be linked to a disease if they exist in the same connected component. The Largest Connected Component (LCC) is the single giant cluster that, in a well-integrated graph, contains the vast majority of nodes.
79.7 %
of all nodes in the LCC
7,316,507
nodes in the LCC
1,840,418
disconnected fragments

Core Entity Coverage

For drug repurposing to work, both drugs and diseases must be reachable within the graph. These numbers show what fraction of EC-curated core entities reside in the LCC.
99.98 %
All Core Entities
24,344 of 24,348 in LCC
99.77 %
Core Drugs
1,762 of 1,766 in LCC
100.00 %
Core Diseases
22,582 of 22,582 in LCC
Weighted Connectivity Score
0.9998
Accounts for both core entity placement and component sizes relative to the LCC. A score of 1.0 means all core entities are in the LCC.

LCC Composition

The LCC contains two kinds of nodes: the core entities we care about for drug repurposing (EC-curated drugs and diseases), and all the other biological entities that connect them. Understanding what makes up the non-core majority helps assess whether the graph's connective tissue is biologically meaningful.

Core Entities

Loading...
1,762
Core Drugs
22,582
Core Diseases
24,344 core entities of 7,316,507 total LCC nodes

Non-Core Nodes by Category

The remaining 7,292,163 non-core nodes provide the connective structure between drugs and diseases. The breakdown by biolink parent category shows what kinds of biological entities form this bridge.
Loading...
No Results

Component Size Distribution

Connected components follow a power-law distribution: the LCC is enormous, and everything else is tiny. The table below buckets components by size to show this pattern clearly.
No Results

Minor Components with Core Entities

Core entities (EC-curated drugs and diseases) that are not in the LCC cannot participate in path-based drug repurposing. Any minor component containing core entities represents a gap in graph connectivity.
No Results

Minor Components Knowledge Sources

Which knowledge sources contribute edges to the disconnected minor components? If fragmentation is driven by a single source, it may indicate poorly-connected data being brought in. A spread across many sources suggests a more systemic integration gap.
No Results

Top 20 Minor Components

The largest disconnected fragments outside the LCC. Click a row to inspect the nodes and edges within that component.