Connected Components
Connected components measure whether all nodes in the knowledge graph can reach each other through
paths. For drug repurposing, this matters directly: a drug can only be linked to a disease if they
exist in the same connected component. The Largest Connected Component (LCC) is the single giant
cluster that, in a well-integrated graph, contains the vast majority of nodes.
79.7 %
of all nodes in the LCC
of all nodes in the LCC
7,316,507
nodes in the LCC
nodes in the LCC
1,840,418
disconnected fragments
disconnected fragments
Core Entity Coverage
For drug repurposing to work, both drugs and diseases must be reachable within the graph. These
numbers show what fraction of EC-curated core entities reside in the LCC.
99.98 %
All Core Entities
24,344 of 24,348 in LCC
All Core Entities
24,344 of 24,348 in LCC
99.77 %
Core Drugs
1,762 of 1,766 in LCC
Core Drugs
1,762 of 1,766 in LCC
100.00 %
Core Diseases
22,582 of 22,582 in LCC
Core Diseases
22,582 of 22,582 in LCC
Weighted Connectivity Score
0.9998
0.9998
Accounts for both core entity placement and component sizes relative to the LCC.
A score of 1.0 means all core entities are in the LCC.
LCC Composition
The LCC contains two kinds of nodes: the core entities we care about for drug repurposing (EC-curated
drugs and diseases), and all the other biological entities that connect them. Understanding what
makes up the non-core majority helps assess whether the graph's connective tissue is biologically
meaningful.
Core Entities
Loading...
1,762
Core Drugs
Core Drugs
22,582
Core Diseases
Core Diseases
24,344 core entities of 7,316,507 total LCC nodes
Non-Core Nodes by Category
The remaining 7,292,163 non-core nodes provide the
connective structure between drugs and diseases. The breakdown by biolink parent category shows what
kinds of biological entities form this bridge.
Loading...
No Results
Component Size Distribution
Connected components follow a power-law distribution: the LCC is enormous, and everything else is
tiny. The table below buckets components by size to show this pattern clearly.
No Results
Minor Components with Core Entities
Core entities (EC-curated drugs and diseases) that are not in the LCC cannot participate in
path-based drug repurposing. Any minor component containing core entities represents a gap
in graph connectivity.
No Results
Minor Components Knowledge Sources
Which knowledge sources contribute edges to the disconnected minor components? If fragmentation
is driven by a single source, it may indicate poorly-connected data being brought in. A spread
across many sources suggests a more systemic integration gap.
No Results
Page
/
3 15 of 37 records
Top 20 Minor Components
The largest disconnected fragments outside the LCC. Click a row to inspect the nodes and
edges within that component.
No Results
