# Detectability of Communities in Heterogeneous Hypergraphs
## Satellite on Higher-Order Models in Network Science, Networks 2021
### Phil Chodrow, UCLA Mathematics, July 2nd, 2021

---

exclude: true
<style type="text/css">
code.r{ 
 font-size: 16px; 
}
pre {
 font-size: 16px !important;
}
</style>

---

.column.bg-main1[

## Community Detection in Hypergraphs

**Core Problem**: Label nodes on the basis of observed (hyper)edges.

We often picture .alert[assortative], densely-connected communities.

What happens when edges of .alert2[different sizes] play .alert2[different roles]?

*E.g., large gatherings play different social/connective roles than small meetings.*

]

---


# *When is community detection in hypergraphs **possible**?*

---
layout: true
class: split-two with-border middle

.column[
  .split-three[ 
  .row.bg-main1[.content.vmiddle[.font_medium[  
.alert[**Detectability thresholds**] for communities in graphs. 
  ]]]     
  .row.bg-main2[.content.vmiddle[.font_medium[
.alert[**Experimental results**] in random hypergraphs.
  ]]] 
  .row.bg-main5[.content.vmiddle[.font_medium[
Steps toward hypergraph theory via .alert[**nonbacktracking operators**]. 
  ]]]
]]

<img src="img/newman-detection-experiment.png" width=100%>
 
<div class="footnote">
 Nadakuditi, R. R., & Newman, M. E. (2012). Graph spectra and the detectability of community structure in networks. Physical Review Letters, 108(18), 188701. 
</div>

---
class: hide-row3-col1 hide-row4-col1 hide-row5-col1
 
<img src="img/detectability-second-panel.png" width=100%>
 
<div class="footnote">
 PSC, N. Veldt, A. R. Benson, (2021). Generative hypergraph clustering: from blockmodels to modularity, Science Advances (forthcoming)
</div>

---
class: hide-row4-col1 hide-row5-col1 
 
<img src="img/hypergraph-nonbacktracking.png" width=100%>

---

class: fade-row2-col1 fade-row3-col1 fade-row4-col1 fade-row5-col1
 
 <img src="img/newman-detection-experiment.png" width=100%>
 
<div class="footnote">
 Nadakuditi, R. R., & Newman, M. E. (2012). Graph spectra and the detectability of community structure in networks. Physical Review Letters, 108(18), 188701. 
</div>

---

.column.bg-main1[

## Detectability in Graphs

In a random graph (SBM) with two equally sized communities:

- `$\color{#63d297}{c_i}$`: average number of .alert2[within-cluster] neighbors per node.
- `$\color{#ff5252}{c_o}$`: average number of .alert[between-cluster] neighbors per node.

**Detectability threshold**: In the large-graph limit, it is possible to *detect communities* iff

`$$(\color{#63d297}{c_i} - \color{#ff5252}{c_o})^2 > \color{#63d297}{c_i} + \color{#ff5252}{c_o}\;.$$`

]

.column[
Informally, an algorithm **detects communities** if the output of the algorithm has better-than-random correlation with true communities (in expectation).

]
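
One way to make "better-than-random correlation" concrete is an overlap score. The sketch below is an illustration rather than the definition used in the cited papers: the function name `overlap` and the rescaling (0 at chance, 1 at perfect recovery) are assumptions made here. Because cluster labels are arbitrary, it takes the best agreement over all relabelings.

```python
from itertools import permutations


def overlap(true_labels, found_labels, n_groups=2):
    """Best agreement over label permutations, rescaled so chance is 0 and perfect is 1."""
    n = len(true_labels)
    best = 0.0
    for perm in permutations(range(n_groups)):
        # Relabel the found clusters with this permutation and count matches.
        agree = sum(perm[f] == t for t, f in zip(true_labels, found_labels))
        best = max(best, agree / n)
    chance = 1.0 / n_groups  # agreement of a blind guess with equally sized groups
    return (best - chance) / (1.0 - chance)


truth = [0, 0, 0, 0, 1, 1, 1, 1]
swapped = [1, 1, 1, 1, 0, 0, 0, 0]      # same partition, labels exchanged
alternating = [0, 1, 0, 1, 0, 1, 0, 1]  # carries no information about truth

print(overlap(truth, swapped))
print(overlap(truth, alternating))
```

In this language, an algorithm detects communities when its expected overlap stays bounded away from zero as the graph grows.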

---

.column.bg-main1[

## Detectability in Graphs

In a random graph (SBM) with two equally sized communities:

- `$\color{#63d297}{c_i}$`: average number of .alert2[within-cluster] neighbors per node.
- `$\color{#ff5252}{c_o}$`: average number of .alert[between-cluster] neighbors per node.

**Detectability threshold**: In the large-graph limit, it is possible to *detect communities* iff

`$$(\color{#63d297}{c_i} - \color{#ff5252}{c_o})^2 > \color{#63d297}{c_i} + \color{#ff5252}{c_o}\;.$$`

]

## *A Brief History*

**Conjectured** using belief propagation by Decelle, A., Krzakala, F., Moore, C., & Zdeborová, L. (2011), *Physical Review E*.

**Re-conjectured** via nonbacktracking operators in Krzakala, F., Moore, C., Mossel, E., Neeman, J., Sly, A., Zdeborová, L., & Zhang, P. (2013), *PNAS*.

**Negative direction proved** via a coupling to broadcasting on trees: Mossel, E., Neeman, J., & Sly, A. (2015). *Probability Theory and Related Fields*.

**Positive direction proved** via weighted nonbacktracking paths by Mossel, E., Neeman, J., & Sly, A. (2018), *Combinatorica*.

**Helpful review**: Abbe, E. (2017), *JMLR*.
]
]

---

.column.bg-main1[

## Detectability in Graphs

In a random graph (SBM) with two equally sized communities:

- `$\color{#63d297}{c_i}$`: average number of .alert2[within-cluster] neighbors per node.
- `$\color{#ff5252}{c_o}$`: average number of .alert[between-cluster] neighbors per node.

**Detectability threshold**: In the large-graph limit, it is possible to *detect communities* iff

`$$(\color{#63d297}{c_i} - \color{#ff5252}{c_o})^2 > \color{#63d297}{c_i} + \color{#ff5252}{c_o}\;.$$`

]

.column[ ## Illustration
 
 <img src="img/newman-detection-experiment.png" width=100%>
.font_smaller[A spectral method is shown, but the threshold applies to **any algorithm.**]
 
<div class="footnote">
 Nadakuditi, R. R., & Newman, M. E. (2012). Graph spectra and the detectability of community structure in networks. Physical Review Letters, 108(18), 188701. 
</div>
]

---
layout: true
class: split-two with-border middle

.column[
  .split-three[ 
  .row.bg-main1[.content.vmiddle[.font_medium[
.alert[**Detectability thresholds**] for communities in graphs. 
  ]]]     
  .row.bg-main2[.content.vmiddle[.font_medium[
.alert[**Experimental results**] in random hypergraphs. 
  ]]]
  .row.bg-main5[.content.vmiddle[.font_medium[
Steps toward hypergraph theory via .alert[**nonbacktracking operators**]. 
  ]]]
]]

---

<img src="img/detectability-second-panel.png" width=100%>
 
<div class="footnote">
 PSC, N. Veldt, A. R. Benson, (2021). Generative hypergraph clustering: from blockmodels to modularity, Science Advances (forthcoming)
</div>

---
class: split-50 bg-main1 
layout: false

.row[ 
.split-three[
.column[ 
 <img src="img/nate_portrait.jpeg" width=90%> 
 ]
.column[ 
 <img src="img/austin_portrait.jpeg" width=90%> 
]
.column[ 
 <img src="img/phil_portrait.jpeg" width=90%> 
]

]
]
.row[ 
.split-three[
.column[ 
 .font_large[.alert[Nate Veldt]]
 Applied Mathematics Cornell University
 @n_veldt
]
.column[ 
 .font_large[.alert[Austin Benson]]
 Computer Science Cornell University
 @austinbenson
]
.column[ 
 .font_large[.alert[Phil Chodrow]]
 Mathematics UCLA
 @PhilChodrow
]
]
]
 
---

.column.bg-main1[
## Modularity Objective

`$$Q = \sum_k\color{#ff5252}{\beta_k}\left[ \mathbf{cut}_k(\mathbf{z}) - \color{#63d297}{\gamma_k}\sum_{\ell} \mathbf{vol}(\ell)^k\right]$$`

- `$\mathbf{z}$`: cluster labels. 
- `$\mathbf{cut}_k(\mathbf{z})$`: number of `$k$`-edges split by `$\mathbf{z}$`. 
- `$\mathbf{vol}(\ell)$`: sum of degrees in cluster `$\ell$`. 
- `$\color{#ff5252}{\beta_k}$`: importance of edges of size `$k$`. 
- `$\color{#63d297}{\gamma_k}$`: resolution parameter for edges of size `$k$`.

Maximizing `$Q$` approximates maximum-likelihood inference in a certain hypergraph blockmodel.
]

1. All nodes start in their own clusters. 
2. Greedily agglomerate nodes to maximize `$Q$`. 
3. Greedily agglomerate **clusters** of nodes to further maximize `$Q$`.

Re-estimate `$\beta_k$` and `$\gamma_k$`.

Repeat...

<div class="footnote">
 PSC, N. Veldt, A. R. Benson, (2021). Generative hypergraph clustering: from blockmodels to modularity, Science Advances (forthcoming)
</div>

]
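
As a rough illustration, the objective can be evaluated directly from an edge list. The sketch below computes `$Q$` exactly as displayed on this slide; the function name `modularity`, the toy hypergraph, and the values of `beta` and `gamma` are made up for the example and are not taken from the paper.

```python
from collections import defaultdict


def modularity(edges, z, beta, gamma):
    """Q = sum_k beta_k * (cut_k(z) - gamma_k * sum_l vol(l)^k), as displayed on the slide."""
    # Degree of a node = number of hyperedges containing it.
    degree = defaultdict(int)
    for edge in edges:
        for node in edge:
            degree[node] += 1
    # vol(l) = sum of degrees of the nodes assigned to cluster l.
    vol = defaultdict(int)
    for node, d in degree.items():
        vol[z[node]] += d
    # cut_k = number of size-k edges whose nodes span more than one cluster.
    cut = defaultdict(int)
    for edge in edges:
        if len({z[node] for node in edge}) > 1:
            cut[len(edge)] += 1
    return sum(
        beta[k] * (cut[k] - gamma[k] * sum(v ** k for v in vol.values()))
        for k in beta
    )


# Toy hypergraph (hypothetical): two 2-edges and two 3-edges on five nodes.
edges = [(0, 1), (1, 2), (2, 3, 4), (0, 1, 2)]
z = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1}
beta = {2: 1.0, 3: 1.0}       # illustrative values only
gamma = {2: 0.01, 3: 0.001}   # illustrative values only

q = modularity(edges, z, beta, gamma)
print(round(q, 6))
```

Here the volumes are 8 and 2, no 2-edge is split, and one 3-edge is split, so `$Q = (0 - 0.68) + (1 - 0.52) = -0.2$`.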

---

## Experimental Setup

.footnote[**PSC**, N. Veldt, A. R. Benson, (2021). Generative hypergraph clustering: from blockmodels to modularity. *Science Advances (forthcoming)*]

---

## Detectability limits: graphs vs. hypergraphs

.footnote[**PSC**, N. Veldt, A. R. Benson, (2021). Generative hypergraph clustering: from blockmodels to modularity. *Science Advances (forthcoming)*]

---

.column.bg-main1[
## Can we do better?

Unclear! Extant theory on hypergraphs only treats .alert[*uniform*] hypergraphs (all edges same size).

Most hypergraph data sets are non-uniform...

We seek .alert2[detectability theory] for .alert[non-uniform hypergraphs].

]

### (*Papers on Uniform Hypergraphs*)

Lin, C. Y., Chien, I. E., & Wang, I. H. (2017).  *IEEE International Symposium on Information Theory*.

Ghoshdastidar, D., & Dukkipati, A. (2017). *Annals of Statistics*.

Angelini, M. C., Caltagirone, F., Krzakala, F., & Zdeborová, L. (2015), *Allerton Conference*.

]]

---
layout: true
class: split-two with-border middle

.column[
  .split-three[ 
  .row.bg-main1[.content.vmiddle[.font_medium[
.alert[**Detectability thresholds**] for communities in graphs. 
  ]]]     
  .row.bg-main2[.content.vmiddle[.font_medium[
.alert[**Experimental results**] in random hypergraphs. 
  ]]]
  .row.bg-main5[.content.vmiddle[.font_medium[
Steps toward hypergraph theory via .alert[**nonbacktracking operators**]. 
  ]]]
]]

---
class: split-50 bg-main1 
layout: false 
 
.row[ 
.split-three[
.column[ 
 <img src="img/jamie_portrait.jpeg" width=90%> 
 ]
.column[ 
 <img src="img/eikmeier-3.png" width=90%> 
]
.column[ 
 <img src="img/phil_portrait.jpeg" width=90%> 
]

]
]
.row[ 
.split-three[
.column[ 
 .font_large[.alert[Jamie Haddock]]
 Mathematics Harvey Mudd College 
 @jamie_hadd
]
.column[ 
 .font_large[.alert[Nicole Eikmeier]]
 Computer Science Grinnell College 
 @NicoleEikmeier 
]
.column[ 
 .font_large[.alert[Phil Chodrow]]
 Mathematics UCLA
 @PhilChodrow 
]
]
]

---

.column.bg-main1[
## Hypergraph Nonbacktracking

Write `$(e_1, p_1) \rightarrow (e_2, p_2)$` if:

- `$p_1 \in e_1$` and `$p_2 \in e_2$`
- `$p_1 \in e_2 \setminus \{p_2\}$`
- `$e_1 \neq e_2$`

The .alert2[hypergraph nonbacktracking operator] `$\color{#63d297}{\mathbf{B}}$` is the linear operator

.font_smaller[ 
`$$\mathbf{B}[(e_1, p_1), (e_2, p_2)] = \begin{cases} 1 &\quad (e_1, p_1) \rightarrow (e_2, p_2) \\ 
0 &\quad \text{otherwise.}\end{cases}$$`
]]
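
A minimal sketch of this construction for a small hypergraph, following the three conditions above literally. The function name `nonbacktracking_matrix` and the dense list-of-lists storage are choices made for illustration; edges are identified by their position in the input list, so `$e_1 \neq e_2$` means "different list entries".

```python
def nonbacktracking_matrix(edges):
    """Dense 0/1 matrix indexed by pointed edges (edge index, node in that edge)."""
    # One pointed edge per (edge, member node) pair.
    pointed = [(i, p) for i, edge in enumerate(edges) for p in sorted(edge)]
    index = {pe: j for j, pe in enumerate(pointed)}
    B = [[0] * len(pointed) for _ in pointed]
    for (i1, p1) in pointed:
        for (i2, p2) in pointed:
            # (e1, p1) -> (e2, p2): different edges, p1 lies in e2, and p1 != p2.
            if i1 != i2 and p1 in edges[i2] and p1 != p2:
                B[index[(i1, p1)]][index[(i2, p2)]] = 1
    return pointed, B


# Toy hypergraph: a 3-edge and a 2-edge sharing node 2.
edges = [{0, 1, 2}, {2, 3}]
pointed, B = nonbacktracking_matrix(edges)

print(len(pointed))             # number of pointed edges = sum of edge sizes
print(sum(sum(row) for row in B))  # number of allowed transitions
```

In the toy example only the shared node 2 allows moves between the two edges: `(e_0, 2)` can step to `(e_1, 3)`, and `(e_1, 2)` can step to `(e_0, 0)` or `(e_0, 1)`.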

---

.column.bg-main1[
## Hypergraph Nonbacktracking

First formulated by Storm, C. K. (2006). *The Electronic Journal of Combinatorics*.

"Rediscovered" by Angelini, M. C., Caltagirone, F., Krzakala, F., & Zdeborová, L. (2015), *Allerton Conference*.

]

---

.column.bg-main1[
## Hypergraph Nonbacktracking

First formulated by Storm, C. K. (2006). *The Electronic Journal of Combinatorics*.

"Rediscovered" by Angelini, M. C., Caltagirone, F., Krzakala, F., & Zdeborová, L. (2015), *Allerton Conference*.

]

.column[.content.vmiddle[.stretch[
<img src="img/eigen-illustration.png" width=100%> 
]] 
.footnote[**PSC**, J. Haddock, N. Eikmeier, *ongoing work.*]]

---

## Interlude: Faster Eigenvalue Computations

**Theorem [PC, JH, NE '21] (Ihara-Bass Determinant Formula)**: Under mild restrictions, the "interesting" eigenvalues of `$\mathbf{B} \in \mathbb{R}^{m\langle k\rangle \times m\langle k\rangle}$` coincide with the eigenvalues of the matrix 
 
$$
\mathbf{B}' = \left[\begin{matrix}
 
 
\end{matrix}\right] \in \mathbb{R}^{2\bar{k}n\times 2\bar{k}n}\;.
$$

- `$\mathbb{A} \in \mathbb{R}^{\bar{k}n \times \bar{k}n}$` holds "basic" adjacency information for each hyperedge size.
- `$\mathbb{D} \in \mathbb{R}^{\bar{k}n \times \bar{k}n}$` holds node degrees for each hyperedge size. 
- `$\mathbf{K} \in \mathbb{R}^{\bar{k} \times \bar{k}}$` lists possible edge sizes. 
- `$\mathbf{I}_{p} \in \mathbb{R}^{p \times p}$` is the identity matrix.
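
To get a feel for the savings, the two matrix sizes in the theorem can be compared for a given hypergraph. The sketch below assumes that `$m\langle k\rangle$` is the total number of pointed edges (the sum of all edge sizes) and that `$\bar{k}$` counts the distinct edge sizes; both readings are inferred from the slide rather than stated on it, and the edge counts are invented.

```python
def operator_sizes(edge_sizes, n_nodes):
    """Side lengths of B (pointed edges) and of B' (2 * k_bar * n), under the assumptions above."""
    n_pointed = sum(edge_sizes)             # one row of B per (edge, node) pair
    n_distinct_sizes = len(set(edge_sizes))  # assumed meaning of k_bar
    return n_pointed, 2 * n_distinct_sizes * n_nodes


# Hypothetical hypergraph: 500 nodes, edges of sizes 2, 3, and 4.
edge_sizes = [2] * 3000 + [3] * 2000 + [4] * 1000
side_B, side_B_prime = operator_sizes(edge_sizes, n_nodes=500)

print(side_B, side_B_prime)
```

For these invented counts `$\mathbf{B}$` has 16000 rows while `$\mathbf{B}'$` has 3000, and the gap widens as the hypergraph gets denser at fixed `$n$`.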

---

.column.bg-main1[
## Hypergraph Nonbacktracking

.alert[Second eigenvector] correlated with communities in detectable regime. 
 
Surprisingly, this approach can't reach the corners, even though we know detection there to be .alert2[experimentally possible.]

]

.column[.content.vmiddle[.stretch[
<img src="img/vanilla-heatmap.png" width=100%>
]]
.footnote[**PSC**, J. Haddock, N. Eikmeier, *ongoing work.*]]
]

---
layout: true
class: split-two

.column.bg-main1[
## Reaching the Corners

We use a modified nonbacktracking operator to adapt to .alert2[qualitatively distinct roles] for edges of multiple sizes.

The community-correlated eigenvector can be either the first or the second eigenvector of the modified operator.

Under some approximations, we can .alert[analytically derive] the undetectable region:

`$$(2p_2 - 1)^2 c_2  + 2\left(\frac{4p_3 - 1}{3}\right)^2c_3 \leq 1$$`

]
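
The inequality above is easy to evaluate numerically. In the sketch below, `$p_k$` is read as the fraction of `$k$`-edges contained within a single community and `$c_k$` as the mean number of `$k$`-edges per node. That reading is an assumption, though it matches the fact that both terms vanish at the chance levels `$p_2 = 1/2$` and `$p_3 = 1/4$` for two equal communities. The function name `undetectable` is hypothetical.

```python
def undetectable(p2, c2, p3, c3):
    """True when (2*p2 - 1)^2 * c2 + 2 * ((4*p3 - 1) / 3)^2 * c3 <= 1."""
    signal_2 = (2 * p2 - 1) ** 2 * c2            # contribution of 2-edges
    signal_3 = 2 * ((4 * p3 - 1) / 3) ** 2 * c3  # contribution of 3-edges
    return signal_2 + signal_3 <= 1


# Both edge sizes at chance level: no signal, whatever the degrees.
print(undetectable(p2=0.5, c2=10, p3=0.25, c3=10))

# Uninformative 2-edges, perfectly assortative 3-edges: the 3-edges alone suffice.
print(undetectable(p2=0.5, c2=5, p3=1.0, c3=1))
```

The second call shows the point of the heterogeneous setting: one edge size can carry the signal while the other carries none.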

.column[.content.center.vmiddle[.stretch[
 {{content}}
 
]].footnote[**PSC**, J. Haddock, N. Eikmeier, *ongoing work.*]]
---
<img src="img/linearized-bp-heatmap-1-panel-no-label.png" width=100%>
---
<img src="img/linearized-bp-heatmap-1-panel-label.png" width=100%>

---

.column.bg-main1[

# Summing Up

Hypergraphs have a .alert[rich detectability theory] which is still under development.

Hypergraph nonbacktracking operators are likely to play a role in both theory and algorithms.

.alert2[**There's lots left to be done!**]
- Thresholds with degree heterogeneity, more than 2 communities, etc. 
- **Conjecture**: failure of nonbacktracking methods coincides with fundamental .alert2[information-theoretic and computational thresholds].  
]

]

---

.column.bg-main1[

# Summing Up

Hypergraphs have a .alert[rich detectability theory] which is still under development.

Hypergraph nonbacktracking operators are likely to play a role in both theory and algorithms.

.row[ 
.split-two[
.column[.lil-stretch[ 
 <img src="img/jamie_portrait.jpeg" width=80%> 
 ]]
.column[.lil-stretch[ 
 <img src="img/eikmeier-3.png" width=80%> 
]]]]
.row[ 
.split-two[
.column[.lil-stretch[ 
 .alert[Jamie Haddock] Harvey Mudd @jamie_hadd
 ]]
.column[.lil-stretch[ 
 .alert[Nicole Eikmeier] Grinnell @NicoleEikmeier
]]]]

- Leonie Neuhäuser 
- Christopher Blöcker
- Jürgen Hackl
- Y'all!!

]]]