\section{Introduction}
Graph neural networks (GNNs) are powerful tools for learning graph-structured data in various domains ~\citep{kipf2016semi,velivckovic2017graph}. GNNs use the graph's adjacency matrix to aggregate node information from their neighbors, effectively acting as a low-pass filter that smooths graph signals ~\citep{nt2019revisiting}.
They have shown remarkable success in supervised and semi-supervised learning, where task-specific labels are available. However, obtaining high-quality labels can be costly in many domains, spurring interest in self-supervised learning on graphs to learn representations without supervision ~\citep{velickovic2019deep,peng2020graph,qiu2020gcc,hassani2020contrastive,zhu2020beyond}.

Among these self-supervised methods, Contrastive Learning (CL) has demonstrated remarkable success ~\citep{velickovic2019deep,peng2020graph,qiu2020gcc,hassani2020contrastive,zhu2020beyond}. Graph CL methods first augment the input graph, either by altering node features or the graph topology. Then, they learn representations by contrasting the augmented graph views encoded with a GNN-based encoder.
Existing graph CL methods perform well under homophily, where neighboring nodes often share the same label. However, they perform poorly on heterophilic graphs, where connected nodes tend to belong to different classes %and have dissimilar features 
\citep{zhu2020beyond}. 
%%%%%%%%%%%%%%%%%%%%%%%%%%
% When labels are available, they guide how to aggregate information from different nodes in a neighborhood ~\citep{bo2021beyond,luan2020complete,pei2020geom,wang2019demystifying,zhu2020graph}. Without label information, however, the aggregation strategy is not as clear. Moreover, even if one can identify edges connecting nodes with the same label, the number of such edges may be too few to learn high-quality representations under heterophily. 
Indeed, for learning rich representations in graphs with heterophily, %it is critical to aggregate information from
contrasting augmented views of every node is not enough, but it is crucial to
%neighbors with the same label, and
\textit{differentiate} representation of node with different labels ~\citep{bo2021beyond,luan2020complete}. However, \textit{without label} information, it is not clear how this can be achieved.

\begin{figure*}[t]
\vspace{-2mm}
\centering
\includegraphics[width=0.8\textwidth]{Fig/hlcl.png}
\vspace{-3mm}
\caption{\alg identifies a homophilic and a heterophilic subgraph $\mathcal{G}^{hom}, \mathcal{G}^{het}$, and generates two augmentations for each subgraph. Then, it applies low-pass filters $\pmb{F}_{LP}, \tilde{\pmb{F}}_{LP}$ to the augmented homophilic subgraphs and high-pass filters $\pmb{F}_{HP}, \tilde{\pmb{F}}_{HP}$ to the augmented heterophilic subgraphs, to generate low-pass $\pmb{H}_L, \tilde{\pmb{H}}_L$ and high-pass $\pmb{H}_H, \tilde{\pmb{H}}_H$ filtered views, using the same encoder $\pmb{W}$. \alg learns the final representations by contrasting the projected low-pass filtered augmented views $\pmb{z}^L, \tilde{\pmb{z}}^L$ and the high-pass filtered augmented views $\pmb{z}^h, \tilde{\pmb{z}}^h$ of every node.}  \label{fig:method}
\vspace{-2mm}
\end{figure*}

In this work, we %address the above challenges by 
propose an effective graph CL method, namely \alg, for learning node representations under heterophily. %The key concept of \alg involves selecting
\alg first uses nodes' feature similarity to identify a homophilic and heterophilic subgraph in the original graph. %to form a homophilic subgraph with edges that connect nodes of the same label, and a heterophilic subgraph with edges connecting nodes of different labels. \alg uses nodes' feature similarity as a proxy for their label similarity to build these subgraphs. 
Then, for each subgraph, it generates two augmented graph views, %via random graph perturbations, 
and applies a high-pass filter to the heterophilic subgraphs and a low-pass filter to the homophilic subgraphs.
%learning high-pass and low-pass filtered views for every node. 
The final representations are learned by contrasting the augmented high-pass filtered views and contrasting the augmented low-pass filtered views of each node, using the \textit{same GNN encoder}, as illustrated in Fig. \ref{fig:method}. %This method combines smooth and non-smooth components, maximizing their agreement to enhance representation quality. As a result,
In doing so, \alg achieves state-of-the-art performance under heterophily, surpassing graph \textit{supervised learning} methods and yielding comparable performance to state-of-the-art graph CL methods under homophily. %We demonstrate the effectiveness of our \alg framework through extensive experiments on graphs with heterophily and homophily for unsupervised representation learning, under the linear evaluation protocol. Our results
In addition, we prove that the learned representations by \alg encode both low-frequency and high-frequency information.
% \alg learns invariance between two contrasted augmentations from the spectrum domain. 
% small enough values of the contrastive loss ensures that the two contrasted augmentations are invariant at frequency $i$.

Our extensive experiments show that on seven benchmark datasets, \alg outperforms existing graph CL methods by up to 7\% and graph supervised learning methods by up to 10\% under heterophily, while maintaining comparable performance under homophily. Additionally, \alg scales well to large graphs like Penn94, outperforming other graph CL methods by up to 5\%. 

In summary, our contributions are as follows: %\ba{can cut}
% \hy{Our contribution compared to other existing self-supervised learning (SSL) methods is threefold:}
\begin{itemize}[label={$\bullet$}, itemsep=0pt, topsep=0pt, parsep=0pt, partopsep=0pt, left=1em]
\item \textit{Graph CL with graph filters.} 
\alg is the first graph CL method that utilizes graph filters, and combines high-pass and low-pass filtered representations using contrastive losses. This approach enables learning rich representations under heterophily.
\item \textit{Careful aggregation.} %Unique Aggregation Strategy:
\alg identifies a homophilic and a heterophilic subgraph
%two subgraphs with high and low homophily ratios 
based on node features or representations, for effective information aggregation. %This strategy is unique to HLCL among existing SSL methods.
\item \textit{Theoretical analysis.} %Theoretical Groundwork: 
By analyzing \alg, we theoretically prove that \alg learns the invariance information from both homophilic and heterophilic subgraphs.
% the first theoretical framework for graph CL under heterophily, and confirm the effectiveness of \alg\ with extensive experiments under homophily and heterophily.
% which we hope will pave the way for more comprehensive theoretical analyses of SSL from graphs.
\item \textit{Extensive experiments.} Empirically, we confirm that \alg achieves state-of-the-art under heterophily and a competitive performance under homophily.
\end{itemize}