% As autism is a functional disorder, structural information cannot be directly used to classify ASD subjects. However, instead of using different phenotypic information such as age, sex and acquisition sites to define connections among the subjects, the `actual similarities' of the brains' hardwares are used in this work. In the past, sMRI data has been used to understand the variability of brain hardware based on age ~\cite{brickman2007structural, su2012predicting}, gender \cite{tyan2017gender} and acquisition sites  \cite{littmann2006acquisition}. This implies that these phenotypic parameters correlate with the structural imaging data. Hence, as opposed to gathering functional data from subjects with similar arbitrary metadata, data from subjects with similar structural representations can be expected to have lower variance. Based on this motivation, we hypothesize that the structural data of the brain (T1 weighted) can provide a better measure of similarity between subjects than the conventionally used phenotypic data. 

Often the dimensionality of structural MRI data is too high to be directly used for calculating the similarity scores. Thus, a highly compressed version is desired that can still contain sufficient information to derive the extent of similarities between different brain data. To achieve this, we use a pretrained VAE\footnote{VAE was trained on the UK Biobank. Information at \url{https://imaging.ukbiobank.ac.uk/}.} to encode the structural image on to a latent space of significantly lower dimensions (a vector of 200 units). % We use a pretrained VAE that has learnt to reduce the dimensionality of structural {\color{dpk} T1-weighted images\footnote{{\color{dpk}The VAE was trained on the UK Biobank and provided by AMC. Information at \url{https://imaging.ukbiobank.ac.uk/}.}} to feature vector of 200  units}. 
Further, cosine similarity is used to determine the adjacency matrix for all the subjects. Note that unlike the adjacency matrix mentioned in Section \ref{sec_gcn_parisot}, the matrix here will not be sparse. Therefore, an automatically determined threshold is applied which ensures that the sparsity is maximized under the constraint that every node on the graph is connected to at least one other node. This has been achieved by setting the threshold to the minimum of all the maximum values of each row (or column). Finally, we create a population graph using feature vectors as in \citet{parisot2018disease}, but using sMRI information to characterize its edges.

