(1) network structure:
NN_FLM(
  (_NN_FLM__fea_ext_net): FullConnectNetwork(
    (_FullConnectNetwork__net): Sequential(
      (Fea-Ext-FCN-layer-1-Linear): Linear(in_features=21, out_features=128, bias=True)
      (Fea-Ext-FCN-layer-1-Activation): LeakyReLU(negative_slope=0.01)
      (Fea-Ext-FCN-layer-2-Linear): Linear(in_features=128, out_features=64, bias=True)
      (Fea-Ext-FCN-layer-2-Activation): LeakyReLU(negative_slope=0.01)
      (Fea-Ext-FCN-layer-3-Linear): Linear(in_features=64, out_features=32, bias=True)
      (Fea-Ext-FCN-layer-3-Activation): Softmax(dim=1)
    )
  )
  (_NN_FLM__BFR_net): BinaryFuzzyRelationNetwork()
  (_NN_FLM__FP_loss): FuzzyPermissibleLoss(alpha=0.2, beta=0.8)
  (_NN_FLM__L2_regular): L2NormRegularization()
)

(2) network parameters:
para 1	torch.Size([128, 21])	_NN_FLM__fea_ext_net._FullConnectNetwork__net.Fea-Ext-FCN-layer-1-Linear.weight
para 2	torch.Size([128])	_NN_FLM__fea_ext_net._FullConnectNetwork__net.Fea-Ext-FCN-layer-1-Linear.bias
para 3	torch.Size([64, 128])	_NN_FLM__fea_ext_net._FullConnectNetwork__net.Fea-Ext-FCN-layer-2-Linear.weight
para 4	torch.Size([64])	_NN_FLM__fea_ext_net._FullConnectNetwork__net.Fea-Ext-FCN-layer-2-Linear.bias
para 5	torch.Size([32, 64])	_NN_FLM__fea_ext_net._FullConnectNetwork__net.Fea-Ext-FCN-layer-3-Linear.weight
para 6	torch.Size([32])	_NN_FLM__fea_ext_net._FullConnectNetwork__net.Fea-Ext-FCN-layer-3-Linear.bias

(3) trade-off parameters:
gamma_FPL=	1.0
gamma_fea_ext_net_l2reg=	0.1

(4)
number of parameter in network=	6
number of parameter in optimizer=	6
