# Group think training repo




### Training

The training script of GT model is in `train/sft_text_infilling.py`

### Inference 

The inference script of a GT model is in `inference/run_inference_text_infilling.py`

### Evaluation

The evaluation script of a GT model is in `evaluate/evaluate.py`


### Example
We use bidirectional attention mask to train the model. See the attention mask example below.

![groupthink_attention_mask](groupthink_attention_mask.png)

The position ids of the path blocks are shifted. For example


``` bash
====================================================================================================
TOKEN vs POSITION ID COMPARISON
====================================================================================================
Index  Token                     Token ID   Position ID  Segment        
----------------------------------------------------------------------------------------------------
0      <|im_start|>              151644     0            prompt         
1      system                    8948       1            prompt         
2      \n                        198        2            prompt         
3      You                       2610       3            prompt         
4       are                      525        4            prompt         
5       participating            23528      5            prompt         
6       in                       304        6            prompt         
7       a                        264        7            prompt         
8       group                    1874       8            prompt         
9       think                    1744       9            prompt         
10      session                  3797       10           prompt         
11      where                    1380       11           prompt         
12      multiple                 5248       12           prompt         
13      thinkers                 68022      13           prompt         
14      are                      525        14           prompt         
15      answering                35764      15           prompt         
16      the                      279        16           prompt         
17      question                 3405       17           prompt         
18      in                       304        18           prompt         
19      parallel                 15279      19           prompt         
20      resulting                12942      20           prompt         
21      in                       304        21           prompt         
22      concurrent               34035      22           prompt         
23      thinking                 7274       23           prompt         
24      paths                    12716      24           prompt         
25     .                         13         25           prompt         
26     <|im_end|>                151645     26           prompt         
27     \n                        198        27           prompt         
28     <|im_start|>              151644     28           prompt         
29     user                      872        29           prompt         
30     \n                        198        30           prompt         
31     Distance                  14778      31           prompt         
32      between                  1948       32           prompt         
33      Taipei                   96345      33           prompt         
34      and                      323        34           prompt         
35      London                   7148       35           prompt         
36     ?                         30         36           prompt         
37     <|im_end|>                151645     37           prompt         
38     \n                        198        38           prompt         
39     <|im_start|>              151644     39           prompt         
40     assistant                 77091      40           prompt         
41     \n                        198        41           prompt         
42     <Parallel>                151665     42           parallel_start 
43     <Path>                    151667     43           path_0_start   
44     Maybe                     21390      44           path_0_content 
45                               220        45           path_0_content 
46     6                         21         46           path_0_content 
47     0                         15         47           path_0_content 
48     0                         15         48           path_0_content 
49     0                         15         49           path_0_content 
50      km                       13136      50           path_0_content 
51     ?                         30         51           path_0_content 
52      Answer                   21806      52           path_0_content 
53      is                       374        53           path_0_content 
54      \                        1124       54           path_0_content 
55     boxed                     79075      55           path_0_content 
56     {                         90         56           path_0_content 
57     6                         21         57           path_0_content 
58     0                         15         58           path_0_content 
59     0                         15         59           path_0_content 
60     0                         15         60           path_0_content 
61      km                       13136      61           path_0_content 
62     }                         92         62           path_0_content 
63     </Path>                   151668     63           path_0_end     
64     <Path>                    151667     3000         path_1_start   
65     I                         40         3001         path_1_content 
66      don                      1513       3002         path_1_content 
67     't                        944        3003         path_1_content 
68      know                     1414       3004         path_1_content 
69     ...                       1112       3005         path_1_content 
70      Wait                     13824      3006         path_1_content 
71      thinker                  97536      3007         path_1_content 
72                               220        3008         path_1_content 
73     1                         16         3009         path_1_content 
74      knows                    8788       3010         path_1_content 
75     !                         0          3011         path_1_content 
76     </Path>                   151668     3012         path_1_end     
77     <Path>                    151667     6000         path_2_start   
78     I                         40         6001         path_2_content 
79      will                     686        6002         path_2_content 
80      delegate                 13380      6003         path_2_content 
81      others                   3800       6004         path_2_content 
82      to                       311        6005         path_2_content 
83      think                    1744       6006         path_2_content 
84      about                    911        6007         path_2_content 
85      the                      279        6008         path_2_content 
86      geometry                 17047      6009         path_2_content 
87      of                       315        6010         path_2_content 
88      earth                    9393       6011         path_2_content 
89     .                         13         6012         path_2_content 
90     </Path>                   151668     6013         path_2_end     
91     <Path>                    151667     9000         path_3_start   
92     What                      3838       9001         path_3_content 
93      abstract                 8115       9002         path_3_content 
94      idea                     4522       9003         path_3_content 
95      can                      646        9004         path_3_content 
96      ...                      2503       9005         path_3_content 
97      oh                       14019      9006         path_3_content 
98      Wait                     13824      9007         path_3_content 
99     ...                       1112       9008         path_3_content 
100     Maybe                    10696      9009         path_3_content 
101     I                        358        9010         path_3_content 
102     should                   1265       9011         path_3_content 
103     verify                   10146      9012         path_3_content 
104     thinker                  97536      9013         path_3_content 
105                              220        9014         path_3_content 
106    1                         16         9015         path_3_content 
107    's                        594        9016         path_3_content 
108     solution                 6291       9017         path_3_content 
109    </Path>                   151668     9018         path_3_end     
110    </Parallel>               151666     9019         parallel_end   

====================================================================================================
```


