# FL-ICL

## Classification Dataset

| Dataset | #Class   | #Train     | #Test        | #Val      | AlmostBalanced |
| ------- | -------- | ---------: | -----------: | --------- | -------------- |
| SST2    | 2        | 6911       | 1821         |           | $\checkmark$   |
| SST5    | 5        | 8534       | 2210         |           |
| amazon  | 5        | 30000      | 3000         |           | $\checkmark$   |
| Yelp    | 5        | 30000      | 3000         |           | $\checkmark$   |
| MR      | 2        | 8662       | 2000         |           | $\checkmark$   |
| CR      | 2        | 1772       | 1996         |           |                |
| Agnews  | 4        | 29914      | 3000         |           | $\checkmark$   |
| TREC    | 6        | 5381       | 500          |           |                |
| DBPedia | 14       | 10000      | 3000         |           | $\checkmark$   |
| Yahoo   | 10       | 29150      | 3000         |           | $\checkmark$   |
| MNLI    | 3        | 263789     | 3000         | 3000      | $\checkmark$   |
| SNLI    | 3        | 130823     | 3220         | 3216      |                |
| RTE     | 2        | 2490       | use `val`    | 277       | $\checkmark$   |
| Subj    | 2        | 8000       | 2000         |           | $\checkmark$   |
| COLOA   | 2        | 8532       | 527          |           |                |



## Experiments

1. Data Partition: IID & NonIID (class number based)


### Fitst try
For both IID & NonIID case:
1. server sends raw query to all clients
2. Use rule-based retriever to select top-K (1 or 2) samples from each clients
3. All clients send back selected samples to server
4. server use recieved samples (random order) to feed into LLM for final answer
5. calculate classification performance


