# NativQA: Multilingual Culturally-Aligned Natural Query for LLMs

## Initial Setup
Run the following two scripts, in the order shown, to initialize data processing.

```
python bin/serpApi/extended_query_scrapper.py -e envs/api_key.txt -i data/expanded_queries/batch/
python bin/serpApi/extract_completed_query.py -i data/expanded_queries/
```
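
If you repeat these two steps often, a small wrapper can run them in sequence and stop at the first failure. The following is only a sketch: the script paths and flags are copied from the commands above, while the function names `build_setup_commands` and `run_setup` and the `dry_run` switch are hypothetical helpers introduced for illustration and are not part of the repository.

```python
import subprocess
import sys


def build_setup_commands(api_key_file, batch_dir, queries_dir):
    """Return the two initial-setup commands as argument lists.

    The script paths and flags mirror the README commands; this helper
    itself is a hypothetical convenience, not part of the repository.
    """
    return [
        [sys.executable, "bin/serpApi/extended_query_scrapper.py",
         "-e", api_key_file, "-i", batch_dir],
        [sys.executable, "bin/serpApi/extract_completed_query.py",
         "-i", queries_dir],
    ]


def run_setup(commands, dry_run=True):
    """Print each command; execute it unless dry_run is True."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError on a non-zero exit code,
            # so the second script never runs if the first one fails.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    setup_commands = build_setup_commands(
        "envs/api_key.txt",
        "data/expanded_queries/batch/",
        "data/expanded_queries/",
    )
    # Dry run by default: this only prints the commands.
    # Pass dry_run=False to actually execute them.
    run_setup(setup_commands, dry_run=True)
```

Run it from the repository root so the relative script paths resolve.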


## Iterative Processing

Run one of the following scripts to prepare the next set of queries, then repeat the Initial Setup steps.

```
python prepare_next_qa_queries.py -i data/expanded_queries/desired_batch/
python prepare_next_rs_query.py -i data/expanded_queries/desired_batch/
```

After running one of the above scripts, move the files from the `next_batch` folder to `expanded_queries` as required.
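
The move can be done by hand, but a short helper makes it repeatable. This is a minimal sketch, assuming the files to move sit directly inside the source folder; the function name `move_batch_files` is hypothetical, and the exact locations of the `next_batch` and `expanded_queries` folders depend on your checkout, so they are passed in as arguments rather than hard-coded.

```python
import shutil
from pathlib import Path


def move_batch_files(src_dir, dst_dir):
    """Move every regular file from src_dir into dst_dir.

    Returns the moved file names in sorted order. Subdirectories are
    skipped, and an existing file in dst_dir is never overwritten.
    """
    src, dst = Path(src_dir), Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    moved = []
    for path in sorted(src.iterdir()):
        if not path.is_file():
            continue  # leave subdirectories untouched
        target = dst / path.name
        if target.exists():
            raise FileExistsError(f"{target} already exists; refusing to overwrite")
        shutil.move(str(path), str(target))
        moved.append(path.name)
    return moved
```

For example, `move_batch_files("path/to/next_batch", "path/to/expanded_queries")` would move the prepared files, where both paths are placeholders for the actual folders in your setup.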

## Incorporating New Seed Queries
To add new seed queries into the workflow, use:

```
python bin/serpApi/prepare_next_seed_query.py -i data/loc_of_seed_queries
```

## Optional Steps
If you need to merge query results, run:

```
python bin/serpApi/merge_all.py -i data/expanded_queries/
```
