# Code of LMT


## Parts:
This source includes 7 "*.py" files.

Training: train.py

Inference: test_fasta.py


## Environment: key requirements

* torch  
* numpy   
* pyfastx  
* tqdm  

**Reference versions:**

* torch: 1.13.1  
* numpy: 1.24.2   
* pyfastx: 2.1.0  
* tqdm: 4.64.1  
* python: 3.9.13

## Training  

Example  

```
python train.py -q /path_to_your_primers/your_primer.txt -v /path_to_your_targets/your_target.txt -m /path_to_model_saved/ -n model_name(e.g., amp_models.pt) -b batch_size 
```

### About training data

Both primer and target are ".txt" files, and the paired seqs are written in the same row. For instance, if primer 'A' is written in the first row of "your_primer.txt", its paired target seq 'B' is written in the same row of "your_target.txt". Thus, these two documents are identical in line count. Meanwhile, the mapping table is implicitly encoded within this formatted one-to-one relationship.

## Inference

Example

```
python test_fasta.py -i /path_to_your_test_primers/your_test_primers.fasta -o /path_to_generated_results/ -m /Path_to_your_model/amp_models.pt(e.g., ./models/random37/amp_models.pt) -n name_of_result
```

The results include two files, '.csv' and '.fasta'. 

'.fasta': ID of primer and its generated AMP seq.

'.csv': < ID, primer seq, generated AMP seq. >

## Model

To facilitate immediate use, we also released our basic model LMT_{ran} (./models/random37/amp_models.pt).
