---
hide:
  - navigation
---

## **⏰ TODO in Coming Versions**

- [x] Faster and simpler evaluation pipeline
- [ ] Dynamic dataset
- [ ] More fine-grained datasets
- [ ] Chinese output evaluation
- [ ] Downstream application evaluation


## **Version 0.3.0**

*Release Date: 23rd Apr, 2024*

- **Support parallel retrieval of embeddings when evaluating AdvlInstruction**
- **Add exception handling for partial evaluations**
- **Fixed some bugs**
- **Add evaluation results for ChatGLM3, GLM-4, Mixtral, Llama3-8b, and Llama3-70b ([check out](https://trustllmbenchmark.github.io/TrustLLM-Website/leaderboard.html))**

## **Version 0.2.3 & 0.2.4**

*Release Date: March 2024*

- **Fixed some bugs**
- **Support Gemini API**

## **Version 0.2.2**

*Release Date: 1st Feb, 2024*

- **Support awareness evaluation in our new [work](https://arxiv.org/abs/2401.17882)**
- **Support Zhipu API evaluation (GLM-4 & GLM-3-turbo)**



## **Version 0.2.1**

*Release Date: 26th Jan, 2024*

- **Support LLMs in [replicate](https://replicate.com/) and [deepinfra](https://deepinfra.com/)**
- **Support easy pipeline for evaluation**
- **Support [Azure OpenAI](https://azure.microsoft.com/en-us/products/ai-services/openai-service) API**

## **Version 0.2.0**

*Release Date: 20th Jan, 2024*

- **Add generation section** ([details](https://howiehwong.github.io/TrustLLM/guides/generation_details.html))
- **Support concurrency when using auto-evaluation**



## **Version 0.1.0**

*Release Date: 10th Jan, 2024*

We have released the first version of the TrustLLM assessment tool, which includes all the evaluation methods from our initial research paper.
