Metadata-Version: 2.4
Name: fish-audio-preprocess
Version: 0.2.8
Summary: Preprocess audio data
Author-email: Lengyue <lengyue@lengyue.me>
License: Apache
Requires-Python: >=3.9
Description-Content-Type: text/markdown
License-File: LICENSE
Dynamic: license-file

# Fish Audio Preprocessor

[![PyPI Version](https://img.shields.io/pypi/v/fish-audio-preprocess.svg)](https://pypi.python.org/pypi/fish-audio-preprocess)

[中文文档](README.zh.md)

This repo contains some scripts for audio processing. Main features include:

- [x] Video/audio to wav
- [x] Audio vocal separation
- [x] Automatic audio slicing
- [x] Audio loudness matching
- [x] Audio data statistics (supports determining audio length)
- [x] Audio resampling
- [x] Audio transcribe (.lab)
- [x] Audio transcribe via FunASR (use `--model-type funasr` to enable, detailed usage can be found at code)
- [ ] Audio transcribe via WhisperX
- [ ] Merge .lab files (example: `fap merge-lab ./dataset list.txt "{PATH}|spkname|JP|{TEXT}"`)

([ ] indicates not completed, [x] indicates completed)

**This code has been tested on Ubuntu 22.04 / 20.04 + Python 3.10. If you encounter problems on other versions, feedback is welcome.**

## Getting Started:

```
pip install -e .
fap --help
```

## Reference

- [Batch Whisper](https://github.com/Blair-Johnson/batch-whisper)
