You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -55,6 +55,8 @@ Results are reported in Character Error Rate (CER%) for Chinese and Word Error R
55
55
## Usage
56
56
Download model files from [huggingface](https://huggingface.co/fireredteam) and place them in the folder `pretrained_models`.
57
57
58
+
If you want to use `FireRedASR-LLM-L`, you also need to download [Qwen2-7B-Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct) and place it in the folder `pretrained_models`. Then, go to folder `FireRedASR-LLM-L` and run `$ ln -s ../Qwen2-7B-Instruct`
59
+
58
60
59
61
### Setup
60
62
Create a Python environment and install dependencies
- When performing batch beam search with FireRedASR-LLM, please ensure that the input lengths of the utterances are similar. If there are significant differences in utterance lengths, shorter utterances may experience repetition issues. You can either sort your dataset by length or set `batch_size` to 1 to avoid the repetition issue.
140
+
135
141
### Input Length Limitations
136
142
- FireRedASR-AED supports audio input up to 60s. Input longer than 60s may cause hallucination issues, and input exceeding 200s will trigger positional encoding errors.
137
143
- FireRedASR-LLM supports audio input up to 30s. The behavior for longer input is currently unknown.
0 commit comments