Batch transcription ? #52
Replies: 14 comments 31 replies
-
Yes definitly, this feauture will come. But it may take a few weeks, see: #50 (reply in thread) |
Beta Was this translation helpful? Give feedback.
-
|
Many thx for noScribe! I have a similar use case but with running noScribe on the HPC cluster. Is it possible to run the transcription process from the command line? |
Beta Was this translation helpful? Give feedback.
-
|
Is the batch feature already implemented? |
Beta Was this translation helpful? Give feedback.
-
|
I am not able to run noScribe form source. |
Beta Was this translation helpful? Give feedback.
-
|
On Windows, sorry
kaixxx ***@***.***> schrieb am Mo. 8. Sept. 2025 um 22:00:
… Are you on Windows, macOS or Linux?
—
Reply to this email directly, view it on GitHub
<#52 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/BVQA4CHZ37EN4XR3QEDNTM33RXN7HAVCNFSM6AAAAACF5FFMLOVHI2DSMVQWIX3LMV43URDJONRXK43TNFXW4Q3PNVWWK3TUHMYTIMZUGQYDQMQ>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
|
Beta versions of 0.7 are available here: https://drive.switch.ch/index.php/s/EIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7 Batch transcription is integrated in the UI. Just start several jobs, and they will be sent to a queue and processed one after the other. You can also select more than one audio file in the file dialog to create multiple jobs at once. To see the CLI-options, just run |
Beta Was this translation helpful? Give feedback.
-
|
Hi, Using the beta version avalaible here (https://drive.switch.ch/index.php/s/EIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7) I'm getting an error. Here is what I'm doing, in cmd launched from folder where noscribe 0.7 beta is installed : noScribe --language fr --model precise --speaker-detection 2 --overlapping --timestamps --disfluencies "Path\to\recording\2025-11-12 - recording.mp3" "Path\to\recording\2025-11-12 - recording.txt" And here is the output in commandline : Traceback (most recent call last): In a similar way, when I try to run GUI here is what I get : Traceback (most recent call last): Details concerning my configuration :
The patch I found : within noScribe\config.yml, set locale to en => then it works. |
Beta Was this translation helpful? Give feedback.
-
|
So far, I have not been able to successfully complete a transcription with this version. With an mp3 file in German, the transcription breaks off at some point and no html transcript is available. Hier der Ausgabe-Text: === Warteschlange wird gestartet === Starte Auftrag: xxx.mp3 Audioumwandlung... Sprecher:innen identifizieren... Transkription... === Warteschlangenverarbeitung abgeschlossen === |
Beta Was this translation helpful? Give feedback.
-
|
I discovered that version 0.7 beta is okay on another PC. The "bad PC" has very restricted admin rights. Perhaps that caused the problems. |
Beta Was this translation helpful? Give feedback.
-
|
I was able to test the batch processing of version 0.7 in more detail. It is very helpful and works well overall. However, with a large number of files (400 in my case), there is a significant number of aborted processes, approximately 30 percent. ("subprocess aborted unexpectedly") Is there an error log in the background that I could provide? |
Beta Was this translation helpful? Give feedback.
-
|
Here is one exemplary error message of a job that was aborted. ERROR: Transcription worker exited unexpectedly (code 3221226505). I noticed that it is the same error as in this discussion #257 |
Beta Was this translation helpful? Give feedback.
-
|
This is what I get: `Transkription... Could not locate cudnn_ops64_9.dll. Please make sure it is in your library path! During handling of the above exception, another exception occurred: Traceback (most recent call last): === Warteschlangenverarbeitung abgeschlossen === |
Beta Was this translation helpful? Give feedback.
-
|
I continued experimenting. 1. With large batch processing, the number of erroneous interruptions increased after a certain amount of time. 2. Individual repetitions were partially successful, so it is probably not solely due to the transcribed audio file. 3. I also installed the latest Nvidia drivers. That did not change anything. If I understand correctly, the change in the yml file is only a temporary workaround that dispenses with CUDA and GPU, right? Translated with DeepL.com (free version) |
Beta Was this translation helpful? Give feedback.
-
|
Could the CUDA version be a factor? I have installed the latest CUDA version, 13.1. And I have seen that 12.8 is apparently more ideal for faster-whisper. Is 31.1 possibly not fully compatible? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I am a fan of Noscribe. I do a lot of chinese transcription and it has the best quality transcription compared to all other whisper interfaces I used so far. Is it possible to add a batch convert function to convert multiple audio files one after another?
Beta Was this translation helpful? Give feedback.
All reactions