Discussions section in repo #282

Closed Answered by sandrohanea
AncientLust asked this question in Q&A

Hey @AncientLust,
I created the Discussions page and converted this issue to a discussion as well.
Thanks for the suggestion!

Now, to answer your question: yes, you need at least 1000 ms of audio to run inference:

https://github.com/ggerganov/whisper.cpp/blob/8c6a9b8bb6a0273cc0b5915903ca1ff9206c6285/src/whisper.cpp#L5375C5-L5375C39

Based on the logs, it seems you're 10 ms short.

Since you're passing the samples as a Memory, no header is required, so a header shouldn't be the cause of the missing 10 ms.

It seems that 16,000 samples of audio (1 second at 16 kHz) only produce 990 ms of mel spectrogram in the whisper.cpp library (one mel frame is missing):
https://github.com/ggerganov/whisper.cpp/blob/8c6a9b8bb6a0273…
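
To make the missing 10 ms concrete, here is a minimal Python sketch of the arithmetic. It assumes whisper.cpp's usual constants (16 kHz sample rate, 400-sample FFT window, 160-sample hop) and assumes the mel frame count is computed as `1 + (n_samples + n_fft/2 - n_fft) / hop` with integer division; that formula is my reading of the library, not something confirmed in this thread. The function names and the silence-padding helper are hypothetical, and padding is only one possible workaround.

```python
# Assumed whisper.cpp constants (not stated in this thread).
SAMPLE_RATE = 16_000   # samples per second
N_FFT = 400            # analysis window, in samples
HOP_LENGTH = 160       # step between mel frames: 10 ms at 16 kHz
MIN_MS = 1000          # minimum duration accepted for inference


def mel_frames(n_samples: int) -> int:
    """Mel frame count under the assumed formula (integer division)."""
    return 1 + (n_samples + N_FFT // 2 - N_FFT) // HOP_LENGTH


def mel_duration_ms(n_samples: int) -> int:
    """Duration covered by the mel frames, in milliseconds."""
    return mel_frames(n_samples) * HOP_LENGTH * 1000 // SAMPLE_RATE


def min_samples_for(min_ms: int = MIN_MS) -> int:
    """Smallest sample count whose mel duration reaches min_ms."""
    frames_needed = min_ms * SAMPLE_RATE // (HOP_LENGTH * 1000)
    return (frames_needed - 1) * HOP_LENGTH + N_FFT - N_FFT // 2


def pad_with_silence(samples: list[float], target_len: int) -> list[float]:
    """Hypothetical workaround: append zeros until target_len is reached."""
    missing = max(0, target_len - len(samples))
    return samples + [0.0] * missing


one_second = [0.0] * SAMPLE_RATE
padded = pad_with_silence(one_second, min_samples_for())

print(mel_duration_ms(len(one_second)))  # duration for exactly 16,000 samples
print(min_samples_for())                 # samples needed to reach 1000 ms
print(mel_duration_ms(len(padded)))      # duration after padding
```

Under these assumptions, exactly one second of samples falls one mel frame short of the threshold, and a few extra samples of trailing silence are enough to pass the check.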

Answer selected by AncientLust

This discussion was converted from issue #281 on November 26, 2024 19:31.