Segmentation Fault with Distilled Models on CPU when word_timestamps=True #1283


Open
zhitao-zeng opened this issue Apr 8, 2025 · 11 comments

@zhitao-zeng
I am encountering a "Segmentation fault (core dumped)" error when using faster-whisper under specific conditions:

  1. Running inference on the CPU (device='cpu').
  2. Using a distilled Whisper model (e.g., distil-faster-whisper-base-it).
  3. Requesting word-level timestamps (word_timestamps=True).

The error does not occur if:

  1. Word timestamps are disabled (word_timestamps=False).
  2. A standard (non-distilled) fine-tuned faster-whisper model (e.g., base, small; tested with base) is used, even with word_timestamps=True on CPU.
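For reference, the failing call can be sketched like this (a sketch; model_path and audio_path stand in for the local converted model directory and the test audio):

```python
def reproduce(model_path: str, audio_path: str) -> None:
    from faster_whisper import WhisperModel  # pip install faster-whisper

    # CPU + distilled model + word_timestamps=True is the failing combination
    model = WhisperModel(model_path, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(audio_path, word_timestamps=True)
    for segment in segments:  # decoding is lazy; the segfault shows up here
        for word in segment.words:
            print(f"[{word.start:.2f} -> {word.end:.2f}]{word.word}")
```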
@Purfview
Contributor

Purfview commented Apr 8, 2025

> distil-faster-whisper-base-it

Post config.json file of that model.

@zhitao-zeng
Author

> distil-faster-whisper-base-it
>
> Post config.json file of that model.
Here is the model I used
https://huggingface.co/gustavv-andrzejewski/distil-whisper-base-it/blob/main/config.json

@Purfview
Contributor

Purfview commented Apr 8, 2025

> https://huggingface.co/gustavv-andrzejewski/distil-whisper-base-it/blob/main/config.json

This model is for whisper, not for faster-whisper.

@zhitao-zeng
Author

config.json

Oh I see, this file should be the faster-whisper one

@Purfview
Contributor

Purfview commented Apr 8, 2025

> config.json Oh I see, this file should be the faster-whisper one

"alignment_heads" is same as in original whisper https://huggingface.co/openai/whisper-base/blob/main/generation_config.json

I think it should be different for distil model.
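The mismatch is easy to check mechanically: each entry in "alignment_heads" is a [layer, head] pair, and every layer index must be below the model's decoder layer count. A sketch (the head list below is what whisper-base's generation_config.json ships, worth re-checking against the linked file; the distilled layer count of 2 is an assumption, see decoder_layers in the model's config.json):

```python
# alignment_heads as listed in openai/whisper-base's generation_config.json;
# each entry is a [decoder_layer, attention_head] pair.
base_alignment_heads = [[3, 1], [4, 2], [4, 3], [4, 7], [5, 1], [5, 2], [5, 4], [5, 6]]

def invalid_heads(alignment_heads, num_decoder_layers):
    """Pairs whose layer index points past the model's last decoder layer."""
    return [(l, h) for l, h in alignment_heads if l >= num_decoder_layers]

# whisper-base has 6 decoder layers, so the list is fine:
print(invalid_heads(base_alignment_heads, 6))  # -> []

# a distilled model that keeps, say, only 2 decoder layers invalidates all of them:
print(invalid_heads(base_alignment_heads, 2))  # -> all 8 pairs
```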

@zhitao-zeng
Author

> > config.json Oh I see, this file should be the faster-whisper one
>
> "alignment_heads" is the same as in the original whisper: https://huggingface.co/openai/whisper-base/blob/main/generation_config.json
>
> I think it should be different for the distil model.

That's what I got directly from the following code:

```python
from ctranslate2.converters import TransformersConverter

converter = TransformersConverter(model_path)
converted_model_path = converter.convert(output_dir, quantization="int8", force=True)
```

@Purfview
Contributor

Purfview commented Apr 9, 2025

Did you check in original Whisper if word_timestamps=True works there with this model?

@zhitao-zeng
Author

zhitao-zeng commented Apr 10, 2025

> Did you check in original Whisper if word_timestamps=True works there with this model?

Thanks for the suggestion. I've tested this further.

Using the Hugging Face transformers library:

  • The distilled (gustavv-andrzejewski/distil-whisper-base-it), fine-tuned, and original base models all correctly produce word timestamps when requested (return_timestamps=True). The output format includes inline time markers within the text (e.g., <|time|>word).

Using the faster-whisper library (with CTranslate2 models):

  • The CTranslate2 versions of fine-tuned and original base models work correctly with word_timestamps=True on CPU, providing detailed Word objects (with start/end times and probability).
  • However, the CTranslate2 version of the distilled model (gustavv-andrzejewski/distil-whisper-base-it) still causes a segmentation fault when word_timestamps=True is used on the CPU. It works if timestamps are off or if a non-distilled model is used.

Using the openai-whisper library:

  • The fine-tuned and original base models work correctly with word_timestamps=True on CPU, providing detailed Word objects (with start/end times and probability).
  • However, the distilled model cannot be loaded successfully because of missing key(s) in state_dict. Since this distilled model uses the Hugging Face framework, it is hard for me to convert it and load it into the standard openai-whisper framework. I would need to construct the model and load the weights manually, and I am not sure whether that is feasible.
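For what it's worth, the "missing key(s) in state_dict" message is PyTorch's standard strict-loading check; passing strict=False reports the mismatch instead of raising. A toy illustration of the mechanism (not the Whisper model itself):

```python
import torch.nn as nn

class Small(nn.Module):
    def __init__(self):
        super().__init__()
        self.a = nn.Linear(2, 2)

class Big(Small):
    def __init__(self):
        super().__init__()
        self.b = nn.Linear(2, 2)  # extra layer the checkpoint lacks

# Loading a smaller checkpoint into a bigger model reports the gap
# instead of raising RuntimeError("Missing key(s) in state_dict ...").
result = Big().load_state_dict(Small().state_dict(), strict=False)
print(result.missing_keys)  # -> ['b.weight', 'b.bias']
```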

@Purfview
Contributor

Can you share that ct2 model?

@zhitao-zeng
Author

@sssshhhhhh

In transformers, word timestamps require return_timestamps='word', not True. This model also errors in hf and openai whisper because the distil process removes layers, which makes the alignment heads refer to heads in nonexistent layers.
