Phi-4-multimodal-instruct

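Phi-4-multimodal-instruct is an open-source multimodal model available on Hugging Face, listed here under automatic speech recognition. Its tags also cover speech summarization, speech translation, and visual question answering. Details are sourced from the public model registry.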

Use cases

  • Building automatic-speech-recognition applications
  • Research and experimentation
  • Open-source AI prototyping

Pros

  • Open weights available
  • Community support on Hugging Face

Cons

  • Requires manual evaluation for production use
  • Licensing terms vary; check the model card

FAQ

What is Phi-4-multimodal-instruct used for?

It is suited to building automatic-speech-recognition applications, to research and experimentation, and to open-source AI prototyping.

Is Phi-4-multimodal-instruct free to use?

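Phi-4-multimodal-instruct is an open-source model published on Hugging Face, so the weights can be downloaded without charge. Whether a particular use is permitted, commercial deployment for example, depends on the license. Check the model card for the specific terms.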

How do I run Phi-4-multimodal-instruct locally?

The model is tagged transformers and custom_code, so it can be loaded with the transformers library once trust_remote_code is enabled, which lets the model code shipped in the repository run. See the model card for framework-specific instructions and hardware requirements.
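
As a rough illustration of the loading step, here is a minimal Python sketch. It assumes the transformers library is installed and that the repository ID is microsoft/Phi-4-multimodal-instruct, which this page does not state and should be confirmed on the model card. The helper names load_kwargs and load_model are hypothetical. The sketch stops at loading. Prompt format and audio preprocessing are model-specific and are left to the model card.

```python
# Assumption: this Hub repository ID is not given on this page;
# confirm it on the model card before use.
MODEL_ID = "microsoft/Phi-4-multimodal-instruct"


def load_kwargs() -> dict:
    """Keyword arguments passed to from_pretrained (hypothetical helper).

    trust_remote_code=True is set because the model is tagged custom_code,
    meaning its modeling code lives in the model repository itself.
    torch_dtype="auto" asks transformers to use the dtype stored in the checkpoint.
    """
    return {"trust_remote_code": True, "torch_dtype": "auto"}


def load_model(model_id: str = MODEL_ID):
    """Download (on first call) and load the processor and the model."""
    # Imported lazily so this file can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoProcessor

    processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(model_id, **load_kwargs())
    return processor, model
```

Calling processor, model = load_model() downloads the weights on first use, so the disk and memory requirements listed on the model card apply. Because trust_remote_code executes code from the repository, it is worth reviewing that code or pinning a specific revision before running it.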

Tags

transformers, safetensors, phi4mm, text-generation, nlp, code, audio, automatic-speech-recognition, speech-summarization, speech-translation, visual-question-answering, phi-4-multimodal, phi, phi-4-mini, custom_code, multilingual, ar, zh, cs, da