microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ 6B β’ Updated May 1 β’ 388k β’ 1.55k