--- license: mit pipeline_tag: audio-text-to-text library_name: transformers base_model: ACE-Step/acestep-transcriber base_model_relation: quantized tags: - music - audio - quantized - fp8 - compressed-tensors --- Tech Report # ACE-Step Transcriber This is an FP8 Dynamic quantized variant of [ACE-Step/acestep-transcriber](https://huggingface.co/ACE-Step/acestep-transcriber). ## Description ACE-Step Transcriber is the annotation model used by **ACE-Step v1.5** for training data labeling. It is a powerful multilingual audio transcription model capable of transcribing both **speech** and **singing voice** with high accuracy. ### Key Features - 🌍 **50+ Languages Support** - Covers major world languages and regional dialects - 🎤 **Speech Transcription** - Accurately transcribes spoken content - 🎵 **Singing Voice Transcription** - Specialized in lyrics transcription with musical structure annotations - 🏷️ **Structure Annotation** - Automatically identifies song sections (verse, chorus, bridge, etc.) ## Usage The usage is the same as [Qwen2.5 Omni-7B](https://huggingface.co/Qwen/Qwen2.5-Omni-7B). ### Prompt Format Use the following prompt to transcribe audio: ``` *Task* Transcribe this audio in detail