this paper was accepted at the adaptive foundation models (afm) workshop at neurips 2024. follow-up conversations with virtual assistants (vas) enable a user to seamlessly interact with a va...
本文讨论了在neurips 2024自适应基础模型研讨会上提出的设备导向语音检测(ddsd)方法。该方法通过建模首次查询,结合大型语言模型(llms)和自动语音识别(asr)不确定性,提升了后续对话的自然交互体验。研究表明,该方法在真实数据集上显著降低了误报率。
