bytedance/SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

PythonHTMLaudiomusicspeechspeech-recognitionmulti-modalaudio-processingtsinghua-universitybytedancelarge-language-models
This is stars and forks stats for /bytedance/SALMONN repository. As of 04 May, 2024 this repository has 238 stars and 8 forks.

SALMONN: Speech Audio Language Music Open Neural Network 🚀🚀 Welcome to the repo of SALMONN! SALMONN is a large language model (LLM) enabling speech, audio event, and music inputs, which is developed by the Department of Electronic Engineering of Tsinghua University and ByteDance. Instead of speech-only input or audio-event-only input, SALMONN can perceive and understand all kinds of audio inputs and therefore obtains emerging capabilities such as multilingual speech recognition & translation...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
adrianhajdin/nike_landing_pageJavaScriptCSSHTML5010630
run-llama/modal_finetune_sqlJupyter NotebookPython1800280
opentiny/cross-framework-componentLessJavaScriptVue2000
WHU-MSC/WHUMSC2023NewPythonLOLCODE50890
USNavalResearchLaboratory/FourierOpticsToolBoxMATLABHTML0000
georgia-tech-db/evadb-docsHTMLMDX0000
thingsym/hugo-theme-techdocSCSSCSSHTML18401360
Lakr233/BBackuppSwiftPython3600170
hylo-lang/hyloSwiftPythonOther9390390
Rvn0xsy/usefull-codeASP.NETHTMLJava960180