Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Fetch upstream #6

Draft
wants to merge 183 commits into
base: tokenize_hotwords_flag_master
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
183 commits
Select commit Hold shift + click to select a range
59407ed
C++ API for speaker diarization (#1396)
csukuangfj Oct 9, 2024
8535b1d
Python API for speaker diarization. (#1400)
csukuangfj Oct 9, 2024
d468527
C API for speaker diarization (#1402)
csukuangfj Oct 9, 2024
9765412
docs(nodejs-addon-examples): add guide for pnpm user (#1401)
YogiLiu Oct 9, 2024
df681e9
Go API for speaker diarization (#1403)
csukuangfj Oct 9, 2024
1571344
Swift API for speaker diarization (#1404)
csukuangfj Oct 9, 2024
bd50e79
Update readme to include more external projects using sherpa-onnx (#1…
csukuangfj Oct 10, 2024
a45e5db
C# API for speaker diarization (#1407)
csukuangfj Oct 10, 2024
67349b5
JavaScript API (node-addon) for speaker diarization (#1408)
csukuangfj Oct 10, 2024
1d061df
WebAssembly exmaple for speaker diarization (#1411)
csukuangfj Oct 10, 2024
f1b311e
Handle audio files less than 10s long for speaker diarization. (#1412)
csukuangfj Oct 11, 2024
eefc172
JavaScript API with WebAssembly for speaker diarization (#1414)
csukuangfj Oct 11, 2024
2d412b1
Kotlin API for speaker diarization (#1415)
csukuangfj Oct 11, 2024
1851ff6
Java API for speaker diarization (#1416)
csukuangfj Oct 11, 2024
1ed803a
Dart API for speaker diarization (#1418)
csukuangfj Oct 11, 2024
5e273c5
Pascal API for speaker diarization (#1420)
csukuangfj Oct 12, 2024
94b26ff
Android JNI support for speaker diarization (#1421)
csukuangfj Oct 12, 2024
5a22f74
Android demo for speaker diarization (#1423)
csukuangfj Oct 13, 2024
99f320b
Release v1.10.28 (#1424)
csukuangfj Oct 13, 2024
df4150d
Upload speaker embedding models to huggingface (#1428)
csukuangfj Oct 14, 2024
77dd5f7
Update README.md (#1431)
semxum Oct 14, 2024
593b967
Add Go API for offline punctuation models (#1434)
csukuangfj Oct 16, 2024
471cbd8
updated onnxruntime-linux-aarch64.cmake so that libonnxruntime.so can…
shawl336 Oct 16, 2024
620597f
Support https://huggingface.co/Revai/reverb-diarization-v1 (#1437)
csukuangfj Oct 17, 2024
4783c8f
fix "log10" compile error by import CMATH lib (#1438)
Zazzle516 Oct 17, 2024
e0586f1
add more models for speaker diarization (#1440)
csukuangfj Oct 17, 2024
1af8ad8
Add Java API example for hotwords. (#1442)
csukuangfj Oct 18, 2024
bcaa91e
update java for hotword jar (#1444)
YeyuchenBa Oct 18, 2024
3edd8d7
add java android demo (#1454)
JameWade Oct 23, 2024
effd5ef
Add C++ API for streaming ASR. (#1455)
csukuangfj Oct 23, 2024
ceb69eb
Add C++ API for non-streaming ASR (#1456)
csukuangfj Oct 23, 2024
b3e05f6
Fix style issues (#1458)
csukuangfj Oct 24, 2024
a5295aa
Handle NaN embeddings in speaker diarization. (#1461)
csukuangfj Oct 24, 2024
2b40079
Add speaker identification with VAD and non-streaming ASR using ALSA …
Peakyxh Oct 24, 2024
b41f6d2
Support GigaAM CTC models for Russian ASR (#1464)
csukuangfj Oct 25, 2024
707cf79
Add GigaAM NeMo transducer model for Russian ASR (#1467)
csukuangfj Oct 25, 2024
d5a2f52
Release v1.10.29 (#1468)
csukuangfj Oct 25, 2024
3d6344e
Fix building node-addon for Windows x86. (#1469)
csukuangfj Oct 25, 2024
b06b460
Begin to support https://github.com/usefulsensors/moonshine (#1470)
csukuangfj Oct 26, 2024
0f2732e
Publish pre-built JNI libs for Linux aarch64 (#1472)
csukuangfj Oct 26, 2024
669f5ef
Add C++ runtime and Python APIs for Moonshine models (#1473)
csukuangfj Oct 26, 2024
bd4b223
Add Kotlin and Java API for Moonshine models (#1474)
csukuangfj Oct 26, 2024
2ca2985
Add C and C++ API for Moonshine models (#1476)
csukuangfj Oct 26, 2024
4a4659a
Add Swift API for Moonshine models. (#1477)
csukuangfj Oct 27, 2024
052b864
Add Go API examples for adding punctuations to text. (#1478)
csukuangfj Oct 27, 2024
3d3edab
Add Go API for Moonshine models (#1479)
csukuangfj Oct 27, 2024
6f261d3
Add JavaScript API for Moonshine models (#1480)
csukuangfj Oct 27, 2024
54468a7
Add Dart API for Moonshine models. (#1481)
csukuangfj Oct 27, 2024
cdd8e1b
Add Pascal API for Moonshine models (#1482)
csukuangfj Oct 27, 2024
3622104
Add C# API for Moonshine models. (#1483)
csukuangfj Oct 27, 2024
91e090f
Release v1.10.30 (#1484)
csukuangfj Oct 27, 2024
9eb493f
Publish pre-built wheels for Python 3.13 (#1485)
csukuangfj Oct 28, 2024
36a0e78
Add some commonly used models to README.md (#1486)
csukuangfj Oct 28, 2024
72dc68c
fix typo (#1488)
pengzhendong Oct 28, 2024
356da3b
Publish pre-built macos xcframework (#1490)
csukuangfj Oct 29, 2024
d9c586c
Removed unused TTS example code in .Net examples (#1492)
csukuangfj Oct 29, 2024
d9f65c9
Update pybind11 to support numpy 2.0 (#1493)
csukuangfj Oct 29, 2024
9fa3bc4
Fix reading tokens.txt on Windows. (#1497)
csukuangfj Oct 30, 2024
a3c89aa
Add two-pass ASR Android APKs for Moonshine models. (#1499)
csukuangfj Oct 31, 2024
9ab89c3
Support building GPU-capable sherpa-onnx on Linux aarch64. (#1500)
csukuangfj Nov 1, 2024
c5205f0
Add an example for computing RTF about streaming ASR. (#1501)
csukuangfj Nov 1, 2024
f0cced1
Publish pre-built wheels with CUDA support for Linux aarch64. (#1507)
csukuangfj Nov 3, 2024
6ee8c99
Fix building (#1508)
csukuangfj Nov 3, 2024
4eeb336
Export the English TTS model from MeloTTS (#1509)
csukuangfj Nov 3, 2024
86b1856
Reduce vad-sense-voice example code. (#1510)
whyb Nov 5, 2024
f94cca7
Fix: Reset sample-buffer after processing (#1521)
iteamvep Nov 8, 2024
f97daed
Fixes #1512 (#1522)
csukuangfj Nov 8, 2024
4fab3f2
Revert: [#1521] No need to reset sample-buffer (#1524)
iteamvep Nov 8, 2024
a16c9af
Add Lazarus example for Moonshine models. (#1532)
csukuangfj Nov 12, 2024
3f777b3
Add isolate_tts demo (#1529)
Spicely Nov 12, 2024
8436ba8
Add WebAssembly example for VAD + Moonshine models. (#1535)
csukuangfj Nov 13, 2024
c34ab35
Add Android APK for streaming Paraformer ASR (#1538)
csukuangfj Nov 14, 2024
b28b0c8
Support static build for windows arm64. (#1539)
csukuangfj Nov 15, 2024
e993c08
fix windows build (#1546)
endink Nov 16, 2024
9a48012
Use xcframework for Flutter iOS plugin. (#1547)
csukuangfj Nov 16, 2024
e424cc9
Support cross-compiling for HarmonyOS (#1553)
csukuangfj Nov 20, 2024
31d6206
HarmonyOS support for VAD. (#1561)
csukuangfj Nov 24, 2024
a4b79f0
Fix flutter ios (#1563)
csukuangfj Nov 26, 2024
298b6b6
Add non-streaming ASR support for HarmonyOS. (#1564)
csukuangfj Nov 26, 2024
2101227
Add streaming ASR support for HarmonyOS. (#1565)
csukuangfj Nov 26, 2024
109fb79
fix building for Android (#1568)
csukuangfj Nov 27, 2024
315d8e2
Publish `sherpa_onnx.har` for HarmonyOS (#1572)
csukuangfj Nov 28, 2024
f3f8961
Add VAD+ASR demo for HarmonyOS (#1573)
csukuangfj Nov 28, 2024
be159f9
Fix publishing har packages for HarmonyOS (#1576)
csukuangfj Nov 29, 2024
299f239
Add CI to build HAPs for HarmonyOS (#1578)
csukuangfj Nov 29, 2024
c9d3b6c
Add microphone demo about VAD+ASR for HarmonyOS (#1581)
csukuangfj Nov 30, 2024
a3d6e1a
Fix getting microphone permission for HarmonyOS VAD+ASR example (#1582)
csukuangfj Nov 30, 2024
dc3287f
Add HarmonyOS support for text-to-speech. (#1584)
csukuangfj Dec 1, 2024
0d6bf52
fix: support both old and new websockets request headers format (#1588)
JiayuXu0 Dec 3, 2024
47a2dd4
'update20241203' (#1589)
goddamnVincent Dec 4, 2024
74a8735
Add on-device tex-to-speech (TTS) demo for HarmonyOS (#1590)
csukuangfj Dec 4, 2024
9352ccf
Release v1.10.33 (#1591)
csukuangfj Dec 4, 2024
84821b1
Fix building node-addon package (#1598)
csukuangfj Dec 6, 2024
91a43cc
Update doc links for HarmonyOS (#1601)
csukuangfj Dec 6, 2024
a743a44
Add on-device real-time ASR demo for HarmonyOS (#1606)
csukuangfj Dec 9, 2024
314545f
Add speaker identification APIs for HarmonyOS (#1607)
csukuangfj Dec 9, 2024
14944d8
Add speaker identification demo for HarmonyOS (#1608)
csukuangfj Dec 10, 2024
1bae408
Add speaker diarization API for HarmonyOS. (#1609)
csukuangfj Dec 10, 2024
914cbad
Add speaker diarization demo for HarmonyOS (#1610)
csukuangfj Dec 10, 2024
e011e84
Release v1.10.34 (#1611)
csukuangfj Dec 10, 2024
9d4659f
Add missing changes about speaker identfication demo for HarmonyOS (#…
csukuangfj Dec 11, 2024
4dc4f1a
Provide sherpa-onnx.aar for Android (#1615)
csukuangfj Dec 12, 2024
be87f86
Use aar in Android Java demo. (#1616)
csukuangfj Dec 12, 2024
0f4b1f4
🔧 build(portaudio-go): Fixed version 1.0.3 (#1614)
deretame Dec 12, 2024
e54c1f4
Release v1.10.35 (#1617)
csukuangfj Dec 12, 2024
efb505f
Update AAR version in Android Java demo (#1618)
csukuangfj Dec 12, 2024
e639c70
Support linking onnxruntime statically for Android (#1619)
csukuangfj Dec 14, 2024
ed8d8e4
Update readme to include Open-LLM-VTuber (#1622)
csukuangfj Dec 16, 2024
5cc60de
Rename maxNumStences to maxNumSentences (#1625)
sawich Dec 16, 2024
70ee779
Support using onnxruntime 1.16.0 with CUDA 11.4 on Jetson Orin NX (Li…
csukuangfj Dec 19, 2024
86381e1
Update readme to include jetson orin nx and nano b01 (#1631)
csukuangfj Dec 19, 2024
7192e57
feat: add checksum action (#1632)
thewh1teagle Dec 20, 2024
b76cd90
Support decoding with byte-level BPE (bbpe) models. (#1633)
csukuangfj Dec 20, 2024
4681bdf
feat: enable c api for android ci (#1635)
thewh1teagle Dec 20, 2024
a3d6313
Update README.md (#1640)
Humorousf Dec 23, 2024
6613828
SherpaOnnxVadAsr: Offload runSecondPass to background thread for impr…
rominf Dec 24, 2024
d00d1c6
Fix GitHub actions. (#1642)
csukuangfj Dec 24, 2024
30a17b9
Release v1.10.36 (#1643)
csukuangfj Dec 24, 2024
fe3265a
Add new tts models for Latvia and Persian+English (#1644)
csukuangfj Dec 24, 2024
08d7713
Add a byte-level BPE Chinese+English non-streaming zipformer model (#…
csukuangfj Dec 24, 2024
b6f0f5f
Support removing invalid utf-8 sequences. (#1648)
csukuangfj Dec 25, 2024
268d562
Add TeleSpeech CTC to non_streaming_server.py (#1649)
csukuangfj Dec 26, 2024
38d64a6
Fix building macOS libs (#1656)
csukuangfj Dec 27, 2024
49154c9
Add Go API for Keyword spotting (#1662)
csukuangfj Dec 31, 2024
5c2cc48
Add swift online punctuation (#1661)
yujinqiu Dec 31, 2024
2c2926a
Add C++ runtime for Matcha-TTS (#1627)
csukuangfj Dec 31, 2024
b2ad6f6
Release v1.10.37 (#1663)
csukuangfj Dec 31, 2024
d353853
Fix initialize TTS in Python. (#1664)
csukuangfj Dec 31, 2024
ebe92e5
Remove spaces after punctuations for TTS (#1666)
csukuangfj Dec 31, 2024
0a43e9c
Add constructor fromPtr() for all flutter class with factory ctor. (#…
w-rui Dec 31, 2024
3422b93
Add Kotlin API for Matcha-TTS models. (#1668)
csukuangfj Dec 31, 2024
f457bae
Support Matcha-TTS models using espeak-ng (#1672)
csukuangfj Jan 2, 2025
a00d3b4
Add Java API for Matcha-TTS models. (#1673)
csukuangfj Jan 2, 2025
a4365da
Avoid adding tail padding for VAD in generate-subtitles.py (#1674)
csukuangfj Jan 3, 2025
9aa4897
Add C API for MatchaTTS models (#1675)
csukuangfj Jan 3, 2025
6489038
Add CXX API for MatchaTTS models (#1676)
csukuangfj Jan 3, 2025
0e299f3
Add JavaScript API (node-addon-api) for MatchaTTS models. (#1677)
csukuangfj Jan 3, 2025
bf3330c
Add HarmonyOS examples for MatchaTTS. (#1678)
csukuangfj Jan 3, 2025
8a60985
Upgraded to .NET 8 and made code style a little more internally consi…
Lamothe Jan 4, 2025
1ef9e5e
Update workflows to use .NET 8.0 also. (#1681)
Lamothe Jan 4, 2025
3eced3e
Add C# and JavaScript (wasm) API for MatchaTTS models (#1682)
csukuangfj Jan 5, 2025
1fe5fe4
Add Android demo for MatchaTTS models. (#1683)
csukuangfj Jan 5, 2025
6f085ba
Add Swift API for MatchaTTS models. (#1684)
csukuangfj Jan 5, 2025
46330b2
Add Go API for MatchaTTS models (#1685)
csukuangfj Jan 6, 2025
c6fcd32
Add Pascal API for MatchaTTS models. (#1686)
csukuangfj Jan 6, 2025
d7c95d3
Add Dart API for MatchaTTS models (#1687)
csukuangfj Jan 6, 2025
930986b
Release v1.10.38 (#1688)
csukuangfj Jan 6, 2025
6d18430
Fix building without TTS (#1691)
csukuangfj Jan 7, 2025
0cb2db3
Add README for android libs. (#1693)
csukuangfj Jan 7, 2025
ecc6538
Fix: export-onnx.py(expected all tensors to be on the same device) (#…
LuomingXu Jan 10, 2025
0d20558
Fix passing strings from C# to C. (#1701)
csukuangfj Jan 13, 2025
cbe07ac
Release v1.10.39 (#1702)
csukuangfj Jan 13, 2025
ce71b63
Fix building wheels (#1703)
csukuangfj Jan 13, 2025
9efe26a
Export kokoro to sherpa-onnx (#1713)
csukuangfj Jan 15, 2025
ffc6b48
Add C++ and Python API for Kokoro TTS models. (#1715)
csukuangfj Jan 16, 2025
af671e2
Add C API for Kokoro TTS models (#1717)
csukuangfj Jan 16, 2025
2d0869c
Fix style issues (#1718)
csukuangfj Jan 16, 2025
cc812e6
Add C# API for Kokoro TTS models (#1720)
csukuangfj Jan 16, 2025
ad61ad6
Add Swift API for Kokoro TTS models (#1721)
csukuangfj Jan 16, 2025
2086f8c
Add Go API for Kokoro TTS models (#1722)
csukuangfj Jan 16, 2025
4335e2a
Add Dart API for Kokoro TTS models (#1723)
csukuangfj Jan 16, 2025
46f2e32
Add Pascal API for Kokoro TTS models (#1724)
csukuangfj Jan 16, 2025
e8d499d
Add JavaScript API (node-addon) for Kokoro TTS models (#1725)
csukuangfj Jan 16, 2025
3a1de0b
Add JavaScript (WebAssembly) API for Kokoro TTS models. (#1726)
csukuangfj Jan 17, 2025
99cef41
Add Koltin and Java API for Kokoro TTS models (#1728)
csukuangfj Jan 17, 2025
bad82f3
Update README.md for KWS to not use `git lfs`. (#1729)
csukuangfj Jan 17, 2025
2df43b3
Release v1.10.40 (#1731)
csukuangfj Jan 17, 2025
9d6c0e5
Fix UI for Android TTS Engine. (#1735)
csukuangfj Jan 20, 2025
e2f096b
Add iOS TTS example for MatchaTTS (#1736)
csukuangfj Jan 20, 2025
a2650b7
Add iOS example for Kokoro TTS (#1737)
csukuangfj Jan 20, 2025
b943341
Fix `dither` binding in Pybind11 to ensure independence from `high_fr…
jacklynblack Jan 20, 2025
8b989a8
Fix keyword spotting. (#1689)
csukuangfj Jan 20, 2025
e764fa6
Update readme to include https://github.com/hfyydd/sherpa-onnx-server…
csukuangfj Jan 20, 2025
5bcd7e1
Reduce vad-moonshine-c-api example code. (#1742)
whyb Jan 21, 2025
bc3322e
Support Kokoro TTS for HarmonyOS. (#1743)
csukuangfj Jan 22, 2025
66e02d8
Release v1.10.41 (#1744)
csukuangfj Jan 22, 2025
340ebca
Fix publishing wheels (#1746)
csukuangfj Jan 22, 2025
e259529
Update README to include https://github.com/xinhecuican/QSmartAssista…
csukuangfj Jan 23, 2025
030aaa7
Add Kokoro TTS to MFC examples (#1760)
csukuangfj Jan 24, 2025
73c3695
Refactor node-addon C++ code. (#1768)
csukuangfj Jan 25, 2025
f178e96
Add keyword spotter C API for HarmonyOS (#1769)
csukuangfj Jan 26, 2025
8847151
Add ArkTS API for Keyword spotting. (#1775)
csukuangfj Jan 29, 2025
59ff854
Add Flutter example for Kokoro TTS (#1776)
csukuangfj Jan 29, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
22 changes: 16 additions & 6 deletions .github/scripts/node-addon/package-optional.json
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
{
"name": "sherpa-onnx-PLATFORM2-ARCH",
"version": "SHERPA_ONNX_VERSION",
"description": "Speech-to-text and text-to-speech using Next-gen Kaldi without internet connection",
"description": "Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection",
"main": "index.js",
"scripts": {
"test": "echo \"Error: no test specified\" && exit 1"
Expand All @@ -16,8 +16,18 @@
"transcription",
"real-time speech recognition",
"without internet connection",
"locally",
"local",
"embedded systems",
"open source",
"diarization",
"speaker diarization",
"speaker recognition",
"speaker",
"speaker segmentation",
"speaker verification",
"spoken language identification",
"sherpa",
"zipformer",
"asr",
"tts",
Expand All @@ -30,13 +40,13 @@
"offline",
"privacy",
"open source",
"vad",
"speaker id",
"language id",
"node-addon-api",
"streaming speech recognition",
"speech",
"recognition"
"recognition",
"vad",
"node-addon-api",
"speaker id",
"language id"
],
"author": "The next-gen Kaldi team",
"license": "Apache-2.0",
Expand Down
22 changes: 16 additions & 6 deletions .github/scripts/node-addon/package.json
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
{
"name": "sherpa-onnx-node",
"version": "SHERPA_ONNX_VERSION",
"description": "Speech-to-text and text-to-speech using Next-gen Kaldi without internet connection",
"description": "Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection",
"main": "sherpa-onnx.js",
"scripts": {
"test": "echo \"Error: no test specified\" && exit 1"
Expand All @@ -16,8 +16,18 @@
"transcription",
"real-time speech recognition",
"without internet connection",
"locally",
"local",
"embedded systems",
"open source",
"diarization",
"speaker diarization",
"speaker recognition",
"speaker",
"speaker segmentation",
"speaker verification",
"spoken language identification",
"sherpa",
"zipformer",
"asr",
"tts",
Expand All @@ -30,13 +40,13 @@
"offline",
"privacy",
"open source",
"vad",
"speaker id",
"language id",
"node-addon-api",
"streaming speech recognition",
"speech",
"recognition"
"recognition",
"vad",
"node-addon-api",
"speaker id",
"language id"
],
"author": "The next-gen Kaldi team",
"license": "Apache-2.0",
Expand Down
42 changes: 42 additions & 0 deletions .github/scripts/test-cxx-api.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,42 @@
#!/usr/bin/env bash

set -ex

log() {
# This function is from espnet
local fname=${BASH_SOURCE[1]##*/}
echo -e "$(date '+%Y-%m-%d %H:%M:%S') (${fname}:${BASH_LINENO[0]}:${FUNCNAME[1]}) $*"
}

echo "CXX_STREAMING_ZIPFORMER_EXE is $CXX_STREAMING_ZIPFORMER_EXE"
echo "CXX_WHISPER_EXE is $CXX_WHISPER_EXE"
echo "CXX_SENSE_VOICE_EXE is $CXX_SENSE_VOICE_EXE"
echo "PATH: $PATH"

log "------------------------------------------------------------"
log "Test streaming zipformer CXX API"
log "------------------------------------------------------------"
curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/sherpa-onnx-streaming-zipformer-bilingual-zh-en-2023-02-20.tar.bz2
tar xvf sherpa-onnx-streaming-zipformer-bilingual-zh-en-2023-02-20.tar.bz2
rm sherpa-onnx-streaming-zipformer-bilingual-zh-en-2023-02-20.tar.bz2
$CXX_STREAMING_ZIPFORMER_EXE
rm -rf sherpa-onnx-streaming-zipformer-bilingual-zh-en-2023-02-20

log "------------------------------------------------------------"
log "Test Whisper CXX API"
log "------------------------------------------------------------"
curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/sherpa-onnx-whisper-tiny.en.tar.bz2
tar xvf sherpa-onnx-whisper-tiny.en.tar.bz2
rm sherpa-onnx-whisper-tiny.en.tar.bz2
$CXX_WHISPER_EXE
rm -rf sherpa-onnx-whisper-tiny.en

log "------------------------------------------------------------"
log "Test SenseVoice CXX API"
log "------------------------------------------------------------"
curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/sherpa-onnx-sense-voice-zh-en-ja-ko-yue-2024-07-17.tar.bz2
tar xvf sherpa-onnx-sense-voice-zh-en-ja-ko-yue-2024-07-17.tar.bz2
rm sherpa-onnx-sense-voice-zh-en-ja-ko-yue-2024-07-17.tar.bz2

$CXX_SENSE_VOICE_EXE
rm -rf sherpa-onnx-sense-voice-*
55 changes: 39 additions & 16 deletions .github/scripts/test-dart.sh
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,37 @@ set -ex

cd dart-api-examples

pushd tts

echo '----------matcha tts----------'
./run-kokoro-en.sh
./run-matcha-zh.sh
./run-matcha-en.sh
ls -lh *.wav
rm -rf matcha-icefall-*
rm *.onnx

echo '----------piper tts----------'
./run-piper.sh
rm -rf vits-piper-*

echo '----------coqui tts----------'
./run-coqui.sh
rm -rf vits-coqui-*

echo '----------zh tts----------'
./run-vits-zh.sh
rm -rf sherpa-onnx-*

ls -lh *.wav

popd # tts

pushd speaker-diarization
echo '----------speaker diarization----------'
./run.sh
popd

pushd speaker-identification
echo '----------3d speaker----------'
./run-3d-speaker.sh
Expand Down Expand Up @@ -31,6 +62,10 @@ echo "----zipformer transducer----"
./run-zipformer-transducer.sh
rm -rf sherpa-onnx-*

echo "----moonshine----"
./run-moonshine.sh
rm -rf sherpa-onnx-*

echo "----whisper----"
./run-whisper.sh
rm -rf sherpa-onnx-*
Expand Down Expand Up @@ -72,6 +107,10 @@ echo '----------TeleSpeech CTC----------'
./run-telespeech-ctc.sh
rm -rf sherpa-onnx-*

echo '----------moonshine----------'
./run-moonshine.sh
rm -rf sherpa-onnx-*

echo '----------whisper----------'
./run-whisper.sh
rm -rf sherpa-onnx-*
Expand All @@ -93,22 +132,6 @@ rm -rf sherpa-onnx-*

popd # non-streaming-asr

pushd tts

echo '----------piper tts----------'
./run-piper.sh
rm -rf vits-piper-*

echo '----------coqui tts----------'
./run-coqui.sh
rm -rf vits-coqui-*

echo '----------zh tts----------'
./run-zh.sh
rm -rf sherpa-onnx-*

popd # tts

pushd streaming-asr

echo '----------streaming zipformer ctc HLG----------'
Expand Down
46 changes: 35 additions & 11 deletions .github/scripts/test-dot-net.sh
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,41 @@

cd dotnet-examples/

cd ./offline-decode-files
cd ./kokoro-tts
./run-kokoro-en.sh
ls -lh

cd ../offline-tts
./run-matcha-zh.sh
ls -lh *.wav
./run-matcha-en.sh
ls -lh *.wav
./run-aishell3.sh
ls -lh *.wav
./run-piper.sh
ls -lh *.wav
./run-hf-fanchen.sh
ls -lh *.wav
ls -lh

pushd ../..

mkdir tts

cp -v dotnet-examples/kokoro-tts/*.wav ./tts
cp -v dotnet-examples/offline-tts/*.wav ./tts
popd

cd ../offline-speaker-diarization
./run.sh
rm -rfv *.onnx
rm -fv *.wav
rm -rfv sherpa-onnx-pyannote-*

cd ../offline-decode-files
./run-moonshine.sh
rm -rf sherpa-onnx-*

./run-sense-voice-ctc.sh
rm -rf sherpa-onnx-*

Expand Down Expand Up @@ -67,14 +101,4 @@ cd ../spoken-language-identification
./run.sh
rm -rf sherpa-onnx-*

cd ../offline-tts
./run-aishell3.sh
./run-piper.sh
./run-hf-fanchen.sh
ls -lh

cd ../..

mkdir tts

cp dotnet-examples/offline-tts/*.wav ./tts
60 changes: 59 additions & 1 deletion .github/scripts/test-nodejs-addon-npm.sh
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,34 @@ arch=$(node -p "require('os').arch()")
platform=$(node -p "require('os').platform()")
node_version=$(node -p "process.versions.node.split('.')[0]")

echo "----------non-streaming asr + vad----------"
echo "----------non-streaming asr moonshine + vad----------"
curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/sherpa-onnx-moonshine-tiny-en-int8.tar.bz2
tar xvf sherpa-onnx-moonshine-tiny-en-int8.tar.bz2
rm sherpa-onnx-moonshine-tiny-en-int8.tar.bz2

curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/Obama.wav
curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/silero_vad.onnx

node ./test_vad_with_non_streaming_asr_moonshine.js
rm -rf sherpa-onnx-*
rm *.wav
rm *.onnx

echo "----------non-streaming speaker diarization----------"

curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/speaker-segmentation-models/sherpa-onnx-pyannote-segmentation-3-0.tar.bz2
tar xvf sherpa-onnx-pyannote-segmentation-3-0.tar.bz2
rm sherpa-onnx-pyannote-segmentation-3-0.tar.bz2

curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/speaker-recongition-models/3dspeaker_speech_eres2net_base_sv_zh-cn_3dspeaker_16k.onnx

curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/speaker-segmentation-models/0-four-speakers-zh.wav

node ./test_offline_speaker_diarization.js

rm -rfv *.onnx *.wav sherpa-onnx-pyannote-*

echo "----------non-streaming asr whisper + vad----------"
curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/sherpa-onnx-whisper-tiny.en.tar.bz2
tar xvf sherpa-onnx-whisper-tiny.en.tar.bz2
rm sherpa-onnx-whisper-tiny.en.tar.bz2
Expand Down Expand Up @@ -58,6 +85,32 @@ fi

echo "----------tts----------"

curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/tts-models/kokoro-en-v0_19.tar.bz2
tar xf kokoro-en-v0_19.tar.bz2
rm kokoro-en-v0_19.tar.bz2

node ./test_tts_non_streaming_kokoro_en.js
ls -lh *.wav

curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/tts-models/matcha-icefall-en_US-ljspeech.tar.bz2
tar xvf matcha-icefall-en_US-ljspeech.tar.bz2
rm matcha-icefall-en_US-ljspeech.tar.bz2
curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/vocoder-models/hifigan_v2.onnx

node ./test_tts_non_streaming_matcha_icefall_en.js
rm hifigan_v2.onnx
rm -rf matcha-icefall-en_US-ljspeech

curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/tts-models/matcha-icefall-zh-baker.tar.bz2
tar xvf matcha-icefall-zh-baker.tar.bz2
rm matcha-icefall-zh-baker.tar.bz2
curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/vocoder-models/hifigan_v2.onnx

node ./test_tts_non_streaming_matcha_icefall_zh.js
rm hifigan_v2.onnx
rm -rf matcha-icefall-zh-baker
ls -lh *.wav

curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/tts-models/vits-piper-en_GB-cori-medium.tar.bz2
tar xf vits-piper-en_GB-cori-medium.tar.bz2
rm vits-piper-en_GB-cori-medium.tar.bz2
Expand Down Expand Up @@ -204,6 +257,11 @@ rm sherpa-onnx-whisper-tiny.en.tar.bz2
node ./test_asr_non_streaming_whisper.js
rm -rf sherpa-onnx-whisper-tiny.en

curl -SL -O https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/sherpa-onnx-moonshine-tiny-en-int8.tar.bz2
tar xvf sherpa-onnx-moonshine-tiny-en-int8.tar.bz2
rm sherpa-onnx-moonshine-tiny-en-int8.tar.bz2

node ./test_asr_non_streaming_moonshine.js
rm -rf sherpa-onnx-*

ls -lh
Loading