You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
4. model size ("tiny", "small", "medium", "large", "large-v1", "large-v2", "large-v3")
49
55
50
-
5. voices_folder (contains speaker voice samples for speaker recognition)
56
+
5. ACCESS_TOKEN: huggingface acccess token (also get permission to access `pyannote/speaker-diarization@2.1`)
57
+
58
+
6. voices_folder (contains speaker voice samples for speaker recognition)
51
59
52
-
6. quantization: this determine whether to use int8 quantization or not. Quantization may speed up the process but lower the accuracy.
60
+
7. quantization: this determine whether to use int8 quantization or not. Quantization may speed up the process but lower the accuracy.
53
61
54
62
voices_folder should contain subfolders named with speaker names. Each subfolder belongs to a speaker and it can contain many voice samples. This will be used for speaker recognition to identify the speaker.
55
63
@@ -64,26 +72,34 @@ transcript will also indicate the timeframe in seconds where each speaker speaks
Copy file name to clipboardExpand all lines: setup.py
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@
5
5
6
6
setup(
7
7
name="speechlib",
8
-
version="1.1.0",
8
+
version="1.1.2",
9
9
description="speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names. This library also contain audio preprocessor functions.",
0 commit comments