ailia_speech
1.4.0.0
|
Public Member Functions | |
bool | IsError () |
Check is error occured. More... | |
string | GetErrorDetail () |
Get error detail. More... | |
int | GetEnvironmentId (bool is_gpu) |
Get the environmen id. More... | |
string | GetEnvironmentName () |
Get the environmen name. More... | |
bool | Open (string encoder_path, string decoder_path, int env_id, int memory_mode, int model_type, int task, int flag, string language) |
Create a instance. More... | |
bool | OpenVad (string vad_path, int vad_type) |
Open VAD file. More... | |
bool | OpenDictionary (string dictionary_path, int dictionary_type) |
Open dictionary file. More... | |
bool | OpenPostProcess (string encoder_path, string decoder_path, string source_path, string target_path, string prefix, int type) |
Open PostProcess file. More... | |
bool | OpenDiarization (string segmentation_path, string embedding_path, int type) |
Open AI model for speaker diarization. More... | |
virtual void | Close () |
Destroys instance. More... | |
virtual void | Dispose () |
Release resources. More... | |
bool | SetPrompt (string prompt) |
Set prompt. More... | |
bool | SetConstraint (string constraint, int constraint_type) |
Set constraint. More... | |
bool | Transcribe (float[] waveData, uint frequency, uint channels, bool tail) |
Perform speech recognition. More... | |
bool | IsProcessing () |
Check is processing sub thread. More... | |
bool | IsTranscribing () |
Check is running Speech2Text. More... | |
bool | IsCompleted () |
Gets whether all audio processing is complete. More... | |
List< string > | GetResults () |
Get results and clear of Speech2Text. More... | |
List< AiliaSpeechText > | GetStructuredResults () |
Get structured results and clear of Speech2Text. More... | |
string | GetIntermediateText () |
Get the intermediate result of Speech2Text. More... | |
bool | ResetTranscribeState () |
Initialize the Speech2Text state. More... | |
Static Public Member Functions | |
static int | IntermediateCallback (IntPtr handle, IntPtr text) |
Protected Member Functions | |
virtual void | Dispose (bool disposing) |
|
inlinevirtual |
|
inlinevirtual |
Release resources.
|
inlineprotectedvirtual |
|
inline |
Get the environmen id.
is_gpu | Whether to use GPU |
|
inline |
|
inline |
|
inline |
|
inline |
|
inline |
|
inlinestatic |
|
inline |
|
inline |
|
inline |
|
inline |
|
inline |
Create a instance.
encoder_path | Encoder onnx file path |
decoder_path | Decoder onnx file path |
env_id | Runtime environment (Ailia.AILIA_ENVIRONMENT_ID_AUTO for automatic selection) |
memory_mode | Memory mode (Ailia.AILIA_MEMORY_REDUCE_CONSTANT | Ailia.AILIA_MEMORY_REDUCE_CONSTANT_WITH_INPUT_INITIALIZER | Ailia.AILIA_MEMORY_REUSE_INTERSTAGE etc) |
model_type | Model type (AiliaSpeech.AILIA_SPEECH_MODEL_TYPE_*) |
task | Task (AiliaSpeech.AILIA_SPEECH_TASK_*) |
flag | OR of flags (AiliaSpeech.AILIA_SPEECH_FLAG_*) |
language | Language (ja or en or etc. auto is automatic selection) |
|
inline |
Open AI model for speaker diarization.
segmentation_path | The path name to the onnx file |
embedding_path | The path name to the onnx file |
type | AILIA_SPEECH_DIARIZATION_TYPE_PYANNOTE_AUDIO |
|
inline |
Open dictionary file.
dictionary_path | dictionary file path |
dictionary_type | dictionary type (AiliaSpeech.AILIA_SPEECH_DICTIONARY_TYPE_*) |
|
inline |
Open PostProcess file.
encoder_path | The path name to the onnx file |
decoder_path | The path name to the onnx file |
source_path | The path name to the tokenizer model file |
target_path | The path name to the tokenizer model file |
prefix | The prefix of T5 (UTF8), null for FuguMT |
post_process_type | AILIA_SPEECH_POST_PROCESS_TYPE_* |
|
inline |
|
inline |
Initialize the Speech2Text state.
|
inline |
Set constraint.
constraint | The text of constraint (UTF8) |
constraint_type | AILIA_SPEECH_CONSTRAINT_* |
|
inline |
Set prompt.
prompt | The text of prompt (UTF8) |
|
inline |
Perform speech recognition.
waveQueue | Input PCM |
frequency | Frequency of PCM |
channels | Number of channels of PCM |
tail | Is last input |