ailia_speech 1.4.0.0

Public Member Functions | |
| bool | IsError () |
| Check whether an error has occurred. | |
| string | GetErrorDetail () |
| Get the error detail. | |
| int | GetEnvironmentId (bool is_gpu) |
| Get the environment ID. | |
| string | GetEnvironmentName () |
| Get the environment name. | |
| bool | Open (string encoder_path, string decoder_path, int env_id, int memory_mode, int model_type, int task, int flag, string language) |
| Create an instance. | |
| bool | OpenVad (string vad_path, int vad_type) |
| Open a VAD file. | |
| bool | OpenDictionary (string dictionary_path, int dictionary_type) |
| Open a dictionary file. | |
| bool | OpenPostProcess (string encoder_path, string decoder_path, string source_path, string target_path, string prefix, int type) |
| Open the PostProcess files. | |
| bool | OpenDiarization (string segmentation_path, string embedding_path, int type) |
| Open the AI model for speaker diarization. | |
| virtual void | Close () |
| Destroy the instance. | |
| virtual void | Dispose () |
| Release resources. | |
| bool | SetPrompt (string prompt) |
| Set the prompt. | |
| bool | SetConstraint (string constraint, int constraint_type) |
| Set the constraint. | |
| bool | Transcribe (float[] waveData, uint frequency, uint channels, bool tail) |
| Perform speech recognition. | |
| bool | IsProcessing () |
| Check whether the sub-thread is processing. | |
| bool | IsTranscribing () |
| Check whether Speech2Text is running. | |
| bool | IsCompleted () |
| Get whether all audio processing is complete. | |
| List< string > | GetResults () |
| Get the Speech2Text results and clear them. | |
| List< AiliaSpeechText > | GetStructuredResults () |
| Get the structured Speech2Text results and clear them. | |
| string | GetIntermediateText () |
| Get the intermediate result of Speech2Text. | |
| bool | ResetTranscribeState () |
| Initialize the Speech2Text state. | |
Static Public Member Functions | |
| static int | IntermediateCallback (IntPtr handle, IntPtr text) |
Protected Member Functions | |
| virtual void | Dispose (bool disposing) |

virtual void Close () [inline], [virtual]

virtual void Dispose () [inline], [virtual]
Release resources.

virtual void Dispose (bool disposing) [inline], [protected], [virtual]

int GetEnvironmentId (bool is_gpu) [inline]
Get the environment ID.
| is_gpu | Whether to use the GPU |
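
string GetEnvironmentName () [inline]

string GetErrorDetail () [inline]

string GetIntermediateText () [inline]

List< string > GetResults () [inline]

List< AiliaSpeechText > GetStructuredResults () [inline]

static int IntermediateCallback (IntPtr handle, IntPtr text) [inline], [static]

bool IsCompleted () [inline]

bool IsError () [inline]

bool IsProcessing () [inline]

bool IsTranscribing () [inline]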

bool Open (string encoder_path, string decoder_path, int env_id, int memory_mode, int model_type, int task, int flag, string language) [inline]
Create an instance.
| encoder_path | Path to the encoder ONNX file |
| decoder_path | Path to the decoder ONNX file |
| env_id | Runtime environment (Ailia.AILIA_ENVIRONMENT_ID_AUTO for automatic selection) |
| memory_mode | Memory mode (bitwise OR of flags such as Ailia.AILIA_MEMORY_REDUCE_CONSTANT, Ailia.AILIA_MEMORY_REDUCE_CONSTANT_WITH_INPUT_INITIALIZER, and Ailia.AILIA_MEMORY_REUSE_INTERSTAGE) |
| model_type | Model type (AiliaSpeech.AILIA_SPEECH_MODEL_TYPE_*) |
| task | Task (AiliaSpeech.AILIA_SPEECH_TASK_*) |
| flag | Bitwise OR of flags (AiliaSpeech.AILIA_SPEECH_FLAG_*) |
| language | Language (ja, en, etc.; auto selects the language automatically) |
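The parameter list for Open is long, so a small wrapper that checks the return value and surfaces GetErrorDetail can make call sites easier to read. The following is a minimal sketch, not the library's own usage pattern. ISpeechModel, OpenSettings, OpenSketch, and FakeModel are hypothetical names introduced here; the interface copies only the documented Open and GetErrorDetail signatures so the example compiles without the ailia assemblies. It assumes that a false return from Open signals failure and that GetErrorDetail is meaningful afterwards, which this page does not state explicitly.

```csharp
using System;

// Hypothetical interface: copies the documented Open and GetErrorDetail signatures.
public interface ISpeechModel
{
    bool Open(string encoder_path, string decoder_path, int env_id, int memory_mode,
              int model_type, int task, int flag, string language);
    string GetErrorDetail();
}

// Hypothetical settings bag. The integer fields are meant to be filled from the
// Ailia.* and AiliaSpeech.* constants named in the parameter table.
public sealed class OpenSettings
{
    public string EncoderPath = "";
    public string DecoderPath = "";
    public int EnvId;
    public int MemoryMode;
    public int ModelType;
    public int Task;
    public int Flag;
    public string Language = "auto"; // "auto" is documented as automatic selection
}

public static class OpenSketch
{
    // Calls Open and throws with the error detail when it returns false.
    // Assumption: false means failure.
    public static void OpenOrThrow(ISpeechModel model, OpenSettings s)
    {
        bool ok = model.Open(s.EncoderPath, s.DecoderPath, s.EnvId, s.MemoryMode,
                             s.ModelType, s.Task, s.Flag, s.Language);
        if (!ok)
        {
            throw new InvalidOperationException("Open failed: " + model.GetErrorDetail());
        }
    }
}

// Hypothetical fake used only to exercise the sketch; it rejects empty paths.
public sealed class FakeModel : ISpeechModel
{
    private string lastError = "";

    public bool Open(string encoder_path, string decoder_path, int env_id, int memory_mode,
                     int model_type, int task, int flag, string language)
    {
        if (string.IsNullOrEmpty(encoder_path) || string.IsNullOrEmpty(decoder_path))
        {
            lastError = "missing model path";
            return false;
        }
        lastError = "";
        return true;
    }

    public string GetErrorDetail()
    {
        return lastError;
    }
}
```

With the real class, the same wrapper would apply to any type exposing these two members; only the interface and the fake would be dropped.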

bool OpenDiarization (string segmentation_path, string embedding_path, int type) [inline]
Open the AI model for speaker diarization.
| segmentation_path | Path to the segmentation ONNX file |
| embedding_path | Path to the embedding ONNX file |
| type | AILIA_SPEECH_DIARIZATION_TYPE_PYANNOTE_AUDIO |

bool OpenDictionary (string dictionary_path, int dictionary_type) [inline]
Open a dictionary file.
| dictionary_path | Path to the dictionary file |
| dictionary_type | Dictionary type (AiliaSpeech.AILIA_SPEECH_DICTIONARY_TYPE_*) |

bool OpenPostProcess (string encoder_path, string decoder_path, string source_path, string target_path, string prefix, int type) [inline]
Open the PostProcess files.
| encoder_path | Path to the encoder ONNX file |
| decoder_path | Path to the decoder ONNX file |
| source_path | Path to the source tokenizer model file |
| target_path | Path to the target tokenizer model file |
| prefix | Prefix for T5 (UTF-8); null for FuguMT |
| type | AILIA_SPEECH_POST_PROCESS_TYPE_* |

bool OpenVad (string vad_path, int vad_type) [inline]

bool ResetTranscribeState () [inline]
Initialize the Speech2Text state.

bool SetConstraint (string constraint, int constraint_type) [inline]
Set the constraint.
| constraint | Constraint text (UTF-8) |
| constraint_type | AILIA_SPEECH_CONSTRAINT_* |

bool SetPrompt (string prompt) [inline]
Set the prompt.
| prompt | Prompt text (UTF-8) |
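
bool Transcribe (float[] waveData, uint frequency, uint channels, bool tail) [inline]
Perform speech recognition.
| waveData | Input PCM |
| frequency | Sampling frequency of the PCM data |
| channels | Number of channels in the PCM data |
| tail | Whether this is the last input |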
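Taken together, Transcribe, IsProcessing, and GetResults suggest a feed-then-poll pattern: hand PCM to Transcribe, wait while IsProcessing reports work in progress, then drain GetResults. The following is a minimal sketch under that reading. ISpeechRecognizer, TranscribeSketch, and FakeRecognizer are hypothetical names introduced here; the interface copies only the documented signatures so the example compiles without the ailia assemblies. That Transcribe may return before recognition finishes is an assumption drawn from the "sub-thread" wording, not something this page states.

```csharp
using System;
using System.Collections.Generic;
using System.Threading;

// Hypothetical interface: copies a subset of the documented signatures.
public interface ISpeechRecognizer
{
    bool Transcribe(float[] waveData, uint frequency, uint channels, bool tail);
    bool IsProcessing();
    List<string> GetResults();
    string GetErrorDetail();
}

public static class TranscribeSketch
{
    // Feeds one buffer as the final input, polls until idle, and returns the text.
    public static List<string> TranscribeAll(ISpeechRecognizer model, float[] pcm,
                                             uint frequency, uint channels)
    {
        // tail = true marks this buffer as the last input.
        if (!model.Transcribe(pcm, frequency, channels, true))
        {
            throw new InvalidOperationException(model.GetErrorDetail());
        }
        // Assumption: recognition may continue on a sub-thread after Transcribe returns.
        while (model.IsProcessing())
        {
            Thread.Sleep(10);
        }
        return model.GetResults();
    }
}

// Hypothetical fake used only to exercise the sketch.
public sealed class FakeRecognizer : ISpeechRecognizer
{
    private int pendingPolls;
    private readonly List<string> results = new List<string>();

    public bool Transcribe(float[] waveData, uint frequency, uint channels, bool tail)
    {
        if (waveData == null || waveData.Length == 0 || channels == 0) return false;
        pendingPolls = 2; // pretend the work takes two polls to finish
        results.Add((waveData.Length / channels) + " frames at " + frequency + " Hz");
        return true;
    }

    public bool IsProcessing()
    {
        return pendingPolls-- > 0;
    }

    public List<string> GetResults()
    {
        var copy = new List<string>(results);
        results.Clear(); // mirrors the documented "get and clear" behaviour
        return copy;
    }

    public string GetErrorDetail()
    {
        return "empty input";
    }
}
```

In a Unity-style frame loop the blocking wait would more likely become a per-frame check of IsProcessing, with GetIntermediateText read while work is still pending.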