Skip to content

Latest commit

 

History

History
106 lines (58 loc) · 5.04 KB

training.md

File metadata and controls

106 lines (58 loc) · 5.04 KB

Protocol Documentation

Top

training/v1/training.proto

TODO:

Field Type Label Description
streaming_config google.cloud.speech.v1.RecognitionConfig Provides information to the recognizer that specifies how to process the request. The first StreamingTranslationRequest message must contain a streaming_config message.
audio_content bytes The audio data to be trained. Sequential chunks of audio data are sent in sequential StreamingTranslationRequest messages. The first StreamingTranslationRequest message must not contain audio_content data and all subsequent StreamingTranslationRequest messages must contain audio_content data. The audio bytes must be encoded as specified in RecognitionConfig. Note: as with all bytes fields, protobuffers use a pure binary representation (not base64). See audio limits.
transcript string
media_identity string the media identity
Field Type Label Description
media_identity string the media identity
language_code string the language code of the payload ISO-639-1/bcp47 Code with tags https://cloud.google.com/translate/docs/languages
format string the format of the payload
played_time double how long does this transcript been played
total_time double how long of the entire video
delay int64 in ms, can be +/-
hints string hints or keywords related to this media
transcript_identity string transcript identity if any
payload bytes the transcripts payload

Trainng that implements Sagittarius Translation API

Method Name Request Type Response Type Description
PushData TrainingData .google.rpc.Status Push accture traning data to server
StreamingTraining StreamingTrainingRequest .google.rpc.Status Performs bidirectional streaming audio translation: receive results while sending audio. This method is only available via the gRPC API (not REST).

Scalar Value Types

.proto Type Notes C++ Type Java Type Python Type
double double double float
float float float float
int32 Uses variable-length encoding. Inefficient for encoding negative numbers – if your field is likely to have negative values, use sint32 instead. int32 int int
int64 Uses variable-length encoding. Inefficient for encoding negative numbers – if your field is likely to have negative values, use sint64 instead. int64 long int/long
uint32 Uses variable-length encoding. uint32 int int/long
uint64 Uses variable-length encoding. uint64 long int/long
sint32 Uses variable-length encoding. Signed int value. These more efficiently encode negative numbers than regular int32s. int32 int int
sint64 Uses variable-length encoding. Signed int value. These more efficiently encode negative numbers than regular int64s. int64 long int/long
fixed32 Always four bytes. More efficient than uint32 if values are often greater than 2^28. uint32 int int
fixed64 Always eight bytes. More efficient than uint64 if values are often greater than 2^56. uint64 long int/long
sfixed32 Always four bytes. int32 int int
sfixed64 Always eight bytes. int64 long int/long
bool bool boolean boolean
string A string must always contain UTF-8 encoded or 7-bit ASCII text. string String str/unicode
bytes May contain any arbitrary sequence of bytes. string ByteString str