1
US10194023B1
Publication/Patent Number: US10194023B1
Publication date: 2019-01-29
Application number: 15/692,444
Filing date: 2017-08-31
Abstract: A system capable of connecting a device to a Public Switched Telephone Network (PSTN) using an adapter. The adapter may receive caller identification from the PSTN during a second ringing signal and may send notifications to remote server(s) indicating the caller identification. The remote server(s) may use the caller identification to enable additional functionality in a speech processing system. For example, the remote server(s) may identify contact information corresponding to the caller identification, may determine information about the contact, such as recent meetings, communications, or the like, and may output the information when announcing the incoming call. In addition, the remote server(s) may compare the caller identification to a database of spam and indicate that the incoming call is possible spam.
2
US10186267B1
Publication/Patent Number: US10186267B1
Publication date: 2019-01-22
Application number: 15/392,844
Filing date: 2016-12-28
Abstract: Methods and systems for prioritizing messages for playback are described herein. In some embodiments, a request for messages to be output may be received by a speech-processing system. The speech-processing system may include a message database that includes messages received for a speaker of the request's user account and/or a group account associated with a shared electronic device that the request was received from. One or more prioritization rules may be applied to the messages to order the messages for playback in order to provide an optimal voice user interface for the requesting individual. For instance, messages received for the user account may be prioritized over messages received for the group account, messages received from a similar sender or a high priority sender may be prioritized over other messages, and messages that are indicated as being urgent may be prioritized over messages that are indicated as being non-urgent.
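The three example rules in this abstract can be illustrated with a small sort. This is only a sketch: the Message fields, the prioritize function, and the relative precedence of the rules (account first, then sender, then urgency) are assumptions made for illustration and are not taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class Message:
    sender: str
    account: str   # "user" or "group" (hypothetical labels)
    urgent: bool
    text: str


def prioritize(messages, high_priority_senders=frozenset()):
    """Order messages for playback using three illustrative rules."""
    def sort_key(m):
        # False sorts before True, so each test is phrased as "is lower priority".
        return (
            m.account != "user",                    # user account before group account
            m.sender not in high_priority_senders,  # high priority senders first
            not m.urgent,                           # urgent before non-urgent
        )
    # sorted() is stable, so equally ranked messages keep their arrival order.
    return sorted(messages, key=sort_key)


inbox = [
    Message("Bob", "group", False, "Dinner at 7"),
    Message("Ann", "user", False, "Call me back"),
    Message("Cy", "user", True, "Flight moved up"),
    Message("Dee", "group", True, "Door left open"),
]
ordered = prioritize(inbox, high_priority_senders={"Ann"})
```

Because the rules are encoded as a tuple, changing their precedence only requires reordering the tuple elements.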
3
US10270826B2
Publication/Patent Number: US10270826B2
Publication date: 2019-04-23
Application number: 16/002,454
Filing date: 2018-06-07
Abstract: An example embodiment may involve receiving an indication of media content selected by way of a first client device. The indication may specify that the media content has been flagged for audible playout at a later time (such as when the client device or its user is in an automobile). The example embodiment may further involve receiving a request to stream the audio file to a second client device. The second client device may be associated with the first client device. The example embodiment may further involve causing the audio file to be streamed to the second client device.
4
US10235989B2
Publication/Patent Number: US10235989B2
Publication date: 2019-03-19
Application number: 15/265,836
Filing date: 2016-09-14
Inventor: Blyumen, Julia  
Abstract: A text mining tool is operated on a given text to obtain words and/or phrases ranked by frequency of occurrence. Thereafter, a text-to-speech converter is used to speak each word/phrase output by the text mining tool, and how loud each word/phrase is spoken depends on a corresponding frequency which is additionally output by the text mining tool, for each word/phrase. In certain embodiments, words/phrases are categorized into multiple themes by the text mining tool, and in these embodiments corresponding multiple voices and/or accents are used, to indicate via sonification, a specific theme of each word/phrase being spoken.
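The frequency-to-loudness idea in this abstract can be sketched as follows. The function name word_volumes, the linear mapping, and the volume range of 0.2 to 1.0 are assumptions for illustration; the abstract does not say how frequency is converted to loudness, and no actual text-to-speech engine is called here.

```python
import re
from collections import Counter


def word_volumes(text, min_volume=0.2, max_volume=1.0):
    """Rank words by frequency and assign each a playback volume.

    The most frequent word gets max_volume; rarer words scale down
    linearly toward min_volume (an assumed mapping).
    """
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    if not counts:
        return []
    top = max(counts.values())
    span = max_volume - min_volume
    # most_common() returns (word, count) pairs, highest count first.
    return [(w, min_volume + span * (c / top)) for w, c in counts.most_common()]


volumes = word_volumes("the cat and the dog and the bird")
```

Each (word, volume) pair could then be passed to whatever speech synthesizer is in use, with the volume controlling how loudly the word is spoken.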
5
US10249288B2
Publication/Patent Number: US10249288B2
Publication date: 2019-04-02
Application number: 16/010,429
Filing date: 2018-06-16
Abstract: An approach is provided that assists visually impaired users. The approach analyzes a document that is being utilized by the visually impaired user. The analysis derives a sensitivity of the document. A vocal characteristic corresponding to the derived sensitivity is retrieved. Text from the document is audibly read to the visually impaired user with a text to speech process that utilizes the retrieved vocal characteristic. The retrieved vocal characteristic conveys the derived sensitivity of the document to the visually impaired user.
6
US10341825B2
Publication/Patent Number: US10341825B2
Publication date: 2019-07-02
Application number: 15/488,820
Filing date: 2017-04-17
Inventor: Dowlatkhah, Sangar  
Abstract: Systems, methods, and computer-readable storage devices for converting text messages to speech data. A text message may be received. The text message may be associated with a recipient identification of a recipient of the text message. Preference information for converting the text message to speech data may be received. The text message may be converted to the speech data based on the preference information. The speech data may be communicated to the recipient.
7
US10320981B2
Publication/Patent Number: US10320981B2
Publication date: 2019-06-11
Application number: 15/707,951
Filing date: 2017-09-18
Inventor: Kurganov, Alexander  
Abstract: The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.
8
US10346536B2
Publication/Patent Number: US10346536B2
Publication date: 2019-07-09
Application number: 15/867,585
Filing date: 2018-01-10
Inventor: Sankar, Sriram  
Abstract: In one embodiment, a method includes accessing a string of symbols by a computing device. The string is divided into one or more string components each including at least one of the symbols, and each string component is associated with at least one string-position identifier. The string components and their respective associated string-position identifiers are stored for the string of symbols.
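One simple way to picture the division described in this abstract is fixed-size chunking, where each chunk's index serves as its string-position identifier. This is a sketch under stated assumptions: the function name, the fixed chunk size, and the use of a zero-based index as the identifier are illustrative choices, not details from the patent.

```python
def split_with_positions(s, size=3):
    """Divide a string into components and pair each with a position identifier."""
    if size < 1:
        raise ValueError("size must be at least 1")
    # i // size is the zero-based index of the component starting at offset i.
    return [(i // size, s[i:i + size]) for i in range(0, len(s), size)]


# Store the components and identifiers keyed by the original string.
store = {}
source = "abcdefg"
store[source] = split_with_positions(source)
```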
9
US2019035383A1
Publication/Patent Number: US2019035383A1
Publication date: 2019-01-31
Application number: 16/147,838
Filing date: 2018-09-30
Abstract: A device for communicating with a remote device is disclosed, which includes a processor and a memory in communication with the processor. The memory includes executable instructions that, when executed, cause the processor to control the device to perform functions of establishing, via a communication network, a communication session with the remote device; capturing a speech spoken by a user and generating audio data representing the captured speech by the user; encoding the audio data for transmission to the remote device via the communication network; converting the audio data to text data representing the captured speech; and transmitting, during the communication session, the encoded audio data and the text data to the remote device via the communication network. The device thus can provide the text data representing the captured speech when a quality of the encoded audio signal received by the remote device is below a predetermined level.
10
US2019005952A1
Publication/Patent Number: US2019005952A1
Publication date: 2019-01-03
Application number: 15/635,936
Filing date: 2017-06-28
Abstract: Technologies for secure storage of utterances are disclosed. A computing device captures audio of a human making a verbal utterance. The utterance is provided to a speech-to-text (STT) service that translates the utterance to text. The STT service can also identify various speaker-specific attributes in the utterance. The text and attributes are provided to a text-to-speech (TTS) service that creates speech from the text and a subset of the attributes. The speech is stored in a data store that is less secure than that required for storing the original utterance. The original utterance can then be discarded. The STT service can also translate the speech generated by the TTS service to text. The text generated by the STT service from the speech and the text generated by the STT service from the original utterance are then compared. If the text does not match, the original utterance can be retained.
11
US2019180732A1
Publication/Patent Number: US2019180732A1
Publication date: 2019-06-13
Application number: 16/277,919
Filing date: 2019-02-15
Assignee: Baidu USA LLC
Abstract: Described herein are embodiments of an end-to-end text-to-speech (TTS) system with parallel wave generation. In one or more embodiments, a Gaussian inverse autoregressive flow is distilled from an autoregressive WaveNet by minimizing a novel regularized Kullback-Leibler (KL) divergence between their highly-peaked output distributions. Embodiments of the methodology compute the KL divergence in a closed-form, which simplifies the training process and provides very efficient distillation. Embodiments of a novel text-to-wave neural architecture for speech synthesis are also described, which are fully convolutional and enable fast end-to-end training from scratch. These embodiments significantly outperform the previous pipeline that connects a text-to-spectrogram model to a separately trained WaveNet. Also, a parallel waveform synthesizer embodiment conditioned on the hidden representation in an embodiment of this end-to-end model was successfully distilled.
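For context on why a closed form is convenient, the standard KL divergence between two univariate Gaussians can be evaluated directly from their means and standard deviations, with no sampling. The sketch below shows only that textbook formula; the regularized variant mentioned in the abstract is not specified there and is not reproduced here, and the function name gaussian_kl is an assumption.

```python
import math


def gaussian_kl(mu_q, sigma_q, mu_p, sigma_p):
    """KL(q || p) for q = N(mu_q, sigma_q^2) and p = N(mu_p, sigma_p^2)."""
    if sigma_q <= 0 or sigma_p <= 0:
        raise ValueError("standard deviations must be positive")
    # Standard closed form: log(sp/sq) + (sq^2 + (mq - mp)^2) / (2 sp^2) - 1/2
    return (
        math.log(sigma_p / sigma_q)
        + (sigma_q ** 2 + (mu_q - mu_p) ** 2) / (2 * sigma_p ** 2)
        - 0.5
    )


same = gaussian_kl(0.0, 1.0, 0.0, 1.0)
shifted = gaussian_kl(1.0, 1.0, 0.0, 1.0)
```

The divergence is zero for identical distributions and grows with the squared distance between the means, which is what makes it usable as a per-sample distillation loss when both models output Gaussian parameters.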
12
US2019156830A1
Publication/Patent Number: US2019156830A1
Publication date: 2019-05-23
Application number: 16/251,901
Filing date: 2019-01-18
Abstract: Methods and systems for providing message playback using a shared electronic device are described herein. In response to receiving a request to output messages, a speech-processing system may determine a group account associated with a requesting device, and may determine messages stored by a message data store for the group account. Speaker identification processing may also be performed to determine a speaker of the request. A user account associated with the speaker, and messages stored for the user account, may be determined. A summary response indicating the user account's messages and the group account's messages may then be generated such that the user account messages are identified prior to the group account's messages. The messages may then be analyzed to determine an appropriate voice user interface for the requester such that the playback of the messages using a shared electronic device is more natural and conversational.
13
US2019115007A1
Publication/Patent Number: US2019115007A1
Publication date: 2019-04-18
Application number: 16/037,872
Filing date: 2018-07-17
Abstract: Systems and methods are disclosed for providing non-lexical cues in synthesized speech. Original text is analyzed to determine characteristics of the text and/or to derive or augment an intent (e.g., an intent code). Non-lexical cue insertion points are determined based on the characteristics of the text and/or the intent. One or more non-lexical cues are inserted at insertion points to generate augmented text. The augmented text is synthesized into speech, including converting the non-lexical cues to speech output.
14
US2019197097A1
Publication/Patent Number: US2019197097A1
Publication date: 2019-06-27
Application number: 15/852,340
Filing date: 2017-12-22
Abstract: Performing an operation comprising extracting, from an input comprising unstructured electronic text, a plurality of feature values for a plurality of features defined in a feature vector, identifying, based on a machine learning (ML) model applied to the plurality of feature values, a portion of the electronic text corresponding to an adverse event, and annotating the portion of the electronic text with an indication of the identified adverse event.