Nuance Communications, Inc.

United States of America

1-100 of 337 for Nuance Communications, Inc.

Sort by

Query

Patent
World - WIPO
Excluding Subsidiaries

Aggregations

Reset Report

Date

IPC Class

Found results for

patents

1 2 3 4 Next Page

1. VOICE BIOMETRICS FOR ANONYMOUS IDENTIFICATION AND PERSONALIZATION

Application Number	US2023031100
Publication Number	2024/072592
Status	In Force
Filing Date	2023-08-24
Publication Date	2024-04-04
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Rohatgi, Abhishek Dalmasso, Emanuele Samtani, Dinesh Olvera, Eduardo

Abstract

Example solutions for voice biometrics for anonymous identification and personalization capture an audio signal containing voice signal from a speaker. A plurality of unlabeled voiceprints are stored that are each associated with an anonymous label. The speaker's voice signal is recognized as matching one of the unlabeled voiceprints, enabling identification of the associated anonymous label. Historical information associated with the identified anonymous label is used to generate an alert specific to the speaker. Example practical applications include leveraging a customer relations management (CRM) interaction record to provide a personalized experience to the speaker and providing a warning to a user that the speaker is on a watchlist. These and other practical applications are possible, even though the speaker's identity may be unknown, and the speaker has not enrolled in a voice biometric system. Solutions for generating the unlabeled voiceprints are also disclosed.

IPC Classes ?

G10L 17/00 - Speaker identification or verification
G10L 17/02 - Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
G10L 17/04 - Training, enrolment or model building
G10L 17/26 - Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
G06F 21/32 - User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints

2. SYSTEM AND METHOD FOR WATERMARKING TRAINING DATA FOR MACHINE LEARNING MODELS

Application Number	US2023030650
Publication Number	2024/058901
Status	In Force
Filing Date	2023-08-20
Publication Date	2024-03-21
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Milanovic, Ljubomir Naylor, Patrick Aubrey Jost, Uwe Helmut Ganong, William Francis, Iii

Abstract

A method, computer program product, and computing system for identifying a target output token associated with an output of a machine learning model. A portion of training data corresponding to the target output token is modified with a watermark feature, thus defining watermarked training data.

IPC Classes ?

G10L 15/16 - Speech classification or search using artificial neural networks
G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
G10L 19/018 - Audio watermarking, i.e. embedding inaudible data in the audio signal

3. SYSTEM AND METHOD FOR WATERMARKING DATA FOR TRACING ACCESS

Application Number	US2023028901
Publication Number	2024/049598
Status	In Force
Filing Date	2023-07-28
Publication Date	2024-03-07
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Ganong, William Francis, Iii Milanovic, Ljubomir Sharma, Dushyant Jost, Uwe Helmut Naylor, Patrick Aubrey

Abstract

A method, computer program product, and computing system for receiving, from a requesting party, a request to access data from a storage device. Identity information associated with the requesting party is determined. A bespoke identity-based watermark is generated for the requesting party. The bespoke identity-based watermark is encoded into the data. The watermarked data is provided to the requesting party.

IPC Classes ?

H04L 9/40 - Network security protocols
G10L 19/018 - Audio watermarking, i.e. embedding inaudible data in the audio signal
G06F 21/16 - Program or content traceability, e.g. by watermarking

4. SYSTEM AND METHOD FOR WATERMARKING AUDIO DATA FOR AUTOMATED SPEECH RECOGNITION (ASR) SYSTEMS

Application Number	US2023028902
Publication Number	2024/049599
Status	In Force
Filing Date	2023-07-28
Publication Date	2024-03-07
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Naylor, Patrick, Aubrey Sharma, Dushyant Ganong, William, Francis, Iii Jost, Uwe, Helmut Milanovic, Ljubomir

Abstract

A method, computer program product, and computing system for processing audio information associated with a speech processing system and encoding a watermark in a non-disruptive portion of the audio information.

IPC Classes ?

G10L 19/018 - Audio watermarking, i.e. embedding inaudible data in the audio signal

5. SYSTEM AND METHOD FOR SECURE TRAINING OF SPEECH PROCESSING SYSTEMS

Application Number	US2023022815
Publication Number	2023/244404
Status	In Force
Filing Date	2023-05-19
Publication Date	2023-12-21
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Yin, Shou-Chun Park, Junho Sharma, Dushyant Kim, Doyeong

Abstract

A method, computer program product, and computing system for generating an obscured speech signal from an input speech signal and an obscured transcription from a transcription of the input speech signal. A speaker embedding may be extracted from the input speech signal. A speaker embedding delta may be generated based upon, at least in part, the extracted speaker embedding and a synthetic speaker embedding. A synthetic speech signal may be generated from the obscured speech signal using the synthetic speaker embedding. A residual signal may be generated based upon, at least in part, the obscured speech signal and the speaker embedding delta. A speech processing system may be trained using the obscured transcription, the synthetic speech signal, the speaker embedding delta, and the residual signal.

IPC Classes ?

G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
G10L 13/00 - Speech synthesis; Text to speech systems
G10L 15/16 - Speech classification or search using artificial neural networks

6. SYSTEM AND METHOD FOR SECURE TRANSCRIPTION GENERATION

Application Number	US2023019025
Publication Number	2023/235068
Status	In Force
Filing Date	2023-04-19
Publication Date	2023-12-07
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Ganong, William F., Iii Jost, Uwe Helmut Sharma, Dushyant

Abstract

A method, computer program product, and computing system for receiving an input speech signal. A transcription of the input speech signal may be generated via an automated speech recognition (ASR) system. One or more splitting points between one or more sensitive content portions and one or more non-sensitive content portions from the transcription may be identified. The input speech signal maybe split into the one or more sensitive content portions and the one or more non-sensitive content portions based upon, at least in part, the one or more splitting points, thus defining one or more sensitive content signals and one or more non-sensitive content signals.

IPC Classes ?

G10L 15/26 - Speech to text systems
G06F 40/174 - Form filling; Merging
G16H 10/60 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
G10L 21/0272 - Voice signal separating
G10L 15/18 - Speech classification or search using natural language modelling

7. SYSTEM AND METHOD FOR SECURE TRANSCRIPTION GENERATION

Application Number	US2023019024
Publication Number	2023/235067
Status	In Force
Filing Date	2023-04-19
Publication Date	2023-12-07
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Ganong, William F., Iii Jost, Uwe Helmut

Abstract

A method, computer program product, and computing system for receiving an input speech signal. A transcription of the input speech signal may be received. One or more sensitive content portions may be identified from the transcription of the input speech signal. The one or more sensitive content portions from the transcription of the input speech signal may be obscured, thus defining an obscured transcription of the input speech signal. An obscured speech signal may be generated based upon, at least in part, the input speech signal, the transcription of the input speech signal, and the obscured transcription of the input speech signal.

IPC Classes ?

G10L 21/00 - Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G06F 21/60 - Protecting data
G10L 15/26 - Speech to text systems

8. END-TO-END AUTOMATIC SPEECH RECOGNITION SYSTEM FOR BOTH CONVERSATIONAL AND COMMAND-AND-CONTROL SPEECH

Application Number	US2023019009
Publication Number	2023/215105
Status	In Force
Filing Date	2023-04-19
Publication Date	2023-11-09
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Coucheiro Limeres, Alejandro Park, Junho

Abstract

A contextual end-to-end automatic speech recognition (ASR) system includes: an audio encoder configured to process input audio signal to produce as output encoded audio signal; a bias encoder configured to produce as output at least one bias entry corresponding to a word to bias for recognition by the ASR system; a transcription token probability prediction network configured to produce as output a probability of a selected transcription token, based at least in part on the output of the bias encoder and the output of the audio encoder; a first attention mechanism configured to receive the at least one bias entry and determine whether the at least one bias entry is suitable to be transcribed at a specific moment of an ongoing transcription; and a second attention mechanism configured to produce prefix penalties for restricting the first attention mechanism to only entries fitting a current transcription context.

IPC Classes ?

G10L 15/16 - Speech classification or search using artificial neural networks

9. FREQUENCY MAPPING IN THE VOICEPRINT DOMAIN

Application Number	US2023060672
Publication Number	2023/164332
Status	In Force
Filing Date	2023-01-13
Publication Date	2023-08-31
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Vair, Claudio Talib, Haydar Farrell, Kevin Robert Colibro, Daniele Ernesto

Abstract

There is provided a method that includes (a) obtaining a first voice vector that was derived from a signal of a voice that was sampled at a first sampling frequency, (b) obtaining a second voice vector that was derived from a signal of a voice that was sampled at a second sampling frequency, (c) mapping the second voice vector into a mapped voice vector in accordance with a machine learning model, and (d) comparing the first voice vector to the mapped voice vector to yield a score that indicates a probability that the first voice vector and the second voice vector originated from a same person.

IPC Classes ?

G10L 17/04 - Training, enrolment or model building
G10L 17/20 - Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions
G10L 17/00 - Speaker identification or verification

10. AUTOMATIC CANONICALIZATION IN A SEMANTIC TAGGER AND SPEECH-TO-TEXT PIPELINE

Application Number	US2023012645
Publication Number	2023/154360
Status	In Force
Filing Date	2023-02-08
Publication Date	2023-08-17
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Divay, Olivier

Abstract

A method of correcting an automatic speech recognition (ASR) output of an ASR module, includes: providing a corrector model configured to receive the ASR output; pre-training and training the corrector model to map the ASR output to desired formatting of a natural language understanding (NLU) dataset; and fine-tuning the corrector model. The mapping of the ASR output utilizes deep neural network (DNN). Out-of-domain data simulating the ASR output is utilized as the ASR output for the pre-training of the corrector model; in-domain data is utilized for the training of the corrector model; and project-specific data is utilized for fine-tuning the corrector model. The data simulating the ASR output is generated by a simulated ASR runtime process including: feeding raw text into a tokenizer to generate spelled-out text; and feeding the spelled-out text into a formatter to generate formatted text as the data simulating the ASR output.

IPC Classes ?

G10L 15/16 - Speech classification or search using artificial neural networks
G10L 15/18 - Speech classification or search using natural language modelling
G06F 40/30 - Semantic analysis
G10L 15/26 - Speech to text systems
G06F 40/253 - Grammatical analysis; Style critique

11. DATA AUGMENTATION SYSTEM AND METHOD FOR MULTI-MICROPHONE SYSTEMS

Application Number	US2023060986
Publication Number	2023/141564
Status	In Force
Filing Date	2023-01-20
Publication Date	2023-07-27
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Milanovic, Ljubomir Salletmayr, Philip Gong, Rong Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for obtaining one or more speech signals from a first device, thus defining one or more first device speech signals. One or more speech signals may be obtained from a second device, thus defining one or more second device speech signals. An acoustic relative transfer function may be selected from a plurality of acoustic relative transfer functions based upon, at least in part, the one or more first device speech signals and the one or more second device speech signals. The one or more second device speech signals may be augmented, at run-time, based upon, at least in part, the acoustic relative transfer function

IPC Classes ?

G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 21/0208 - Noise filtering
G10L 21/0216 - Noise filtering characterised by the method used for estimating noise
G10L 21/0224 - Processing in the time domain
G10L 21/0232 - Processing in the frequency domain
G10L 25/00 - Speech or voice analysis techniques not restricted to a single one of groups

12. DATA AUGMENTATION SYSTEM AND METHOD FOR MULTI-MICROPHONE SYSTEMS

Application Number	US2023060974
Publication Number	2023/141557
Status	In Force
Filing Date	2023-01-20
Publication Date	2023-07-27
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Milanovic, Ljubomir Salletmayr, Philip Gong, Rong Naylor, Patrick A.

Abstract

IPC Classes ?

G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 21/0208 - Noise filtering
G10L 21/0216 - Noise filtering characterised by the method used for estimating noise
G10L 21/0224 - Processing in the time domain
G10L 21/0232 - Processing in the frequency domain
G10L 25/00 - Speech or voice analysis techniques not restricted to a single one of groups

13. DATA AUGMENTATION SYSTEM AND METHOD FOR MULTI-MICROPHONE SYSTEMS

Application Number	US2023060980
Publication Number	2023/141561
Status	In Force
Filing Date	2023-01-20
Publication Date	2023-07-27
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Milanovic, Ljubomir Salletmayr, Philip Gong, Rong Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for obtaining one or more speech signals from a first device, thus defining one or more first device speech signals. One or more speech signals may be obtained from a second device, thus defining one or more second device speech signals. A noise component model may be selected from a plurality of noise component models based upon, at least in part, the one or more first device speech signals and the one or more second device speech signals. The one or more second device speech signals may be augmented, at run-time, based upon, at least in part, the noise component model.

IPC Classes ?

G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 21/0208 - Noise filtering
G10L 21/0216 - Noise filtering characterised by the method used for estimating noise
G10L 21/0224 - Processing in the time domain
G10L 21/0232 - Processing in the frequency domain
G10L 25/00 - Speech or voice analysis techniques not restricted to a single one of groups

14. DATA AUGMENTATION SYSTEM AND METHOD FOR MULTI-MICROPHONE SYSTEMS

Application Number	US2023060989
Publication Number	2023/141565
Status	In Force
Filing Date	2023-01-20
Publication Date	2023-07-27
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Milanovic, Ljubomir Salletmayr, Philip Gong, Rong Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for obtaining one or more speech signals from a first device, thus defining one or more first device speech signals. One or more speech signals may be obtained from a second device, thus defining one or more second device speech signals. One or more acoustic relative transfer functions mapping reverberation from the one or more first device speech signals to the one or more second device speech signals may be generated. One or more augmented second device speech signals may be generated based upon, at least in part, the one or more acoustic relative transfer functions and first device training data

IPC Classes ?

G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 21/0208 - Noise filtering
G10L 21/0216 - Noise filtering characterised by the method used for estimating noise
G10L 21/0224 - Processing in the time domain
G10L 21/0232 - Processing in the frequency domain
G10L 25/00 - Speech or voice analysis techniques not restricted to a single one of groups

15. SYSTEMS AND METHODS FOR QUEUE CALL WAITING DEFLECTION

Application Number	US2022078629
Publication Number	2023/114571
Status	In Force
Filing Date	2022-10-25
Publication Date	2023-06-22
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Olvera, Eduardo

Abstract

A method, computer program product, and computer system for placing, by a computing device, a user into a first queue on a first communication channel to converse with a second user. A trigger may be identified for the first communication channel. The user may be sent a self-service option based upon, at least in part, identifying the trigger for the first communication channel, wherein the self-service option is sent on an alternate communication channel while the user is in the first queue.

IPC Classes ?

H04L 51/046 - Interoperability with other network applications or services

16. FEATURE DOMAIN BANDWIDTH EXTENSION AND SPECTRAL RE-BALANCE FOR ASR DATA AUGMENTATION

Application Number	US2022080193
Publication Number	2023/107816
Status	In Force
Filing Date	2022-11-18
Publication Date	2023-06-15
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant

Abstract

A method of processing speech includes: providing a first set of audio data having audio features in a first bandwidth; down-sampling the first set of audio data to a second bandwidth lower than the first bandwidth; producing, by a high frequency reconstruction network (HFRN), an estimate of audio features in the first bandwidth for the first set of audio data, based on at least the down-sampled audio data; inputting, into the HFRN, a second set of audio data having audio features in the second bandwidth; producing, by the HFRN, based on a second set of audio data having audio features in the second bandwidth, an estimate of audio features in the first bandwidth for the second set of audio data; and training a speech processing system (SPS) using the estimates of audio features in the first bandwidth for the first and second sets of audio data.

IPC Classes ?

G10L 21/0388 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques - Details of processing therefor
G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
G10L 13/00 - Speech synthesis; Text to speech systems

17. SYSTEM AND METHOD FOR GENERATING SYNTHETIC COHORTS USING GENERATIVE MODELING

Application Number	US2022078264
Publication Number	2023/076815
Status	In Force
Filing Date	2022-10-18
Publication Date	2023-05-04
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Talib, Haydar Vair, Claudio Farrell, Kevin Robert Colibro, Daniele Ernesto

Abstract

A method, computer program product, and computing system for generating a generative model representative of a plurality of natural biometric profiles. A plurality of random samples are generated from the generative model. A plurality of synthetic biometric profiles are generated based upon, at least in part, the plurality of random samples.

IPC Classes ?

G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules
G06F 21/32 - User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
G06F 17/18 - Complex mathematical operations for evaluating statistical data
G06F 40/44 - Statistical methods, e.g. probability models

18. TELEHEALTH ASSISTANCE SYSTEM AND METHOD

Application Number	US2022074550
Publication Number	2023/015263
Status	In Force
Filing Date	2022-08-04
Publication Date	2023-02-09
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Olvera, Eduardo

Abstract

A method, computer program product, and computing system for receiving a notification that a patient has arrived to a telehealth session before the telehealth session begins. The notification is received via a computing device. In response to receiving the notification that the patient has arrived to the telehealth session before the telehealth session begins, information associated with the patient is automatically pulled by a virtual assistant. The patient is prompted by the virtual assistant to complete a task before the telehealth session begins. A question is received from the patient before the telehealth session begins. Patient data may be obtained from one or more sources. The obtained patient data is processed to determine if the patient data is indicative of a possible medical condition and the medical condition is provided to a medical professional. An answer to the question is provided. The answer is personalized to the patient.

IPC Classes ?

G16H 80/00 - ICT specially adapted for facilitating communication between medical practitioners or patients, e.g. for collaborative diagnosis, therapy or health monitoring
G16H 50/20 - ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems

19. MULTI-ENCODER END-TO-END AUTOMATIC SPEECH RECOGNITION (ASR) FOR JOINT MODELING OF MULTIPLE INPUT DEVICES

Application Number	US2022034407
Publication Number	2022/271746
Status	In Force
Filing Date	2022-06-21
Publication Date	2022-12-29
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Weninger, Felix Gaudesi, Marco Leibold, Ralf Zhan, Puming

Abstract

An end-to-end automatic speech recognition (ASR) system includes: first encoder configured for close-talk input captured by a close-talk input mechanism; second encoder configured for far-talk input captured by far-talk input mechanism; and encoder selection layer configured to select at least one of first and second encoders for use in producing ASR output. The selection is made based on at least one of short-time Fourier transform (STFT), Mel-frequency Cepstral Coefficient (MFCC) and filter bank derived from at least one of the close-talk input and far-talk input. If signals from both the close-talk input mechanism and far-talk input mechanism are present for a speech segment, the encoder selection layer dynamically selects between the close-talk encoder and far-talk encoder to select the encoder that better recognizes the speech segment. An encoder-decoder model is used to produce ASR output.

IPC Classes ?

G10L 15/34 - Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
G10L 15/26 - Speech to text systems
G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog

20. TELEHEALTH SYSTEM AND METHOD

Application Number	US2022034833
Publication Number	2022/272017
Status	In Force
Filing Date	2022-06-24
Publication Date	2022-12-29
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Flechl, Martin Ganong Iii, William F. Gong, Rong Szep, Alexander

Abstract

A method, computer program product, and computing system for: monitoring a conversation between a patient and a medical entity; identifying a portion of the conversation associated with the patient, thus identifying a current patient conversation portion; and processing the current patient conversation portion to identify a condition associated with the patient, thus identifying a patient condition.

IPC Classes ?

A61B 1/00 - Instruments for performing medical examinations of the interior of cavities or tubes of the body by visual or photographical inspection, e.g. endoscopes; Illuminating arrangements therefor

21. FEEDBACK SYSTEM AND METHOD

Application Number	US2022034841
Publication Number	2022/272023
Status	In Force
Filing Date	2022-06-24
Publication Date	2022-12-29
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Brooks, Rupert A.

Abstract

A computer-implemented method, computer program product and computing system for enabling a user to initiate a problem-reporting procedure in response to an inaccurate result generated by an application when processing confidential data; processing the confidential data to generate at least one instantiation of non-confidential data that is related to the confidential data; and providing a preferred instantiation of the non-confidential data for troubleshooting the application.

IPC Classes ?

G06F 21/60 - Protecting data
G06Q 10/06 - Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
H04L 9/32 - Arrangements for secret or secure communications; Network security protocols including means for verifying the identity or authority of a user of the system

22. SYSTEM AND METHOD FOR SELF-ATTENTION-BASED COMBINING OF MULTICHANNEL SIGNALS FOR SPEECH PROCESSING

Application Number	US2022032168
Publication Number	2022/260951
Status	In Force
Filing Date	2022-06-03
Publication Date	2022-12-15
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Gong, Rong Quillen, Carl Benjamin Sharma, Dushyant Milanovic, Ljubomir

Abstract

A method, computer program product, and computing system for receiving a plurality of signals from a plurality of microphones, thus defining a plurality of channels. A weighted multichannel representation of the plurality of channels may be generated. A plurality of weights for each channel of the plurality of channels may be generated based upon, at least in part, the weighted multichannel representation of the plurality of channels. A single channel representation of the plurality of channels may be generated based upon, at least in part, the weighted multichannel representation of the plurality of channels and the plurality of weights generated for each channel of the plurality of channels.

IPC Classes ?

G06N 20/00 - Machine learning
G10L 21/00 - Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G16H 10/60 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
H04R 3/00 - Circuits for transducers

23. SYSTEM AND METHOD FOR CONTEXTUAL DENSITY RATIO-BASED BIASING OF SEQUENCE-TO-SEQUENCE PROCESSING SYSTEMS

Application Number	US2022032161
Publication Number	2022/256654
Status	In Force
Filing Date	2022-06-03
Publication Date	2022-12-08
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Ferrer, Jesus Andres Vozila, Paul Joseph Albesano, Dario

Abstract

A method, computer program product, and computer system for processing one or more portions of an input sequence to generate one or more candidate output sequences, thus defining a plurality of prediction scores for the candidate output sequences. One or more specialized entities may be identified from the candidate output sequences. A first scoring methodology may be applied on the candidate output sequences based upon the portions of the input sequence, thus defining a first set of prediction scores for the one or more candidate output sequences. A second scoring methodology may be applied on the specialized entities from the candidate output sequences based upon the portions of the input sequence, thus defining a second set of prediction scores for the specialized entities. The plurality of predictions scores for the specialized entities may be at least partially modified based upon the first set and the second set of prediction scores.

IPC Classes ?

G06N 7/00 - Computing arrangements based on specific mathematical models

24. TELEHEALTH SYSTEM AND METHOD

Application Number	US2022030284
Publication Number	2022/246221
Status	In Force
Filing Date	2022-05-20
Publication Date	2022-11-24
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Gong, Rong Milanovic, Ljubomir Kaljurand, Kaarel

Abstract

A method, computer program product, and computing system for: monitoring a conversation between a plurality of participants of a telehealth session; identifying an addressable issue within the conversation; and initiating an action to mitigate the addressable issue. A first participant of the plurality of participants may be a medical professional. A second participant of the plurality of participants may be a patient. The addressable issue may be a potential language barrier with one of the participants of the telehealth session. Initiating an action to mitigate the addressable issue may include: translating audio received by one of the participants of the telehealth session from a first language to a second language.

IPC Classes ?

G06F 40/35 - Discourse or dialogue representation
G10L 21/00 - Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G16H 10/60 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
G16H 80/00 - ICT specially adapted for facilitating communication between medical practitioners or patients, e.g. for collaborative diagnosis, therapy or health monitoring
H04N 7/14 - Systems for two-way working

25. TELEHEALTH SYSTEM AND METHOD

Application Number	US2022030300
Publication Number	2022/246231
Status	In Force
Filing Date	2022-05-20
Publication Date	2022-11-24
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Gong, Rong Flechl, Martin Milanovic, Ljubomir

Abstract

A method, computer program product, and computing system for: monitoring a conversation between a patient and a medical professional; identifying one or more medication recommendations for the patient based, at least in part, upon the conversation; and providing the one or more medication recommendations to the medical professional.

IPC Classes ?

G06N 5/04 - Inference or reasoning models

26. AUTOMATED CLINICAL DOCUMENTATION SYSTEM AND METHOD

Application Number	US2022021412
Publication Number	2022/204200
Status	In Force
Filing Date	2022-03-22
Publication Date	2022-09-29
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Gallopyn, Guido Remi Marcel Sharma, Dushyant Barreda, Daniel Paulino Almendro Naylor, Patrick A. Vozila, Paul Joseph Delaney, Brian W. Snider, Neal Pinto, Joel Praveen Ruiz, Miguel Enrique Zhang, Yi Owen, Donald E.

Abstract

A computer-implemented method, computer program product, and computing system for monitoring a plurality of encounter participants is executed on a computing device and includes obtaining encounter information of a user encounter. The encounter information is processed to: associate at least a first portion of the encounter information with at least one known encounter participant, and associate at least a second portion of the encounter information with at least one unknown encounter participant.

IPC Classes ?

G06F 21/00 - Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
A61B 5/00 - Measuring for diagnostic purposes ; Identification of persons

27. AUTOMATED CLINICAL DOCUMENTATION SYSTEM AND METHOD

Application Number	US2022021422
Publication Number	2022/204205
Status	In Force
Filing Date	2022-03-22
Publication Date	2022-09-29
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Owen, Donald, E. Erskine, Garret, N. Öz, Mehmet Mert Barreda, Daniel, Paulino, Almendro

Abstract

A computer-implemented method, computer program product, and computing system for visual diarization of a user encounter is executed on a computing device and includes obtaining encounter information of the user encounter. The encounter information is processed to: associate a first portion of the encounter information with a first encounter participant, and associate at least a second portion of the encounter information with at least a second encounter participant. A visual representation of the encounter information is rendered. A first visual representation of the first portion of the encounter information is rendered that is temporally-aligned with the visual representation of the encounter information. At least a second visual representation of the at least a second portion of the encounter information is rendered that is temporally-aligned with the visual representation of the encounter information.

IPC Classes ?

G06F 21/00 - Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
A61B 5/00 - Measuring for diagnostic purposes ; Identification of persons

28. AUTOMATED CLINICAL DOCUMENTATION SYSTEM AND METHOD

Application Number	US2022021375
Publication Number	2022/204171
Status	In Force
Filing Date	2022-03-22
Publication Date	2022-09-29
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Owen, Donald E. Erskine, Garret N. Gallopyn, Guido Remi Marcel Öz, Mehmet Mert Barreda, Daniel Paulino Almendro

Abstract

A computer-implemented method, computer program product, and computing system for automated clinical documentation is executed on a computing device and includes obtaining encounter information of a user encounter. The encounter information is processed to generate an encounter transcript. At least a portion of the encounter transcript is processed to populate at least a portion of a record associated with the user encounter.

IPC Classes ?

G16H 10/00 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data
G16H 10/20 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
G16H 10/60 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
G16H 15/00 - ICT specially adapted for medical reports, e.g. generation or transmission thereof

29. AUTOMATED CLINICAL DOCUMENTATION SYSTEM AND METHOD

Application Number	US2022021393
Publication Number	2022/204186
Status	In Force
Filing Date	2022-03-22
Publication Date	2022-09-29
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Owen, Donald E. Erskine, Garret N. Öz, Mehmet Mert Jost, Uwe Helmut Barreda, Daniel Paulino Almendro Sharma, Dushyant Gallopyn, Guido Remi Marcel Nour-Eldin, Amr Naylor, Patrick A.

Abstract

A computer-implemented method, computer program product, and computing system for rendering content is executed on a computing device and includes receiving a request to render content during a user encounter. If it is determined that the content includes sensitive content, a complete version of the content is rendered on a first device (wherein the complete version of the content includes the sensitive content) and a limited version of the content on a second device (wherein the limited version of the content excludes the sensitive content).

IPC Classes ?

G06F 21/00 - Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
A61B 5/00 - Measuring for diagnostic purposes ; Identification of persons

30. AUTOMATED CLINICAL DOCUMENTATION SYSTEM AND METHOD

Application Number	US2022021419
Publication Number	2022/204203
Status	In Force
Filing Date	2022-03-22
Publication Date	2022-09-29
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Gallopyn, Guido Remi Marcel Sharma, Dushyant Jost, Uwe Helmut Owen, Donald E. Naylor, Patrick A. Nour-Eldin, Amr Barreda, Daniel Paulino Almendro Öz, Mehmet Mert Erskine, Garret N.

Abstract

A computer-implemented method, computer program product, and computing system for source separation is executed on a computing device and includes obtaining encounter information of a user encounter, wherein the encounter information includes first audio encounter information obtained from a first encounter participant and at least second audio encounter information obtained from at least a second encounter participant. The first audio encounter information and the at least second audio encounter information are processed to eliminate audio interference between the first audio encounter information and the at least second audio encounter information.

IPC Classes ?

G06F 21/00 - Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
A61B 5/00 - Measuring for diagnostic purposes ; Identification of persons

31. SYSTEM AND METHOD FOR DATA AUGMENTATION AND SPEECH PROCESSING IN DYNAMIC ACOUSTIC ENVIRONMENTS

Application Number	US2022016844
Publication Number	2022/178162
Status	In Force
Filing Date	2022-02-17
Publication Date	2022-08-25
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Naylor, Patrick A. Sharma, Dushyant Jost, Uwe Helmut Ganong Iii, William F.

Abstract

A method, computer program product, and computing system for defining a model representative of a plurality of acoustic variations to a speech signal, thus defining a plurality of time-varying spectral modifications. The plurality of time-varying spectral modifications may be applied to a reference signal using a filtering operation, thus generating a time-varying spectrally-augmented signal.

IPC Classes ?

G10L 15/16 - Speech classification or search using artificial neural networks

32. SYSTEM AND METHOD FOR DATA AUGMENTATION AND SPEECH PROCESSING IN DYNAMIC ACOUSTIC ENVIRONMENTS

Application Number	US2022016832
Publication Number	2022/178151
Status	In Force
Filing Date	2022-02-17
Publication Date	2022-08-25
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Naylor, Patrick A. Sharma, Dushyant Jost, Uwe Helmut Ganong Iii, William F.

Abstract

A method, computer program product, and computing system for receiving one or more inputs indicative of at least one of: a relative location of a speaker and a microphone array, and a relative orientation of the speaker and the microphone array. One or more reference signals may be received. A speech processing system may be trained using the one or more inputs and the one or more reference signals.

IPC Classes ?

G06N 99/00 - Subject matter not provided for in other groups of this subclass
G10K 11/00 - Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
G10L 13/033 - Voice editing, e.g. manipulating the voice of the synthesiser
G10L 13/047 - Architecture of speech synthesisers
G10L 13/10 - Prosody rules derived from text; Stress or intonation

33. SYSTEM AND METHOD FOR DATA AUGMENTATION AND SPEECH PROCESSING IN DYNAMIC ACOUSTIC ENVIRONMENTS

Application Number	US2022016839
Publication Number	2022/178157
Status	In Force
Filing Date	2022-02-17
Publication Date	2022-08-25
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Naylor, Patrick A. Sharma, Dushyant Jost, Uwe Helmut Ganong Iii, William F.

Abstract

A method, computer program product, and computing system for defining model representative of a plurality of acoustic variations to a speech signal, thus defining a plurality of time-varying spectral modifications. The plurality of time-varying spectral modifications may be applied to a plurality of feature coefficients of a target domain of a reference signal, thus generating a plurality of time-varying spectrally-augmented feature coefficients of the reference signal.

IPC Classes ?

G10L 15/16 - Speech classification or search using artificial neural networks

34. MEDICAL INTELLIGENCE SYSTEM AND METHOD

Application Number	US2022015642
Publication Number	2022/173741
Status	In Force
Filing Date	2022-02-08
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Jancsary, Jeremy Martin Pinto, Joel Praveen Jost, Uwe Helmut Ganong Iii, William F.

Abstract

A method, computer program product, and computing system for: monitoring a meeting between a patient and a medical entity during a telehealth medical encounter; gathering information during the telehealth medical encounter, thus generating gathered encounter information; and rendering an informational window concerning the telehealth medical encounter for review by the patient and/or the medical entity, wherein the informational window is configured to provide supplemental information based, at least in part, upon the gathered encounter information.

IPC Classes ?

G06F 11/07 - Responding to the occurrence of a fault, e.g. fault tolerance

35. MEDICAL INTELLIGENCE SYSTEM AND METHOD

Application Number	US2022015651
Publication Number	2022/173744
Status	In Force
Filing Date	2022-02-08
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Jancsary, Jeremy, Martin Pinto, Joel, Praveen Jost, Uwe, Helmut Ganong Iii, William, F.

Abstract

A method, computer program product, and computing system for: monitoring a meeting between a patient and a medical entity during a medical encounter; gathering information during the medical encounter, thus generating gathered encounter information, wherein the gathered encounter information includes image-based content of the patient; generating image-based content information via artificial intelligence, wherein the image-based content information is based at least in part upon the image-based content and/or the gathered encounter information and is configured to provide guidance to the medical entity concerning the image-based content; and providing the image-based content information to the medical entity.

IPC Classes ?

A61B 5/00 - Measuring for diagnostic purposes ; Identification of persons
G16H 10/60 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
G16H 30/20 - ICT specially adapted for the handling or processing of medical images for handling medical images, e.g. DICOM, HL7 or PACS
G16H 30/40 - ICT specially adapted for the handling or processing of medical images for processing medical images, e.g. editing

36. MEDICAL INTELLIGENCE SYSTEM AND METHOD

Application Number	US2022015654
Publication Number	2022/173747
Status	In Force
Filing Date	2022-02-08
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Jancsary, Jeremy Martin Pinto, Joel Praveen Jost, Uwe Helmut Ganong Iii, William F.

Abstract

A method, computer program product, and computing system for: monitoring a meeting between a patient and a medical entity during a medical encounter; gathering information during the medical encounter, thus generating gathered encounter information, wherein the gathered encounter information includes audio-based content of the patient; generating audio-based content information via artificial intelligence, wherein the audio-based content information is based at least in part upon the audio-based content and/or the gathered encounter information and is configured to provide guidance to the medical entity concerning the audio-based content; and providing the audio-based content information to the medical entity.

IPC Classes ?

G06F 3/16 - Sound input; Sound output
G06F 3/0482 - Interaction with lists of selectable items, e.g. menus
G16H 10/20 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
G16H 15/00 - ICT specially adapted for medical reports, e.g. generation or transmission thereof
G06N 20/00 - Machine learning
G06N 3/00 - Computing arrangements based on biological models

37. MEDICAL INTELLIGENCE SYSTEM AND METHOD

Application Number	US2022015657
Publication Number	2022/173748
Status	In Force
Filing Date	2022-02-08
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Jancsary, Jeremy Martin Pinto, Joel Praveen Jost, Uwe Helmut Ganong Iii, William F.

Abstract

A method, computer program product, and computing system for: monitoring a meeting between a patient and a medical entity during a medical encounter; gathering information during the medical encounter, thus generating gathered encounter information, wherein the gathered encounter information includes video-based content of the patient; generating video-based content information via artificial intelligence, wherein the video-based content information is based at least in part upon the video-based content and/or the gathered encounter information and is configured to provide guidance to the medical entity concerning the video-based content; and providing the video-based content information to the medical entity.

IPC Classes ?

A61B 5/00 - Measuring for diagnostic purposes ; Identification of persons
G16H 10/60 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
G16H 30/20 - ICT specially adapted for the handling or processing of medical images for handling medical images, e.g. DICOM, HL7 or PACS
G16H 30/40 - ICT specially adapted for the handling or processing of medical images for processing medical images, e.g. editing

38. MEDICAL INTELLIGENCE SYSTEM AND METHOD

Application Number	US2022015667
Publication Number	2022/173752
Status	In Force
Filing Date	2022-02-08
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Jancsary, Jeremy Martin Pinto, Joel Praveen Jost, Uwe Helmut Ganong Iii, William F.

Abstract

IPC Classes ?

G06F 3/16 - Sound input; Sound output
G06F 3/0482 - Interaction with lists of selectable items, e.g. menus
G16H 10/20 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
G16H 15/00 - ICT specially adapted for medical reports, e.g. generation or transmission thereof
G06N 20/00 - Machine learning
G06N 3/00 - Computing arrangements based on biological models

39. MEDICAL INTELLIGENCE SYSTEM AND METHOD

Application Number	US2022015671
Publication Number	2022/173754
Status	In Force
Filing Date	2022-02-08
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Jancsary, Jeremy Martin Pinto, Joel Praveen Jost, Uwe Helmut Ganong Iii, William F.

Abstract

A method, computer program product, and computing system for: monitoring a meeting between a patient and a medical entity during a medical encounter; gathering information during the medical encounter, thus generating gathered encounter information; generating medical encounter topical information via artificial intelligence, wherein the medical encounter topical information is based at least in part upon the gathered encounter information and is configured to provide guidance to the medical entity concerning one or more topics to be discussed during the medical encounter; and providing the medical encounter topical information to the medical entity.

IPC Classes ?

G06E 1/00 - Devices for processing exclusively digital data

40. MULTI-CHANNEL SPEECH COMPRESSION SYSTEM AND METHOD

Application Number	US2022016021
Publication Number	2022/173980
Status	In Force
Filing Date	2022-02-10
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Jost, Uwe Helmut

Abstract

A method, computer program product, and computing system for selecting a reference audio acquisition device from a plurality of audio acquisition devices of an audio recording system. Audio encounter information of the reference microphone may be encoded, thus defining encoded reference audio encounter information. A plurality of acoustic relative transfer functions between the reference microphone and the plurality of audio acquisition devices of the audio recording system may be generated. The encoded reference audio encounter information and a representation of the plurality of acoustic relative transfer functions may be transmitted.

IPC Classes ?

G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

41. MULTI-CHANNEL SPEECH COMPRESSION SYSTEM AND METHOD

Application Number	US2022016030
Publication Number	2022/173986
Status	In Force
Filing Date	2022-02-10
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Jost, Uwe Helmut

Abstract

A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions between a plurality of audio acquisition devices of an audio recording system based upon, at least in part, one or more of a predefined speech processing application and a predefined acoustic environment. An acoustic relative transfer function codebook may be generated using the plurality of acoustic relative transfer functions. One or more channels from the plurality of audio acquisition devices of the audio recording system may be encoded using the acoustic relative transfer function codebook.

IPC Classes ?

G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination

42. FIRST AND SECOND EMBEDDING OF ACOUSTIC RELATIVE TRANSFER FUNCTIONS

Application Number	US2022016032
Publication Number	2022/173988
Status	In Force
Filing Date	2022-02-10
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Jost, Uwe Helmut

Abstract

A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions for a plurality of audio acquisition devices of an audio recording system deployed in an acoustic environment. The plurality of acoustic relative transfer functions may be encoded into a first embedding of acoustic relative transfer functions and at least a second embedding of acoustic relative transfer functions. Information may be extracted from at least the first embedding of acoustic relative transfer functions.

IPC Classes ?

G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
G10L 19/16 - Vocoder architecture

43. MEDICAL INTELLIGENCE SYSTEM AND METHOD

Application Number	US2022015649
Publication Number	2022/173742
Status	In Force
Filing Date	2022-02-08
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Jancsary, Jeremy Martin Pinto, Joel Praveen Jost, Uwe Helmut Ganong, William F. Iii

Abstract

A method, computer program product, and computing system for: monitoring a meeting between a patient and a medical entity during a medical encounter; gathering information during the medical encounter, thus generating gathered encounter information; generating medical encounter workflow information via artificial intelligence, wherein the medical encounter workflow information is based at least in part upon the gathered encounter information and is configured to provide guidance to the medical entity concerning a desired workflow for the medical encounter; and providing the medical encounter workflow information to the medical entity.

IPC Classes ?

G06Q 10/06 - Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
G06Q 10/10 - Office automation; Time management
G16H 10/20 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
G16H 10/40 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for data related to laboratory analysis, e.g. patient specimen analysis
G16H 10/60 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records

44. MEDICAL INTELLIGENCE SYSTEM AND METHOD

Application Number	US2022015660
Publication Number	2022/173749
Status	In Force
Filing Date	2022-02-08
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Jancsary, Jeremy Martin Pinto, Joel Praveen Jost, Uwe Helmut Ganong Iii, William F.

Abstract

IPC Classes ?

G06E 1/00 - Devices for processing exclusively digital data

45. COMMUNICATION SYSTEM AND METHOD

Application Number	US2022015995
Publication Number	2022/173962
Status	In Force
Filing Date	2022-02-10
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Pinto, Joel Praveen

Abstract

A method, computer program product, and computing system for receiving audio-based content from a user who is reviewing an image on a display screen; receiving gaze information that defines a gaze location of the user; and temporally aligning the audio-based content and the gaze information to form location-based content.

IPC Classes ?

G06F 3/00 - Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements

46. COMPARING ACOUSTIC RELATIVE TRANSFER FUNCTIONS FROM AT LEAST A PAIR OF TIME FRAMES

Application Number	US2022016024
Publication Number	2022/173982
Status	In Force
Filing Date	2022-02-10
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Jost, Uwe Helmut

Abstract

A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions associated with a plurality of audio acquisition devices of an audio recording system deployed in an acoustic environment. At least a pair of the plurality of acoustic relative transfer functions from time frames may be compared. A change in the acoustic environment may be detected based upon, at least in part, the comparison of the plurality of acoustic relative transfer functions from at least the pair of time frames.

IPC Classes ?

G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination

47. MULTI-CHANNEL SPEECH COMPRESSION SYSTEM AND METHOD

Application Number	US2022016027
Publication Number	2022/173984
Status	In Force
Filing Date	2022-02-10
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick, A. Jost, Uwe, Helmut

Abstract

A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions associated with a plurality of audio acquisition devices of an audio recording system deployed in an acoustic environment. Acoustic relative transfer functions of at least a pair of audio acquisition devices of the plurality of audio acquisition devices may be compared. Location information associated with an acoustic source within the acoustic environment may be determined based upon, at least in part, the comparison of the acoustic relative transfer functions of the at least a pair of audio acquisition devices of the plurality of audio acquisition devices.

IPC Classes ?

H04R 3/00 - Circuits for transducers
H04R 5/00 - Stereophonic arrangements
H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
H04S 5/00 - Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation

48. MULTI-CHANNEL SPEECH COMPRESSION SYSTEM AND METHOD

Application Number	US2022016033
Publication Number	2022/173989
Status	In Force
Filing Date	2022-02-10
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Jost, Uwe Helmut

Abstract

A method, computer program product, and computing system for encoding audio encounter information of a reference audio acquisition device of a plurality of audio acquisition devices of an audio recording system, thus defining encoded reference audio encounter information. Location information may be estimated, via a machine vision system, for an acoustic source within an acoustic environment. One or more acoustic relative transfer functions may be selected from a plurality of acoustic relative transfer functions for the plurality of audio acquisition devices of the audio recording system based upon, at least in part, the location information. The encoded reference audio encounter information and a representation of the selected one or more acoustic relative transfer function may be transmitted.

IPC Classes ?

H04R 3/00 - Circuits for transducers
H04R 5/00 - Stereophonic arrangements
H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
G10L 15/08 - Speech classification or search
G10L 15/10 - Speech classification or search using distance or distortion measures between unknown speech and reference templates
G10L 15/16 - Speech classification or search using artificial neural networks
H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers

49. MULTI-CHANNEL SPEECH COMPRESSION SYSTEM AND METHOD

Application Number	US2022016034
Publication Number	2022/173990
Status	In Force
Filing Date	2022-02-10
Publication Date	2022-08-18
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Jost, Uwe Helmut

Abstract

A method, computer program product, and computing system for obtaining machine vision encounter information using one or more machine vision systems. Audio encounter information may be obtained using a plurality of audio acquisition devices of an audio recording system. The audio encounter information may be encoded using an audio codec. The encoding of the audio encounter information by the audio codec may be adapted based upon, at least in part, the machine vision encounter information.

IPC Classes ?

G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/08 - Speech classification or search
G10L 15/10 - Speech classification or search using distance or distortion measures between unknown speech and reference templates
H04R 3/00 - Circuits for transducers

50. AI PLATFORM SYSTEM AND METHOD

Application Number	US2021064665
Publication Number	2022/140424
Status	In Force
Filing Date	2021-12-21
Publication Date	2022-06-30
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Wunderink, Jamin Smith, Rob

Abstract

A computer-implemented method, computer program product and computing system for defining a test truth set from a master truth set; processing the test truth set using an automated analysis process to generate an automated result set; determining a process efficacy for the automated analysis process based, at least in part, upon the test truth set and the automated result set; and rendering the process efficacy of the automated analysis process.

IPC Classes ?

G06F 19/00 - Digital computing or data processing equipment or methods, specially adapted for specific applications (specially adapted for specific functions G06F 17/00;data processing systems or methods specially adapted for administrative, commercial, financial, managerial, supervisory or forecasting purposes G06Q;healthcare informatics G16H)
G06K 9/62 - Methods or arrangements for recognition using electronic means

51. AI PLATFORM SYSTEM AND METHOD

Application Number	US2021064684
Publication Number	2022/140440
Status	In Force
Filing Date	2021-12-21
Publication Date	2022-06-30
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Wunderink, Jamin Smith, Rob

Abstract

IPC Classes ?

G06N 3/00 - Computing arrangements based on biological models
G06N 20/00 - Machine learning
G06N 3/02 - Neural networks
G06N 20/20 - Ensemble learning
G16H 30/00 - ICT specially adapted for the handling or processing of medical images

52. AMBIENT COOPERATIVE INTELLIGENCE SYSTEM AND METHOD

Application Number	US2021064701
Publication Number	2022/140451
Status	In Force
Filing Date	2021-12-21
Publication Date	2022-06-30
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Vozila, Paul Joseph Snider, Neal

Abstract

A method, computer program product, and computing system for monitoring a plurality of conversations within a monitored space to generate a conversation data set; processing the conversation data set using machine learning to: define a system-directed command for an ACI system, and associate one or more conversational contexts with the system-directed command; detecting the occurrence of a specific conversational context within the monitored space, wherein the specific conversational context is included in the one or more conversational contexts associated with the system-directed command; and executing, in whole or in part, functionality associated with the system-directed command in response to detecting the occurrence of the specific conversational context without requiring the utterance of the system-directed command and/or a wake-up word / phrase.

IPC Classes ?

G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 15/18 - Speech classification or search using natural language modelling
G06N 20/00 - Machine learning

53. AI PLATFORM SYSTEM AND METHOD

Application Number	US2021064675
Publication Number	2022/140433
Status	In Force
Filing Date	2021-12-21
Publication Date	2022-06-30
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Wunderink, Jamin Smith, Rob

Abstract

IPC Classes ?

G06F 19/00 - Digital computing or data processing equipment or methods, specially adapted for specific applications (specially adapted for specific functions G06F 17/00;data processing systems or methods specially adapted for administrative, commercial, financial, managerial, supervisory or forecasting purposes G06Q;healthcare informatics G16H)
G06K 9/62 - Methods or arrangements for recognition using electronic means

54. FEEDBACK SYSTEM AND METHOD

Application Number	US2021058943
Publication Number	2022/103937
Status	In Force
Filing Date	2021-11-11
Publication Date	2022-05-19
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Vemula, Raghu

Abstract

A computer-implemented method, computer program product and computing system for receiving a result set for content processed by an automated analysis process; receiving human feedback concerning the result set; and providing feedback information to the developer of the automated analysis process based, at least in part, upon the result set and the human feedback.

IPC Classes ?

G16H 30/40 - ICT specially adapted for the handling or processing of medical images for processing medical images, e.g. editing
G06N 3/08 - Learning methods
G06T 7/00 - Image analysis
G16H 10/60 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records

55. AMBIENT COOPERATIVE INTELLIGENCE SYSTEM AND METHOD

Application Number	US2021056274
Publication Number	2022/093648
Status	In Force
Filing Date	2021-10-22
Publication Date	2022-05-05
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Gallopyn, Guido Remi Marcel Ganong Iii, William F.

Abstract

A method, computer program product, and computing system for initiating a session within an ACI platform; receiving an authentication request from a requester; and authenticating that the requester has the authority to access the ACI platform.

IPC Classes ?

H04L 29/06 - Communication control; Communication processing characterised by a protocol

56. AMBIENT COOPERATIVE INTELLIGENCE SYSTEM AND METHOD

Application Number	US2021056265
Publication Number	2022/093646
Status	In Force
Filing Date	2021-10-22
Publication Date	2022-05-05
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Gallopyn, Guido Remi Marcel Ganong Iii, William F.

Abstract

A method, computer program product, and computing system for detecting the issuance of a verbal command by a requester to a virtual assistant; authenticating that the requester has the authority to issue the verbal command to the virtual assistant; if the requester is authenticated, allowing the effectuation of the verbal command to the virtual assistant; and if the requester is not authenticated, preventing the effectuation of the verbal command to the virtual assistant.

IPC Classes ?

G06F 21/32 - User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
G10L 17/22 - Interactive procedures; Man-machine interfaces

57. FRAUD DETECTION SYSTEM AND METHOD

Application Number	US2021056251
Publication Number	2022/087409
Status	In Force
Filing Date	2021-10-22
Publication Date	2022-04-28
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Talib, Haydar

Abstract

A method, computer program product, and computing system for receiving input information concerning a conversation between a caller and a recipient; processing the input information to assess a fraud-threat-level; defining a targeted response based, at least in part, upon the fraud-threat-level assessed, wherein the targeted response is intended to refine the assessed fraud-threat-level; and effectuating the targeted response.

IPC Classes ?

H04M 3/42 - Systems providing special services or facilities to subscribers

58. SYSTEM AND METHOD FOR GENERATING RESPONSES FOR CONVERSATIONAL AGENTS

Application Number	US2021045426
Publication Number	2022/035887
Status	In Force
Filing Date	2021-08-10
Publication Date	2022-02-17
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Liu, Ding Vozila, Paul Joseph Stubley, Peter Dunlop, Aaron Joseph Fu, Zhiping Bonetta, Gionanni

Abstract

A method, computer program product, and computer system for predicting responses to at least one conversational phrase. At least one conversational phrase may be received. A first probability for a subset of candidate responses of a plurality of candidate responses may be determined based upon, at least in part, context associated with the at least one conversational phrase, the at least one conversational phrase, and each context associated with the plurality of candidate responses. A second probability for the subset of candidate responses may be determined based upon, at least in part, the subset of candidate responses, the at least one conversational phrase, and the context associated with the at least one conversational phrase. At least one candidate response for the at least one conversational phrase may be determined based upon, at least in part, the first probability and the second probability.

IPC Classes ?

G10L 21/00 - Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility

59. FRAUD DETECTION SYSTEM AND METHOD

Application Number	US2021035886
Publication Number	2021/247987
Status	In Force
Filing Date	2021-06-04
Publication Date	2021-12-09
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Talib, Haydar Robo, Damian Marchand, Simon Channasamudhram, Adiseshu

Abstract

A method, computer program product, and computing system for performing an assessment of initial input information, concerning a communication from a caller, to define an initial fraud-threat-level; if the initial fraud-threat-level is below a defined threat threshold, providing the communication to a recipient so that a conversation may occur between the recipient and the caller; performing an assessment of subsequent input information, concerning the conversation, to define a subsequent fraud-threat-level; and effectuating a targeted response based, at least in part, upon the subsequent fraud-threat-level, wherein the targeted response is intended to refine the subsequent fraud-threat-level.

IPC Classes ?

G06F 21/32 - User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
G06Q 50/26 - Government or public services
G10L 17/06 - Decision making techniques; Pattern matching strategies
G10L 17/26 - Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
H04M 3/22 - Arrangements for supervision, monitoring or testing
H04M 3/42 - Systems providing special services or facilities to subscribers

60. SYSTEM AND METHOD FOR DATA AUGMENTATION FOR MULTI-MICROPHONE SIGNAL PROCESSING

Application Number	US2021031369
Publication Number	2021/226507
Status	In Force
Filing Date	2021-05-07
Publication Date	2021-11-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Gong, Rong Kruchinin, Stanislav Milanovic, Ljubomir

Abstract

A method, computer program product, and computing system for receiving a speech signal from each microphone of a plurality of microphones, thus defining a plurality of signals. One or more noise signals associated with microphone self-noise may be received. One or more self-noise-based augmentations may be performed on the plurality of signals based upon, at least in part, the one or more noise signals associated with microphone self-noise, thus defining one or more self-noise-based augmented signals.

IPC Classes ?

H04R 3/00 - Circuits for transducers
H04R 25/00 - Deaf-aid sets
H04R 29/00 - Monitoring arrangements; Testing arrangements
H04B 15/00 - Suppression or limitation of noise or interference
H04R 5/00 - Stereophonic arrangements

61. SYSTEM AND METHOD FOR DATA AUGMENTATION FOR MULTI-MICROPHONE SIGNAL PROCESSING

Application Number	US2021031378
Publication Number	2021/226515
Status	In Force
Filing Date	2021-05-07
Publication Date	2021-11-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Gong, Rong Kruchinin, Stanislav Milanovic, Ljubomir

Abstract

A method, computer program product, and computing system for receiving a signal from each microphone of a plurality of microphones, thus defining a plurality of signals. Harmonic distortion associated with at least one microphone may be determined. One or more harmonic distortion-based augmentations may be performed on the plurality of signals based upon, at least in part, the harmonic distortion associated with the at least one microphone, thus defining one or more harmonic distortion-based augmented signals.

IPC Classes ?

G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 15/26 - Speech to text systems
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation

62. SYSTEM AND METHOD FOR MULTI-MICROPHONE AUTOMATED CLINICAL DOCUMENTATION

Application Number	US2021031498
Publication Number	2021/226568
Status	In Force
Filing Date	2021-05-10
Publication Date	2021-11-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for receiving information associated with an acoustic environment. A plurality of filters may be predefined to produce a plurality of beams based upon, at least in part, the information associated with the acoustic environment. The plurality of filters may be predefined to produce a plurality of nulls based upon, at least in part, the information associated with the acoustic environment. Audio encounter information may be obtained, via one or more microphone arrays, using the plurality of beams and the plurality of nulls produced by the plurality of predefined filters.

IPC Classes ?

G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 21/0208 - Noise filtering
G10K 11/16 - Methods or devices for protecting against, or for damping, noise or other acoustic waves in general

63. SYSTEM AND METHOD FOR MULTI-MICROPHONE AUTOMATED CLINICAL DOCUMENTATION

Application Number	US2021031504
Publication Number	2021/226570
Status	In Force
Filing Date	2021-05-10
Publication Date	2021-11-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick, A.

Abstract

A method, computer program product, and computing system for receiving audio encounter information from a microphone array. Speech activity within one or more portions of the audio encounter information may be identified based upon, at least in part, a correlation among the audio encounter information received from the microphone array. Location information for the one or more portions of the audio encounter information may be determined based upon, at least in part, the correlation among the signals received by each microphone of the microphone array. The one or more portions of the audio encounter information may be labeled with the speech activity and the location information.

IPC Classes ?

G10L 15/00 - Speech recognition
G10L 15/01 - Assessment or evaluation of speech recognition systems
G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/08 - Speech classification or search
G10L 15/16 - Speech classification or search using artificial neural networks

64. SYSTEM AND METHOD FOR MULTI-MICROPHONE AUTOMATED CLINICAL DOCUMENTATION

Application Number	US2021031508
Publication Number	2021/226571
Status	In Force
Filing Date	2021-05-10
Publication Date	2021-11-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for receiving information associated with an acoustic environment. Acoustic metadata associated with audio encounter information received by a first microphone system may be received. One or more speaker representations may be defined based upon, at least in part, the acoustic metadata associated with the audio encounter information and the information associated with the acoustic environment. One or more portions of the audio encounter information may be labeled with the one or more speaker representations and a speaker location within the acoustic environment.

IPC Classes ?

G10L 15/01 - Assessment or evaluation of speech recognition systems
G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/08 - Speech classification or search
G10L 15/16 - Speech classification or search using artificial neural networks

65. SYSTEM AND METHOD FOR MULTI-MICROPHONE AUTOMATED CLINICAL DOCUMENTATION

Application Number	US2021031512
Publication Number	2021/226573
Status	In Force
Filing Date	2021-05-10
Publication Date	2021-11-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for receiving a plurality of predefined beams associated with a microphone array. A plurality of predefined nulls associated with the microphone array may be received. One or more predefined beams from the plurality of predefined beams or one or more predefined nulls from the plurality of predefined nulls may be selected. A microphone array may obtain audio encounter information, via the microphone array, using at least one of the one or more selected beams and the one or more selected nulls.

IPC Classes ?

G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 15/26 - Speech to text systems
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation

66. SYSTEM AND METHOD FOR MULTI-MICROPHONE AUTOMATED CLINICAL DOCUMENTATION

Application Number	US2021031516
Publication Number	2021/226574
Status	In Force
Filing Date	2021-05-10
Publication Date	2021-11-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for receiving audio encounter information from a first microphone system, thus defining a first audio stream. Audio encounter information may be received from a second microphone system, thus defining a second audio stream. Speech activity may be detected in one or more portions of the first audio stream, thus defining one or more speech portions of the first audio stream. Speech activity may be detected in one or more portions of the second audio stream, thus defining one or more speech portions of the second audio stream. The first audio stream and the second audio stream may be aligned based upon, at least in part, the one or more speech portions of the first audio stream and the one or more speech portions of the second audio stream.

IPC Classes ?

G10L 15/01 - Assessment or evaluation of speech recognition systems
G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/08 - Speech classification or search
G10L 15/16 - Speech classification or search using artificial neural networks

67. SYSTEM AND METHOD FOR DATA AUGMENTATION FOR MULTI-MICROPHONE SIGNAL PROCESSING

Application Number	US2021031363
Publication Number	2021/226503
Status	In Force
Filing Date	2021-05-07
Publication Date	2021-11-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Gong, Rong Kruchinin, Stanislav Milanovic, Ljubomir

Abstract

A method, computer program product, and computing system for receiving a signal from each microphone of a plurality of microphones, thus defining a plurality of signals. One or more inter-microphone gain-based augmentations may be performed on the plurality of signals, thus defining one or more inter-microphone gain-augmented signals. Performing the one or more inter-microphone gain-based augmentations on the plurality of signals may include applying a gain level from a plurality of gain levels to the signal from each microphone. Applying the gain level from the plurality of gain levels to the signal from each microphone may include applying a gain level, from a predefined range of gain levels, to the signal from each microphone. Applying the gain level from the plurality of gain levels to the signal from each microphone may include applying a random gain level, from the predefined range of gain levels, to the signal from each microphone.

IPC Classes ?

H04R 3/00 - Circuits for transducers
H04R 3/02 - Circuits for transducers for preventing acoustic reaction
H04R 3/04 - Circuits for transducers for correcting frequency response
G06F 3/16 - Sound input; Sound output

68. SYSTEM AND METHOD FOR DATA AUGMENTATION FOR MULTI-MICROPHONE SIGNAL PROCESSING

Application Number	US2021031374
Publication Number	2021/226511
Status	In Force
Filing Date	2021-05-07
Publication Date	2021-11-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Gong, Rong Kruchinin, Stanislav Milanovic, Ljubomir

Abstract

A method, computer program product, and computing system for receiving a signal from each microphone of a plurality of microphones, thus defining a plurality of signals. One or more microphone frequency responses associated with at least one microphone may be received. One or more microphone frequency response-based augmentations may be performed on the plurality of signals based upon, at least in part, the one or more microphone frequency responses, thus defining one or more microphone frequency response-based augmented signals.

IPC Classes ?

G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 15/26 - Speech to text systems
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation

69. SYSTEM AND METHOD FOR DATA AUGMENTATION OF FEATURE-BASED VOICE DATA

Application Number	US2021021716
Publication Number	2021/183652
Status	In Force
Filing Date	2021-03-10
Publication Date	2021-09-16
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Fosburgh, James W.

Abstract

A method, computer program product, and computing system for receiving feature-based voice data associated with a first acoustic domain. One or more gain-based augmentations may be performed on at least a portion of the feature-based voice data, thus defining gain-augmented feature-based voice data.

IPC Classes ?

G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
G10L 15/00 - Speech recognition
G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/16 - Speech classification or search using artificial neural networks
G10L 15/18 - Speech classification or search using natural language modelling
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation

70. SYSTEM AND METHOD FOR DATA AUGMENTATION OF FEATURE-BASED VOICE DATA

Application Number	US2021021721
Publication Number	2021/183655
Status	In Force
Filing Date	2021-03-10
Publication Date	2021-09-16
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for receiving feature-based voice data associated with a first acoustic domain. One or more rate-based augmentations may be performed on at least a portion of the feature-based voice data, thus defining rate-based augmented feature-based voice data.

IPC Classes ?

G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
G10L 15/00 - Speech recognition
G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/16 - Speech classification or search using artificial neural networks
G10L 15/18 - Speech classification or search using natural language modelling
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation

71. SYSTEM AND METHOD FOR DATA AUGMENTATION OF FEATURE-BASED VOICE DATA

Application Number	US2021021725
Publication Number	2021/183657
Status	In Force
Filing Date	2021-03-10
Publication Date	2021-09-16
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for receiving feature-based voice data associated with a first acoustic domain. One or more audio feature-based augmentations may be performed on at least a portion of the feature-based voice data. Performing the one or more audio feature-based augmentations may include adding one or more audio features to the at least a portion of the feature-based voice data and/or removing one or more audio features from the at least a portion of the feature-based voice data.

IPC Classes ?

G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
G10L 15/00 - Speech recognition
G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/16 - Speech classification or search using artificial neural networks
G10L 15/18 - Speech classification or search using natural language modelling
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation

72. AMBIENT COOPERATIVE INTELLIGENCE SYSTEM AND METHOD

Application Number	US2021021965
Publication Number	2021/183801
Status	In Force
Filing Date	2021-03-11
Publication Date	2021-09-16
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Pinto, Joel Praveen Barreda, Daniel Paulino Ahmendro

Abstract

A A method, computer program product, and computing system for generating a three-dimensional model of at least a portion of a three-dimensional space incorporating an ACI system via a video recording subsystem of an ACI calibration platform; and generating one or more audio calibration signals for receipt by an audio recording system included within the ACI system via an audio generation subsystem of the ACI calibration platform.

IPC Classes ?

G06F 3/16 - Sound input; Sound output
G06N 5/02 - Knowledge representation; Symbolic representation

73. AMBIENT COOPERATIVE INTELLIGENCE SYSTEM AND METHOD

Application Number	US2021021968
Publication Number	2021/183804
Status	In Force
Filing Date	2021-03-11
Publication Date	2021-09-16
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Pinto, Joel Praveen Barreda, Daniel Paulino Almendro

Abstract

A method, computer program product, and computing system for obtaining calibration information for a three-dimensional space incorporating an ACI system; and processing the calibration information to calibrate the ACI system.

IPC Classes ?

G06F 3/16 - Sound input; Sound output
G06N 5/02 - Knowledge representation; Symbolic representation

74. SYSTEM AND METHOD FOR DATA AUGMENTATION OF FEATURE-BASED VOICE DATA

Application Number	US2021021712
Publication Number	2021/183649
Status	In Force
Filing Date	2021-03-10
Publication Date	2021-09-16
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Do Yeong, Kim

Abstract

A method, computer program product, and computing system for extracting acoustic metadata from a signal. The signal may be converted from the time domain to the feature domain, thus defining feature-based voice data associated with the signal. The feature-based voice data associated with the signal may be processed based upon, at least in part, the acoustic metadata.

IPC Classes ?

G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
G10L 15/00 - Speech recognition
G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/16 - Speech classification or search using artificial neural networks
G10L 15/18 - Speech classification or search using natural language modelling
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation

75. SYSTEM AND METHOD FOR DATA AUGMENTATION OF FEATURE-BASED VOICE DATA

Application Number	US2021021730
Publication Number	2021/183660
Status	In Force
Filing Date	2021-03-10
Publication Date	2021-09-16
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Do Yeong, Kim Fosburgh, James W.

Abstract

A method, computer program product, and computing system for receiving feature based voice data associated with a first acoustic domain. One or more reverberation-based augmentations may be performed on at least a portion of the feature-based voice data, thus defining reverberation-augmented feature-based voice data. Performing the one or more reverberation-based augmentations to the at least a portion of the feature-based voice data may include performing the one or more reverberation-based augmentations to the at least a portion of the feature-based voice data based upon, at least in part, the target acoustic domain.

IPC Classes ?

G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
G10L 15/00 - Speech recognition
G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit
G10L 15/16 - Speech classification or search using artificial neural networks
G10L 15/18 - Speech classification or search using natural language modelling
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation

76. SYSTEM AND METHOD FOR DATA AUGMENTATION OF FEATURE-BASED VOICE DATA

Application Number	US2021021745
Publication Number	2021/183668
Status	In Force
Filing Date	2021-03-10
Publication Date	2021-09-16
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Do Yeong, Kim Fosburgh, James W.

Abstract

A method, computer program product, and computing system for receiving feature-based voice data. One or more data augmentation characteristics may be received. One or more augmentations of the feature-based voice data may be generated, via a machine learning model, based upon, at least in part, the feature-based voice data and the one or more data augmentation characteristics

IPC Classes ?

G06F 15/18 - in which a program is changed according to experience gained by the computer itself during a complete run; Learning machines (adaptive control systems G05B 13/00;artificial intelligence G06N)

77. SYSTEM AND METHOD FOR REVIEW OF AUTOMATED CLINICAL DOCUMENTATION

Application Number	US2020053504
Publication Number	2021/067413
Status	In Force
Filing Date	2020-09-30
Publication Date	2021-04-08
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Pinto, Joel, Praveen

Abstract

A method, computer program product, and computing system for obtaining, by a computing device, encounter information of a patient encounter, wherein the encounter information may include audio encounter information and video encounter information obtained from at least a first encounter participant. A report of the patient encounter may be generated based upon, at least in part, the encounter information. A relative importance of a word in the report may be determined. A portion of the video encounter information that corresponds to the word in the report may be determined. The portion of the video encounter information that corresponds to the word in the report may be stored at a first location, wherein the video encounter information may be stored at a second location remote from the first location.

IPC Classes ?

G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

78. VEHICLE AVATAR DEVICES FOR INTERACTIVE VIRTUAL ASSISTANT

Application Number	CN2019104012
Publication Number	2021/042238
Status	In Force
Filing Date	2019-09-02
Publication Date	2021-03-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Zhao, Shenbin Lin, Jianchao Li, Nan Yu, David Shiah, Lei Lin, Fatty Xu, Bruno Xia, Feng Liu, Feng

Abstract

A system and method for providing avatar device (115, 125, 135, 145) status indicators for voice assistants in multi-zone vehicles. The method comprises: receiving at least one signal from a plurality of microphones (114, 124, 134, 144), wherein each microphone (114, 124, 134, 144) is associated with one of a plurality of spatial zones (110, 120, 130, 140), and one of a plurality of avatar devices (115, 125, 135, 145); wherein the at least one signal further comprises a speech signal component from a speaker; wherein the speech signal component is a voice command or question; sending zone information associated with the speaker and with one of the plurality of spatial zones (110, 120, 130, 140) to an avatar (115, 125, 135, 145); activating one the plurality of avatar devices (115, 125, 135, 145) in a respective one of the plurality of spatial zones (110, 120, 130, 140) associated with the speaker.

IPC Classes ?

G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog

79. SYSTEM AND METHOD FOR QUERYING DATA POINTS FROM GRAPH DATA STRUCTURES

Application Number	US2020037226
Publication Number	2020/252160
Status	In Force
Filing Date	2020-06-11
Publication Date	2020-12-17
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Öz, Mehmet Mert Helletzgruber, Matthais Ungar, Peter

Abstract

A method, computer program product, and computing system includes generating a graph data structure including a plurality of data points. A query for the graph data structure may be received via a user interface. At least one data point from the plurality of data points may be identified, via the user interface, in the graph data structure based upon, at least in part, the query. A selection of a data point from the identified at least one data point may be received via the user interface. The selected data point may be provided to one or more electronic data sources.

IPC Classes ?

G06N 5/04 - Inference or reasoning models
G06N 99/00 - Subject matter not provided for in other groups of this subclass
H04L 29/08 - Transmission control procedure, e.g. data link level control procedure

80. AMBIENT CLINICAL INTELLIGENCE SYSTEM AND METHOD

Application Number	US2020037284
Publication Number	2020/252196
Status	In Force
Filing Date	2020-06-11
Publication Date	2020-12-17
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Owen, Donald E. Gallopyn, Guido Remi Marcel Vozila, Paul Joseph Öz, Mehmet Mert Hebert, Matthieu

Abstract

A method, computer program product, and computing system for obtaining encounter information during a patient encounter; processing the encounter information to detect the execution of a physical event during the patient encounter, thus defining a detected physical event; and deriving information for the detected physical event

IPC Classes ?

G16H 10/20 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
G16H 10/40 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for data related to laboratory analysis, e.g. patient specimen analysis
G16H 10/60 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
G16H 10/65 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records stored on portable record carriers, e.g. on smartcards, RFID tags or CD

81. MULTI-CHANNEL MICROPHONE SIGNAL GAIN EQUALIZATION BASED ON EVALUATION OF CROSS TALK COMPONENTS

Application Number	US2020032517
Publication Number	2020/242758
Status	In Force
Filing Date	2020-05-12
Publication Date	2020-12-03
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Matheja, Timo Buck, Markus

Abstract

Gain mismatch and related problems can be solved by a system and method that applies an automatic microphone signal gain equalization without any direct absolute reference or calibration phase. The system and method performs the steps of receiving, by a computing device, a speech signal from a speaking person via a plurality of microphones, determining a speech signal component in the time- frequency domain for each microphone of the plurality of microphones, calculating an instantaneous cross-talk coupling matrix based on the speech signal components across the microphones, estimating gain factors based on calculated cross-talk couplings and a given expected cross-talk attenuation, limiting the gain factors to appropriate maximum and minimum values, and applying the gain factors to the speech signal used in the control path to control further speech enhancement algorithms or used in the signal path for direct influence on the speech enhanced audio output signal.

IPC Classes ?

G10L 21/0208 - Noise filtering
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
H04B 3/32 - Reducing cross-talk, e.g. by compensating
H03G 3/00 - Gain control in amplifiers or frequency changers
G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
H04J 1/12 - Arrangements for reducing cross-talk between channels
H04J 3/10 - Arrangements for reducing cross-talk between channels
G10L 21/00 - Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility

82. MULTI-MICROPHONE SPEECH DIALOG SYSTEM FOR MULTIPLE SPATIAL ZONES

Application Number	US2020032521
Publication Number	2020/242759
Status	In Force
Filing Date	2020-05-12
Publication Date	2020-12-03
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Matheja, Timo Buck, Markus Kirbach, Andreas Roessler, Martin Haulick, Tim Premont, Julien Anastasiadis, Josef Vuerinckx, Rudi Ris, Christophe Verschaeren, Stijn Ari, Hakan Ranz, Dieter

Abstract

There is provided a speech dialog system that includes a first microphone, a second microphone, a processor and a memory. The first microphone captures first audio from a first spatial zone, and produces a first audio signal. The second microphone captures second audio from a second spatial zone, and produces a second audio signal. The processor receives the first audio signal and the second audio signal, and the memory contains instructions that control the processor to perform operations of a speech enhancement module, an automatic speech recognition module, and a speech dialog module that performs a zone-dedicated speech dialog.

IPC Classes ?

G10L 17/06 - Decision making techniques; Pattern matching strategies
G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 15/08 - Speech classification or search
G06F 40/20 - Natural language analysis

83. SPEECH DIALOG SYSTEM AWARE OF ONGOING CONVERSATIONS

Application Number	US2020030403
Publication Number	2020/223304
Status	In Force
Filing Date	2020-04-29
Publication Date	2020-11-05
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Wolff, Tobias Lenke, Nils

Abstract

Disclosed are systems and methods aware of ongoing conversations and configured to intelligently schedule a speech prompt to an intended addressee. A method for intelligently scheduling a speech prompt in a speech dialog system includes monitoring an acoustic environment to detect an intended addressee's availability for a speech prompt having a measure of urgency corresponding therewith. Based on the intended addressee's availability, the method predicts a time that is convenient to present the speech prompt to the intended addressee, and schedules the speech prompt based on the predicted time and the measure of urgency. A measure of rudeness can be estimated using a cost function that includes cost for presence of an utterance, cost for presence of a conversation, and cost for involvement of the intended addressee in the conversation. Scheduling the speech prompt can include trading off the measure of urgency and the measure of rudeness.

IPC Classes ?

G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G06F 3/16 - Sound input; Sound output

84. SYSTEM AND METHOD FOR ACOUSTIC LOCALIZATION OF MULTIPLE SOURCES USING SPATIAL PRE-FILTERING

Application Number	US2019065172
Publication Number	2020/118290
Status	In Force
Filing Date	2019-12-09
Publication Date	2020-06-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Wolff, Tobias Graf, Simon Matheja, Timo

Abstract

A method, computer program product, and computer system for identifying, by a computing device, a plurality of sources, wherein a first source of the plurality of sources is a source of interest and wherein a second source of the plurality of sources is an interference source. The first source and the second source may be monitored simultaneously by implementing a spatial pre-filter for acoustic source localization.

IPC Classes ?

G01S 5/22 - Position of source determined by co-ordinating a plurality of position lines defined by path-difference measurements
G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 21/0208 - Noise filtering
G10L 21/0272 - Voice signal separating
G10L 21/028 - Voice signal separating using properties of sound source
H04R 3/00 - Circuits for transducers

85. SYSTEM AND METHOD FOR FEATURE BASED BEAM STEERING

Application Number	US2019065177
Publication Number	2020/118291
Status	In Force
Filing Date	2019-12-09
Publication Date	2020-06-11
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Wolff, Tobias Graf, Simon

Abstract

A method, computer program product, and computer system for identifying, by a computing device, a plurality of sources. One or more feature values of a plurality of features may be assigned to a first source of the plurality of sources. One or more feature values of the plurality of features may be assigned to a second source of the plurality of sources. A first score for the first source and a second score for the second source may be determined based upon, at least in part, the one or more feature values assigned to the first source and the second source. One of the first source and the second source may be selected for spatial processing based upon, at least in part, the first score for the first source and the second score for the second source.

IPC Classes ?

G10L 15/00 - Speech recognition
G10L 15/04 - Segmentation; Word boundary detection
G10L 15/08 - Speech classification or search
G01S 3/00 - Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
H04R 1/00 - LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS - Details of transducers
H04R 1/32 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
H04R 3/00 - Circuits for transducers

86. SYSTEM AND METHOD FOR ACCELERATING USER AGENT CHATS

Application Number	US2019061772
Publication Number	2020/102703
Status	In Force
Filing Date	2019-11-15
Publication Date	2020-05-22
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Vozila, Paul Joseph Stubley, Peter Beaumont, Jean-Francois Liu, Ding Ganong, William

Abstract

A method, computer program product, and computer system for identifying, by a computing device, a model for predicting conversational phrases for a communication between at least a first user and a second user. The model may be trained based upon, at least in part, an attribute associated with the second user. At least one conversational phrase may be predicted for the communication between the first user and the second user. The at least one conversational phrase may be provided to the second user as an optional phrase to be sent to the first user.

IPC Classes ?

G10L 15/18 - Speech classification or search using natural language modelling
H04M 3/493 - Interactive information services, e.g. directory enquiries
H04M 11/00 - Telephonic communication systems specially adapted for combination with other electrical systems

87. CALLER DEFLECTION AND RESPONSE SYSTEM AND METHOD

Application Number	US2019060913
Publication Number	2020/097616
Status	In Force
Filing Date	2019-11-12
Publication Date	2020-05-14
Owner	NUANCE COMMUNICATIONS, INC (USA)
Inventor	Dougherty, Theodore Mak, Adam Stuczynski, Adam Ellis, Matt

Abstract

Provided are a call deflection and response system and method, wherein a voice call from a caller device is received, a skill group is determined to resolve an issue associated with the call, and a callback or a text response to the issue is provided to the caller device, providing a context-based personalized response. A caller leaves a detailed voicemail explaining an issue needing resolution, which is electronically transcribed and then run through a classifier to determine concepts and intents associated with the call. Based on the concepts and intents, responsibility for the call and associated files are transferred to a particular skill group on a response system for resolution. A response entity from the appropriate skill group determines and provides an issue response via callback or text message to the caller device, e.g., to the caller's mobile phone.

IPC Classes ?

H04M 3/523 - Centralised call answering arrangements requiring operator intervention with call distribution or queuing
H04M 3/493 - Interactive information services, e.g. directory enquiries
G06F 17/27 - Automatic analysis, e.g. parsing, orthograph correction

88. SYSTEM AND METHOD FOR MANAGING A MUTE BUTTON SETTING FOR A CONFERENCE CALL

Application Number	US2019055129
Publication Number	2020/076779
Status	In Force
Filing Date	2019-10-08
Publication Date	2020-04-16
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Lenke, Nils Montague, Eric Ganong Iii, William F.

Abstract

A system, method and computer-readable storage device are disclosed for managing a mute and unmute feature on a device which is used to communicate data in a communication conference. The method includes detecting, when the device is set to mute, whether the user is speaking and whether the speech is meant for the conference. Background noises are distinguished from the speech of the user. If the user is speaking and the device is set to mute, the device will automatically switch to and unmute setting such that people in the indication conference can hear the user speak. Facial recognition, and gaze detection or other data can also be used to determine when to automatically mute or unmute the device and can aid in inferring an intent of the user to speak to the conference participants.

IPC Classes ?

G06F 3/16 - Sound input; Sound output
G06F 3/01 - Input arrangements or combined input and output arrangements for interaction between user and computer
G06T 7/20 - Analysis of motion
G10L 21/0208 - Noise filtering
G06K 9/00 - Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
H04N 7/15 - Conference systems

89. SYSTEM AND METHOD FOR ACOUSTIC DETECTION OF EMERGENCY SIRENS

Application Number	US2019042582
Publication Number	2020/072116
Status	In Force
Filing Date	2019-07-19
Publication Date	2020-04-09
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Buck, Markus Premont, Julien Faubel, Friedrich

Abstract

A system and method for detecting multi-tone sirens despite environmental noises that may be present obtains a microphone input signal, applies, in real time, a time-frequency analysis to the microphone input signal to determine a time-frequency representation, provides at least one multi-tone model that has a plurality of tone duration patterns, performs multi-tone siren detection on the time-frequency representation, the detection based on the at least one multi-tone model and factoring of doppler shifts, and generates a detection result that can be used in systems for automated vehicles.

IPC Classes ?

G01S 13/58 - Velocity or trajectory determination systems; Sense-of-movement determination systems
B60W 30/08 - Predicting or avoiding probable or impending collision

90. MULTI-CHARACTER TEXT INPUT SYSTEM WITH AUDIO FEEDBACK AND WORD COMPLETION

Application Number	US2019049505
Publication Number	2020/051209
Status	In Force
Filing Date	2019-09-04
Publication Date	2020-03-12
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Morwing, Jonas Friberg, Christer Sternby, Jakob Andersson, Jonas

Abstract

A system for inputting and processing handwritten, multi -character text may comprise a handwriting recognition subsystem, a word completion subsystem, and an audio feedback system. The handwriting recognition system may be configured to capture a series of handwritten characters formed by a user and to convert the handwritten characters into a set of candidate partial text strings. The word completion subsystem may be configured to identify if a candidate partial text string constitutes a word segment and if so, generate one or both of (i) at least one clarifying word and (ii) at least one clarifying phrase that includes the clarifying word. The word segment may be an arbitrary string and not correspond to a valid complete word in a language associated with the system. The audio feedback subsystem may be configured to produce an audio representation of the word segment(s), the clarifying word(s), and the clarifying phrase(s).

IPC Classes ?

G06F 40/274 - Converting codes to words; Guess-ahead of partial word inputs
G10L 13/02 - Methods for producing synthetic speech; Speech synthesisers
G06F 3/0488 - Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures

91. SYSTEM AND METHOD FOR ACOUSTIC SPEAKER LOCALIZATION

Application Number	US2019047689
Publication Number	2020/041580
Status	In Force
Filing Date	2019-08-22
Publication Date	2020-02-27
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Karimian-Azari, Sam Sharma, Dushyant Nour-Eldin, Amr Naylor, Patrick A.

Abstract

A method, computer program product, and computing system for acoustic speech localization, comprising receiving, via a plurality of microphones, a plurality of audio signals. Modulation properties of the plurality of audio signals may be analyzed. Speech sounds may be localized from the plurality of audio signals based upon, at least in part, the modulation properties of the plurality of audio signals.

IPC Classes ?

G10L 15/00 - Speech recognition
G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
G10L 19/06 - Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
G10L 19/26 - Pre-filtering or post-filtering
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 21/0272 - Voice signal separating
G10L 21/028 - Voice signal separating using properties of sound source
G10L 21/0308 - Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
H04R 1/00 - LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS - Details of transducers
H04R 5/027 - Spatial or constructional arrangements of microphones, e.g. in dummy heads

92. AUDIO STREAM MIXING SYSTEM AND METHOD

Application Number	US2019044775
Publication Number	2020/033239
Status	In Force
Filing Date	2019-08-02
Publication Date	2020-02-13
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Lenke, Nils Couvreur, Christophe

Abstract

Provided are a system and method of mixing a second audio stream with a first audio stream in an audio output device. The system is configured to execute the method, comprising buffering and outputting the first audio stream via the audio output device as unmodified output, determining at least one insertion spot within the first audio stream, modifying the first audio stream at an insertion spot to avoid content loss, outputting the second audio stream at the insertion spot, and resuming unmodified output of the first audio stream at or near a completion of the second audio stream. Modifying the first audio stream can include pausing and/or warping the first audio stream at the insertion spot. The audio output device can be a vehicle head unit or a wireless device, such as a mobile phone.

IPC Classes ?

G06F 3/16 - Sound input; Sound output
H04H 20/62 - Arrangements specially adapted for specific applications, e.g. for traffic information or for mobile receivers for local area broadcast, e.g. instore broadcast for transportation systems, e.g. in vehicles

93. SYSTEM AND METHOD FOR GENERATING DIALOGUE GRAPHS

Application Number	US2019040128
Publication Number	2020/006558
Status	In Force
Filing Date	2019-07-01
Publication Date	2020-01-02
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Beaumont, Jean-Francois Khameneh, Nastaran Jafarpour Stubley, Peter Tepper, Paul A. Rohatgi, Abhishek Negrean, Flaviu Gelu Chavez Padron, Marco Antonio

Abstract

A method, computer program product, and computing system for automatically generating a dialogue graph is executed on a computing device and includes receiving a plurality of conversation data. A plurality of utterance pairs from the plurality of conversation data may be clustered into a plurality of utterance pair clusters. A dialogue graph may be generated with a plurality of nodes representative of the plurality of utterance pair clusters.

IPC Classes ?

G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G06F 16/35 - Clustering; Classification
G06F 17/21 - Text processing
G10L 15/00 - Speech recognition
G10L 15/18 - Speech classification or search using natural language modelling
G10L 15/26 - Speech to text systems

94. SYSTEM AND METHOD FOR DISCRIMINATIVE TRAINING OF REGRESSION DEEP NEURAL NETWORKS

Application Number	US2019028742
Publication Number	2019/209841
Status	In Force
Filing Date	2019-04-23
Publication Date	2019-10-31
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Faubel, Friedrich Sautter, Jonas

Abstract

A method, computer program product, and computer system for transforming, by a computing device, a speech signal into a speech signal representation. A regression deep neural network may be trained with a cost function to minimize a mean squared error between actual values of the speech signal representation and estimated values of the speech signal representation, wherein the cost function may include one or more discriminative terms. Bandwidth of the speech signal may be extended by extending the speech signal representation of the speech signal using the regression deep neural

IPC Classes ?

G06N 20/00 - Machine learning

95. INTELLIGENT CALL CENTER AGENT ASSISTANT

Application Number	US2019027272
Publication Number	2019/200287
Status	In Force
Filing Date	2019-04-12
Publication Date	2019-10-17
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Rohatgi, Abhishek Padron, Marco, Antonio Negrean, Flaviu, Gelu Tepper, Paul, Andrew

Abstract

A system for processing user requests provides for recommending products or services for an agent to offer a customer or potential customer. The system classifies a user request by an intent, and presents documentation to assist the agent in handling the request. The system further parses the user request to detect life events experienced by the user that may raise the prospect of the user's interest in other products or services. Based on the detected life events, a number of offers are presented to an agent for recommendation to the user.

IPC Classes ?

G06F 17/27 - Automatic analysis, e.g. parsing, orthograph correction

96. SYSTEM AND METHOD FOR REVIEW OF AUTOMATED CLINICAL DOCUMENTATION

Application Number	US2019020739
Publication Number	2019/173331
Status	In Force
Filing Date	2019-03-05
Publication Date	2019-09-12
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Vozila, Paul Joseph Gallopyn, Guido Remi Marcel Jost, Uwe Helmut Helletzgruber, Matthias Jancsary, Jeremy Martin Abhinav, Kumar Pinto, Joel Praveen Owen, Donald E. Oz, Mehmet Mert Sbihili, Scott Erskine, Garret N. Delaney, Brian William

Abstract

A method, computer program product, and computing system for obtaining, by a computing device, encounter information of a patient encounter, wherein the encounter information may include audio encounter information obtained from at least a first encounter participant. The audio encounter information obtained from at least the first encounter participant may be processed. A user interface may be generated displaying a plurality of layers associated with the audio encounter information obtained from at least the first encounter participant.

IPC Classes ?

G06F 19/00 - Digital computing or data processing equipment or methods, specially adapted for specific applications (specially adapted for specific functions G06F 17/00;data processing systems or methods specially adapted for administrative, commercial, financial, managerial, supervisory or forecasting purposes G06Q;healthcare informatics G16H)

97. AUTOMATED CLINICAL DOCUMENTATION SYSTEM AND METHOD

Application Number	US2019020742
Publication Number	2019/173333
Status	In Force
Filing Date	2019-03-05
Publication Date	2019-09-12
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Vozila, Paul Joseph Pinto, Joel Praveen Abhinav, Kumar Li, Haibo Amoia, Marilisa Diehl, Frank

Abstract

A method, computer program product, and computing system for: receiving an initial portion of an encounter record; processing the initial portion of the encounter record to generate initial content for a medical report; receiving one or more additional portions of the encounter record; and processing the one or more additional portions of the encounter record to modify the medical report.

IPC Classes ?

G10L 15/00 - Speech recognition
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 17/00 - Speaker identification or verification

98. SYSTEM AND METHOD FOR REVIEW OF AUTOMATED CLINICAL DOCUMENTATION

Application Number	US2019020755
Publication Number	2019/173340
Status	In Force
Filing Date	2019-03-05
Publication Date	2019-09-12
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Vozila, Paul, Joseph Gallopyn, Guido, Remi, Marcel Jost, Uwe, Helmut Helletzgruber, Matthias Jancsary, Jeremy, Martin Abhinav, Kumar Pinto, Joel, Praveen Owen, Donald, E. Oz, Mehmet, Mert Sbihili, Scott Erskine, Garret, N. Delaney, Brian, William

Abstract

A method, computer program product, and computing system for obtaining, by a computing device, encounter information of a patient encounter, wherein the encounter information may include audio encounter information obtained from at least a first encounter participant. The audio encounter information obtained from at least the first encounter participant may be processed. A user interface may be generated displaying a plurality of layers associated with the audio encounter information obtained from at least the first encounter participant, wherein at least one of the plurality of layers is one of exposed to the user interface and not exposed to the user interface based upon, at least in part, a confidence level.

IPC Classes ?

G06Q 50/24 - Patient record management (processing of medical or biological data for scientific purposes G06F 19/00)

99. SYSTEM AND METHOD FOR REVIEW OF AUTOMATED CLINICAL DOCUMENTATION

Application Number	US2019020765
Publication Number	2019/173349
Status	In Force
Filing Date	2019-03-05
Publication Date	2019-09-12
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Vozila, Paul Joseph Gallopyn, Guido Remi Marcel Jost, Uwe Helmut Helletzgruber, Matthias Jancsary, Jeremy Martin Abhinav, Kumar Pinto, Joel Praveen Owen, Donald E. Oz, Mehmet Mert Sbihili, Scott Erskine, Garret N. Delaney, Brian William

Abstract

A method, computer program product, and computing system for obtaining, by a computing device, encounter information of a patient encounter, wherein the encounter information may include audio encounter information obtained from at least a first encounter participant. The audio encounter information obtained from at least the first encounter participant may be processed. A user interface may be generated displaying a plurality of layers associated with the audio encounter information obtained from at least the first encounter participant. A user input may be received from a peripheral device to navigate through each of the plurality of layers associated with the audio encounter information displayed on the user interface.

IPC Classes ?

G06F 3/00 - Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements

100. AUTOMATED CLINICAL DOCUMENTATION SYSTEM AND METHOD

Application Number	US2019020788
Publication Number	2019/173362
Status	In Force
Filing Date	2019-03-05
Publication Date	2019-09-12
Owner	NUANCE COMMUNICATIONS, INC. (USA)
Inventor	Sharma, Dushyant Naylor, Patrick A. Jost, Uwe Helmut

Abstract

A method, computer program product, and computing system for initially aligning two or more audio signals to address coarse temporal misalignment between the two or more audio signals. The two or more audio signals are detected by two or more audio detection systems within a monitored space. The two or more audio signals are subsequently realigned to address ongoing temporal signal drift between the two or more audio signals. A method, computer program product, and computing system for determining a time delay between a first audio signal received on a first audio detection system and a second audio signal received on a second audio detection system. The first and second audio detection systems are located within a monitored space. The first audio detection system is located with respect to the second audio detection system within the monitored space.

IPC Classes ?

G10L 21/00 - Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G10L 25/00 - Speech or voice analysis techniques not restricted to a single one of groups

1 2 3 4 Next Page