BibSLEIGH — speech tag

speech

speech

Tag #speech

247 papers:

CoG-2019-SykownikBM #analysis #automation #pipes and filters #sentiment: Can You Hear the Player Experienceƒ A Pipeline for Automated Sentiment Analysis of Player Speech (PS, FB, MM), pp. 1–4.
ICML-2019-FuLTL #black box #generative #metric #named #network #optimisation: MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (SWF, CFL, YT0, SDL), pp. 2031–2041.
ICML-2019-KenterWCCV #named #network #synthesis: CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network (TK, VW, CaC, RC, JV), pp. 3331–3340.
ICML-2019-QinCCGR #automation #recognition #robust: Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition (YQ, NC, GWC, IJG, CR), pp. 5231–5240.
ICML-2019-RenTQZZL #automation #recognition: Almost Unsupervised Text to Speech and Automatic Speech Recognition (YR, XT, TQ, SZ, ZZ, TYL), pp. 5410–5419.
ICST-2019-IwamaF #automation #recognition #testing: Automated Testing of Basic Recognition Capability for Speech Recognition Systems (FI, TF), pp. 13–24.
EDM-2018-GautamMGR #automation #categorisation #chat: Automated Speech Act Categorization of Chat Utterances in Virtual Internships (DG, NM, AG, VR).
ICSME-2018-KrasniqiM #component #developer #generative: TraceLab Components for Generating Speech Act Types in Developer Question/Answer Conversations (RK, CM), p. 713.
CIKM-2018-FangZYMZ #benchmark #metric #named #video: TED-KISS: A Known-Item Speech Video Search Benchmark (FF, BWZ, XCY, HXM, FZ), pp. 1803–1806.
ICML-2018-OordLBSVKDLCSCG #parallel #performance #synthesis: Parallel WaveNet: Fast High-Fidelity Speech Synthesis (AvdO, YL, IB, KS, OV, KK, GvdD, EL, LCC, FS, NC, DG, SN, SD, EE, NK, HZ, AG, HK, TW, DB, DH), pp. 3915–3923.
ICML-2018-Skerry-RyanBXWS #synthesis #towards: Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron (RJSR, EB, YX, YW, DS, JS, RJW, RC, RAS), pp. 4700–4709.
ICML-2018-WangSZRBSXJRS #modelling #synthesis: Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (YW, DS, YZ, RJSR, EB, JS, YX, YJ, FR, RAS), pp. 5167–5176.
ICPR-2018-DingLXKS #generative #network #recognition #robust #towards: Mutual-optimization Towards Generative Adversarial Networks For Robust Speech Recognition (KD, NL, YX, DK, KS), pp. 2699–2704.
ICPR-2018-LiZXGX18a #network #recognition: Recurrent Neural Network Based Small-footprint Wake-up-word Speech Recognition System with a Score Calibration Method (CL, LZ, SX, PG, BX0), pp. 3222–3227.
ICPR-2018-SaitohK #database #named #recognition #smarttech #visual notation: SSSD: Speech Scene database by Smart Device for Visual Speech Recognition (TS, MK), pp. 3228–3232.
ICPR-2018-XiaoW #animation #network: Dense Convolutional Recurrent Neural Network for Generalized Speech Animation (LX, ZW), pp. 633–638.
ESEC-FSE-2018-WoodRAM #debugging #detection #developer: Detecting speech act types in developer question/answer conversations during bug repair (AW, PR, AA, CM), pp. 491–502.
ICML-2017-NagamineM #case study #comprehension #multi #recognition #representation: Understanding the Representation and Computation of Multilayer Perceptrons: A Case Study in Speech Recognition (TN, NM), pp. 2564–2573.
ICML-2017-OchiaiWHH #multi #recognition: Multichannel End-to-end Speech Recognition (TO, SW, TH, JRH), pp. 2632–2641.
ICSME-2016-OlneyHTL #java: Part of Speech Tagging Java Method Names (WO, EH0, CT, BL), pp. 483–487.
MSR-2016-MoslehiAR #documentation #mining #on the: On mining crowd-based speech documentation (PM, BA, JR), pp. 259–268.
DiGRA-FDG-2016-LyonLZ #design #game studies: Combining Speech Intervention and Cooperative Game Design for Children with ASD (NL, DIL, JZ).
CIKM-2016-ManshaKKA #identification #network #self: A Self-Organizing Map for Identifying InfluentialCommunities in Speech-based Networks (SM, FK, AK, AA), pp. 1965–1968.
ICML-2016-AmodeiABCCCCCCD #recognition: Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin (DA, SA, RA, JB, EB, CC, JC, BC, JC, MC, AC, GD, EE, JHE, LF, CF, AYH, BJ, TH, PL, XL, LL, SN, AYN, SO, RP, SQ, JR, SS, DS, SS, CW0, YW, ZW, BX, YX, DY, JZ, ZZ), pp. 173–182.
ICPR-2016-ChakrabortyPK #information management #recognition #using: Spontaneous speech emotion recognition using prior knowledge (RC, MP, SKK), pp. 2866–2871.
ICPR-2016-PironkovDD #automation #learning #multi #recognition: Speaker-aware Multi-Task Learning for automatic speech recognition (GP, SD, TD), pp. 2900–2905.
EDM-2015-BlanchardDON #analysis #automation #education #towards: Classifying Q&A from Teachers' Speech: Moving Toward an Automated System of Dialogic Analysis (NB, SKD, AO, MN), pp. 282–288.
CIG-2015-Kendall #game studies: Keynote speech IV: Where games meet hyper-heuristics (GK), p. 19.
CIG-2015-Lucas #challenge #game studies #video: Keynote speech II: General video game AI: Challenges and applications (SL), p. 17.
CIG-2015-Muller #challenge #research: Keynote speech III: Computer go research - The challenges ahead (MM0), p. 18.
CIG-2015-Yao #game studies #learning: Keynote speech I: Co-evolutionary learning in game-playing (XY0), p. 16.
CHI-2015-LimerickMC #empirical #interface: Empirical Evidence for a Diminished Sense of Agency in Speech Interfaces (HL, JWM, DC), pp. 3967–3970.
CHI-2015-McMillanLB: Repurposing Conversation: Experiments with the Continuous Speech Stream (DM, AL, BATB), pp. 3953–3962.
CHI-2015-McNaneyPVBZO #named #people: LApp: A Speech Loudness Application for People with Parkinson’s on Google Glass (RM, IP, JV, MB, PZ, PO), pp. 497–500.
DUXU-IXD-2015-CaiLLH #case study #experience #research #user interface: User Experience Research on the Rehabilitation System of Speech-Impaired Children — A Case Study on Speech Training Product (WC, JL, QL, TH), pp. 562–574.
DUXU-IXD-2015-WangWG #comparison: Cross Cultural Comparison of Users’ Barge-in with the In-Vehicle Speech System (PW, UW, TJG), pp. 529–540.
RecSys-2015-Stock #automation #persuasion: A (Persuasive?) Speech on Automated Persuasion (OS), pp. 1–2.
SAC-2015-Soares0W #approach #modelling #named #recognition #requirements: VoiceToModel: an approach to generate requirements models from speech recognition mechanisms (FS, JA, FW), pp. 1350–1357.
SIGITE-2014-Jonas #experience #research: Capstone experience: achieving success with an undergraduate research group in speech (MJ), pp. 55–60.
CHI-PLAY-2014-LanAABG #game studies #interactive: Flappy voice: an interactive game for childhood apraxia of speech therapy (TL, SA, BA, KJB, RGO), pp. 429–430.
CHI-2014-HamidiB #interface #named: Rafigh: a living media interface for speech intervention (FH, MB), pp. 1817–1820.
CHI-2014-Vosoughi #automation #recognition #visual notation: Improving automatic speech recognition through head pose driven visual grounding (SV), pp. 3235–3238.
HCI-AIMT-2014-AlmeidaST #design #development #interactive: Design and Development of Speech Interaction: A Methodology (NA, SSS, AJST), pp. 370–381.
HCI-AIMT-2014-JonssonD #interactive #performance: Driving with a Speech Interaction System: Effect of Personality on Performance and Attitude of Driver (IMJ, ND), pp. 417–428.
HCI-TMT-2014-ColetiMN #evaluation #named #recognition #usability #using: ErgoSV: An Environment to Support Usability Evaluation Using Face and Speech Recognition (TAC, MM, FdLdSN), pp. 554–564.
ICEIS-v3-2014-SilvaFG14a #artificial reality: Assisting Speech Therapy for Autism Spectrum Disorders with an Augmented Reality Application (CAdS, ARF, APG), pp. 38–45.
ICML-c2-2014-GravesJ #network #recognition #towards: Towards End-To-End Speech Recognition with Recurrent Neural Networks (AG, NJ), pp. 1764–1772.
ICPR-2014-DengZS #learning #recognition: Linked Source and Target Domain Subspace Feature Transfer Learning — Exemplified by Speech Emotion Recognition (JD, ZZ, BWS), pp. 761–766.
ICPR-2014-MustiZP #3d #animation #estimation #image #visual notation: Facial 3D Shape Estimation from Images for Visual Speech Animation (UM, ZZ, MP), pp. 40–45.
ICPR-2014-NishimuraOAN #image #modelling #using #web: Selection of Unknown Objects Specified by Speech Using Models Constructed from Web Images (HN, YO, YA, MN), pp. 477–482.
ICPR-2014-YuncuHB #automation #modelling #recognition #using: Automatic Speech Emotion Recognition Using Auditory Models with Binary Decision Tree and SVM (EY, HH, CB), pp. 773–778.
SIGIR-2014-Jones #retrieval #tool support: Speech search: techniques and tools for spoken content retrieval (GJFJ), p. 1287.
ICDAR-2013-LiWTLG #image #locality: Unsupervised Speech Text Localization in Comic Images (LL, YW, ZT, XL, LG), pp. 1190–1194.
ICDAR-2013-RigaudBOKW #detection: An Active Contour Model for Speech Balloon Detection in Comics (CR, JCB, JMO, DK, JvdW), pp. 1240–1244.
VS-Games-2013-LoaizaOCPALN #game studies #prototype #video: A Video Game Prototype for Speech Rehabilitation (DL, CO, AC, AP, GIA, DL, AN), pp. 1–4.
CHI-2013-RazaHTPRSR: Job opportunities through entertainment: virally spread speech-based services for low-literate users (AAR, FuH, ZT, MP, SR, US, RR), pp. 2803–2812.
DHM-SET-2013-GeorgeK #modelling #towards: Towards Enhancing the Acoustic Models for Dysarthric Speech (KKG, CSK), pp. 183–188.
DUXU-CXC-2013-TailebAAAA #named #recognition: YUSR: Speech Recognition Software for Dyslexics (MT, RAS, AAG, MAZ, SAS), pp. 296–303.
HCI-AMTE-2013-ColetiMN #automation #evaluation #recognition #usability: Analyzing Face and Speech Recognition to Create Automatic Information for Usability Evaluation (TAC, MM, FdLdSN), pp. 184–192.
HCI-AS-2013-Harris #interface: Emotion and Emotion Regulation Considerations for Speech-Based In-Vehicle Interfaces (HH), pp. 571–577.
HCI-AS-2013-WeiHCHK #design #elicitation: Ergonomics Design with Novice Elicitation on an Auditory-Only In-Vehicle Speech System (MHW, SLH, HCC, JYH, CCK), pp. 654–660.
HCI-III-2013-RebenitschO: Facial Electromyogram Activation as Silent Speech Method (LR, CBO), pp. 464–473.
HCI-IMT-2013-GalatasPM #artificial reality #distance #multi #recognition #robust #video: Robust Multi-Modal Speech Recognition in Two Languages Utilizing Video and Distance Information from the Kinect (GG, GP, FM), pp. 43–48.
HCI-IMT-2013-KuncMLK: Speech-Based Text Correction Patterns in Noisy Environment (LK, TM, ML, JK), pp. 59–66.
HCI-IMT-2013-MedjkouneMPV #multimodal #recognition: Multimodal Mathematical Expressions Recognition: Case of Speech and Handwriting (SM, HM, SP, CVG), pp. 77–86.
HCI-IMT-2013-RigasA13a #communication #interface: Investigating the Impact of Combining Speech and Earcons to Communicate Information in E-government Interfaces (DR, BA), pp. 23–31.
HCI-IMT-2013-WangGHL #collaboration #communication #elicitation #nondeterminism #using: A Knowledge Elicitation Study for Collaborative Dialogue Strategies Used to Handle Uncertainties in Speech Communication While Using GIS (HW, AG, DH, RL), pp. 135–144.
HIMI-HSM-2013-HirayamaKK #user interface #visual notation: A Dialog Based Speech User Interface of a Makeup Support System for Visually Impaired Persons (MJH, NK, YK), pp. 261–268.
MLDM-2013-PohlZ #automation #n-gram #recognition #using: Using Part of Speech N-Grams for Improving Automatic Speech Recognition of Polish (AP, BZ), pp. 492–504.
RecSys-2013-GraschFR #interactive #named #recommendation #towards: ReComment: towards critiquing-based recommendation with speech interaction (PG, AF, FR), pp. 157–164.
SIGIR-2013-MalionekOSH: Linking transcribed conversational speech (JM, DWO, AS, JHLH), pp. 961–964.
PDP-2013-DziurzanskiM #automation #case study #network #recognition: Core Mapping into an Irregular Network on Chip — Features Extraction System for Automatic Speech Recognition Case Study (PD, TM), pp. 494–498.
AIIDE-2012-OrkinR #comprehension #crowdsourcing #interactive: Understanding Speech in Interactive Narratives with Crowdsourced Data (JO, DKR).
CHI-2012-KumarPL #interactive #type system: Voice typing: a new speech interaction model for dictation on touchscreen devices (AK, TP, BL), pp. 2277–2286.
CHI-2012-KumarRTAK #game studies #mobile #using: Improving literacy in developing countries using speech recognition-supported games on mobile devices (AK, PR, AT, RA, MK), pp. 1149–1158.
ECIR-2012-EskevichMJ #evaluation #metric #retrieval: New Metrics for Meaningful Evaluation of Informally Structured Speech Retrieval (ME, WM, GJFJ), pp. 170–181.
ICML-2012-YuSL #network #using: Conversational Speech Transcription Using Context-Dependent Deep Neural Networks (DY, FS, GL), p. 1.
ICPR-2012-ZhaoXY #learning #network: Unsupervised Tibetan speech features Learning based on Dynamic Bayesian Networks (YZ, XX, GY), pp. 2319–2322.
ICPR-2012-ZhengZ #kernel #recognition: Speech emotion recognition based on kernel reduced-rank regression (WZ, XZ), pp. 1972–1976.
CASE-2012-KawarazakiY #recognition #using: Remote control system of home electrical appliances using speech recognition (NK, TY), pp. 761–764.
SIGITE-2011-Jonas #experience #lessons learnt #research: Capstone experience: lessons from an undergraduate research group in speech at UNH Manchester (MJ), pp. 275–280.
MSR-2011-BinkleyHL #identifier #using: Improving identifier informativeness using part of speech information (DB, MH, DL), pp. 203–206.
HCI-ITE-2011-KarpovRK #multi #recognition #user interface: An Assistive Bi-modal User Interface Integrating Multi-channel Speech Recognition and Computer Vision (AK, AR, ISK), pp. 454–463.
HCI-MIIE-2011-Wang #communication #elicitation: A Knowledge Elicitation Study for a Speech Enabled GIS to Handle Vagueness in Communication (HW), pp. 338–345.
HCI-UA-2011-NisimuraMKKI #automation #development #identification #interface #recognition: Development of Web-Based Voice Interface to Identify Child Users Based on Automatic Speech Recognition System (RN, SM, LK, HK, TI), pp. 607–616.
HIMI-v2-2011-MacchiarellaKCHE #visual notation: Pilot Information Presentation on the Flight Deck: An Application of Synthetic Speech and Visual Digital Displays (NDM, JPK, MSC, TH, ZE), pp. 500–506.
ICEIS-v2-2011-FirozeAQR #recognition #word: Bangla Isolated Word Speech Recognition (AF, MSA, RQ, RMR), pp. 73–82.
SAC-2011-SharminRARF #education #game studies #interactive: Teaching intelligible speech to the autistic children by interactive computer games (MAS, MMR, SIA, MMR, SMF), pp. 1208–1209.
SAC-2011-VenkateshGBC #fixpoint #implementation #markov #modelling #recognition #using: Fixed-point implementation of isolated sub-word level speech recognition using hidden Markov models (NV, RG, RB, MGC), pp. 368–373.
CASE-2011-MaiHHH #adaptation #algorithm #performance: A fast adaptive Kalman filtering algorithm for speech enhancement (QM, DH, YH, ZH), pp. 327–332.
CSMR-2010-MadaniGPGA #identifier #recognition #source code #using #word: Recognizing Words from Source Code Identifiers Using Speech Recognition Techniques (NM, LG, MDP, YGG, GA), pp. 68–77.
CHI-2010-VertanenM #performance #using: Speech dasher: fast writing using speech and gaze (KV, DJCM), pp. 595–598.
ICPR-2010-BozkurtEEE #recognition #using: Use of Line Spectral Frequencies for Emotion Recognition from Speech (EB, EE, ÇEE, ATE), pp. 3708–3711.
ICPR-2010-ChakrabortyG #recognition: Role of Synthetically Generated Samples on Speech Recognition in a Resource-Scarce Language (RC, UG), pp. 1618–1621.
ICPR-2010-FaselB #network #realtime: Deep Belief Networks for Real-Time Extraction of Tongue Contours from Ultrasound During Speech (IF, JB), pp. 1493–1496.
ICPR-2010-HeracleousHB #gesture #integration #recognition: Gestures and Lip Shape Integration for Cued Speech Recognition (PH, NH, DB), pp. 2238–2241.
ICPR-2010-KellyH #recognition #robust: Auditory Features Revisited for Robust Speech Recognition (FK, NH), pp. 4456–4459.
ICPR-2010-KrajewskiBK #case study #classification #detection #multi #self: Comparing Multiple Classifiers for Speech-Based Detection of Self-Confidence — A Pilot Study (JK, AB, SK), pp. 3716–3719.
ICPR-2010-MahdhaouiC #classification #multi: Emotional Speech Classification Based on Multi View Characterization (AM, MC), pp. 4488–4491.
ICPR-2010-Nolazco-FloresLG #automation #recognition: Speech Magnitude-Spectrum Information-Entropy (MSIE) for Automatic Speech Recognition in Noisy Environments (JANF, RAAL, LPGP), pp. 4364–4367.
ICPR-2010-OGorman #analysis #latency: Latency in Speech Feature Analysis for Telepresence Event Coding (LO), pp. 4464–4467.
ICPR-2010-SaeidiMKTCJF #identification #independence: Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals (RS, PM, TK, ZHT, MGC, SHJ, PF), pp. 4565–4568.
ICPR-2010-StadelmannWSEF #algorithm #design #development: Rethinking Algorithm Design and Development in Speech Processing (TS, YW, MS, RE, BF), pp. 4476–4479.
ICPR-2010-StarkWP #representation #using: Single Channel Speech Separation Using Source-Filter Representation (MS, MW, FP), pp. 826–829.
ICPR-2010-SwaminathanTFEAB: Improving and Aligning Speech with Presentation Slides (RS, MET, SF, AE, AA, KB), pp. 3280–3283.
ICPR-2010-TawariT #analysis: Speech Emotion Analysis in Noisy Real-World Environment (AT, MMT), pp. 4605–4608.
ICPR-2010-ZhangSQ #modelling #recognition: Modeling Syllable-Based Pronunciation Variation for Accented Mandarin Speech Recognition (SZ, QS, YQ), pp. 1606–1609.
SIGIR-2010-Popescu-BelisKPNBW #automation #multi #retrieval: Automatic content linking: speech-based just-in-time retrieval for multimedia archives (APB, JK, PP, AN, EB, JdW), p. 703.
CASE-2010-DhupatiKRR #analysis #detection #novel #using #validation: A novel drowsiness detection scheme based on speech analysis with validation using simultaneous EEG recordings (LSD, SK, AR, AR), pp. 917–921.
CHI-2009-PatelARNDP #case study #comparative #interface: A comparative study of speech and dialed input voice interfaces in rural India (NP, SKA, NR, AAN, PD, TSP), pp. 51–54.
HCI-AUII-2009-YamamotoOW #video: Video Content Production Support System with Speech-Driven Embodied Entrainment Character by Speech and Hand Motion Inputs (MY, KO, TW), pp. 358–367.
HCI-NIMT-2009-NisimuraMKI #development #interactive: Development of Speech Input Method for Interactive VoiceWeb Systems (RN, JM, HK, TI), pp. 710–719.
HCI-NIMT-2009-TongW #recognition: Compensate the Speech Recognition Delays for Accurate Speech-Based Cursor Position Control (QT, ZW), pp. 752–760.
HCI-NT-2009-DZmuraDLTS #towards: Toward EEG Sensing of Imagined Speech (MD, SD, TL, ST, RS), pp. 40–48.
HCI-NT-2009-HippP #assurance #quality: Reference Model for Quality Assurance of Speech Applications (CH, MP), pp. 259–266.
HCI-NT-2009-LeeP #evaluation #synthesis: Interpretation of User Evaluation for Emotional Speech Synthesis System (HJL, JCP), pp. 295–303.
ECIR-2009-LarsonTHR #fault #recognition #semantics: Investigating the Global Semantic Impact of Speech Recognition Error on Spoken Content Collections (ML, MT, JH, MdR), pp. 755–760.
ECIR-2009-LiomaB #information retrieval: Part of Speech Based Term Weighting for Information Retrieval (CL, RB), pp. 412–423.
SIGIR-2009-OlssonO #independence #retrieval #robust: Combining LVCSR and vocabulary-independent ranked utterance retrieval for robust speech search (JSO, DWO), pp. 91–98.
CHI-2008-LohrB #interactive #user interface #visual notation: Mixed-initiative dialog management for speech-based interaction with graphical user interfaces (AL, BB), pp. 979–988.
CHI-2008-OviattSA #adaptation #interface: Implicit user-adaptive system engagement in speech and pen interfaces (SLO, CS, AMA), pp. 969–978.
CHI-2008-VertanenK #on the #recognition #visualisation: On the benefits of confidence visualization in speech recognition (KV, POK), pp. 1497–1500.
CSCW-2008-KalnikaiteW #feedback #question #social #summary: Social summarization: does social feedback improve access to speech data? (VK, SW), pp. 9–12.
ICML-2008-HeigoldDSN #evaluation #recognition: Modified MMI/MPE: a direct evaluation of the margin in speech recognition (GH, TD, RS, HN), pp. 384–391.
ICPR-2008-DehzangiMCL #classification #fuzzy #learning #using: Fuzzy rule selection using Iterative Rule Learning for speech data classification (OD, BM, CES, HL), pp. 1–4.
ICPR-2008-KottiK #classification #database #gender: Gender classification in two Emotional Speech databases (MK, CK), pp. 1–4.
ICPR-2008-SerCY #classification #hybrid #recognition: A Hybrid PNN-GMM classification scheme for speech emotion recognition (WS, LC, ZLY), pp. 1–4.
SIGIR-2008-TsagiasLR: Term clouds as surrogates for user generated speech (MT, ML, MdR), pp. 773–774.
SAC-2008-FariaM #recognition #scalability: When a mismatch can be good: large vocabulary speech recognition trained with idealized tandem features (AF, NM), pp. 1574–1577.
ECDL-2007-BernareggiD #named #network: aScience: A Thematic Network on Speech and Tactile Accessibility to Scientific Digital Resources (CB, GCD), pp. 515–517.
HT-2007-KehoeP #synthesis #topic: Transforming DITA topics for speech synthesis output (AK, IJP), pp. 147–148.
ITiCSE-2007-KheirW #realtime #student #using: Inclusion of deaf students in computer science classes using real-time speech transcription (RK, TW), pp. 261–265.
CSMR-2007-Herweijer: Keynote Speech (JPH), p. 3.
CHI-2007-DahlbackWNA #interface #similarity: Similarity is more important than expertise: accent effects in speech interfaces (ND, QW, CN, JA), pp. 1553–1556.
CHI-2007-KaiserBEC #interactive #multimodal: Multimodal redundancy across handwriting and speech during computer mediated human-human interactions (ECK, PB, CE, PRC), pp. 1009–1018.
HCI-IDU-2007-YinC #analysis #automation #metric #towards: Towards Automatic Cognitive Load Measurement from Speech Analysis (BY, FC), pp. 1011–1020.
HCI-MIE-2007-KimLRHH #analysis #design #network #performance #quality: Performance Analysis of Perceptual Speech Quality and Modules Design for Management over IP Network (JK, HWL, WR, SHH, MH), pp. 84–93.
HCI-MIE-2007-LeeP07a #behaviour #generative #synthesis: Customized Message Generation and Speech Synthesis in Response to Characteristic Behavioral Patterns of Children (HJL, JCP), pp. 114–123.
HCI-MIE-2007-XuBAM #empirical #fault #recognition: An Empirical Study on Users’ Acceptance of Speech Recognition Errors in Text-Messaging (SX, SB, MA, DM), pp. 232–242.
HCI-MIE-2007-ZhuL #case study #recognition: Study on Speech Emotion Recognition System in E-Learning (AZ, QL), pp. 544–552.
ICEIS-HCI-2007-ArjunanWKY #human-computer #recognition #using: Silent Bilingual Vowel Recognition — Using fSEMG for HCI based Speech Commands (SPA, HW, DKK, WCY), pp. 68–78.
SIGIR-2007-HeerenWOHJ: Radio Oranje: searching the queen’s speech(es) (WH, LvdW, RO, AvH, FdJ), p. 903.
SIGIR-2007-IrcingOH: First experiments searching spontaneous Czech speech (PI, DWO, JH), pp. 835–836.
CHI-2006-KuriharaGOI #multimodal #predict #recognition: Speech pen: predictive handwriting based on ambient multimodal recognition (KK, MG, JO, TI), pp. 851–860.
CHI-2006-MunteanuBPTJ #recognition #usability: The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives (CM, RB, GP, EGT, DJ), pp. 493–502.
ICPR-v1-2006-PaoCYL #recognition: Mandarin Emotional Speech Recognition Based on SVM and NN (TLP, YTC, JHY, PJL), pp. 1096–1100.
ICPR-v1-2006-XieL #animation #markov #modelling #using: Speech Animation Using Coupled Hidden Markov Models (LX, ZQL), pp. 1128–1131.
ICPR-v2-2006-AndelicSKK #hybrid #kernel #modelling #using: A Hybrid HMM-Based Speech Recognizer Using Kernel-Based Discriminants as Acoustic Models (EA, MS, MK, SEK), pp. 1158–1161.
ICPR-v3-2006-HalavatiSTCR #approach #novel #performance #recognition #robust #word: A Novel Approach to Very Fast and Noise Robust, Isolated Word Speech Recognition (RH, SBS, HT, AC, MR), pp. 190–193.
ICPR-v3-2006-YouCBLT #analysis: Emotional Speech Analysis on Nonlinear Manifold (MY, CC, JB, JL, JT), pp. 91–94.
ICPR-v4-2006-Choi #cumulative #recognition #robust #using: A Noise Robust Front-end for Speech Recognition Using Hough Transform and Cumulative Distribution Mapping (EHCC), pp. 286–289.
ICPR-v4-2006-JinW #music: Speech Separation from Background of Music Based on Single-channel Recording (XCJ, ZFW), pp. 278–281.
ICPR-v4-2006-KrugerSKAW #recognition: Mixture of Support Vector Machines for HMM based Speech Recognition (SEK, MS, MK, EA, AW), pp. 326–329.
ICPR-v4-2006-LeilaC #performance #recognition: Efficient Gaussian Mixture for Speech Recognition (LZ, GC), pp. 294–297.
ICPR-v4-2006-LinO #network #recognition: Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks (HL, ZO), pp. 258–261.
ICPR-v4-2006-LiuH #automation #predict #segmentation: A Bayesian Predictive Method for Automatic Speech Segmentation (ML, TSH), pp. 290–293.
ICPR-v4-2006-MaierHNNHRS #evaluation #recognition: Intelligibility of Children with Cleft Lip and Palate: Evaluation by Speech Recognition Techniques (AKM, CH, EN, EN, TH, FR, MS), pp. 274–277.
ICPR-v4-2006-ZiokoMW #segmentation: Phoneme segmentation of speech (BZ, SM, RCW), pp. 282–285.
SIGIR-2006-LiuO #effectiveness #metric #retrieval: One-sided measures for evaluating ranked retrieval effectiveness with spontaneous conversational speech (BL, DWO), pp. 673–674.
SAC-2006-DoyleB #effectiveness #interactive #mobile: Combining speech and pen input for effective interaction in mobile geospatial environments (JD, MB), pp. 1182–1183.
DATE-2006-LahiriBCM #clustering: Battery-aware code partitioning for a text to speech system (AL, AB, MC, SM), pp. 672–677.
AIIDE-2005-GorniakR #comprehension #game studies: Speaking with your Sidekick: Understanding Situated Speech in Computer Role Playing Games (PG, DR), pp. 57–62.
SIGIR-2005-CarvalhoC #classification #email #on the: On the collective classification of email “speech acts” (VRdC, WWC), pp. 345–352.
DAC-2005-NedevschiPB #hardware #low cost #power management #recognition #user interface: Hardware speech recognition for user interfaces in low cost, low power devices (SN, RKP, EAB), pp. 684–689.
SOSP-2005-Tanenbaum: Keynote speech (AST).
CHI-2004-VemuriDBS #recognition #using: Improving speech playback using time-compression and speech recognition (SV, PD, WB, CS), pp. 295–302.
CHI-2004-WhittakerA #editing #semantics: Semantic speech editing (SW, BA), pp. 527–534.
CIKM-2004-MekhaldiLI #clustering #documentation #using: Using bi-modal alignment and clustering techniques for documents and speech thematic segmentations (DM, DL, RI), pp. 69–77.
ICPR-v1-2004-CoskerMRH #animation #markov #using: Speech Driven Facial Animation using a Hidden Markov Coarticulation Model (DC, ADM, PLR, YH), pp. 128–131.
ICPR-v2-2004-BeierholmB #music #using: Speech Music Discrimination Using Class-Specific Features (TB, PMB), pp. 379–382.
ICPR-v3-2004-GutkinK #classification #representation: Structural Representation of Speech for Phonetic Classification (AG, SK), pp. 438–441.
SIGIR-2004-OardSDHMWRFGMKS #information retrieval: Building an information retrieval test collection for spontaneous conversational speech (DWO, DS, DSD, XH, GCM, JW, BR, MF, SG, JM, LK, SS), pp. 41–48.
DocEng-2003-MekhaldiLI #documentation: Thematic alignment of recorded speech with documents (DM, DL, RI), pp. 52–54.
CIKM-2003-GilbertZ #information retrieval #user interface: Speech user interfaces for information retrieval (JEG, YZ), pp. 77–82.
MLDM-2003-LazliS #fuzzy #logic #probability #recognition #using: Connectionist Probability Estimators in HMM Arabic Speech Recognition Using Fuzzy Logic (LL, MS), pp. 379–388.
SIGIR-2003-HayashiOBMMMHHI #multi: Speech-based and video-supported indexing of multimedia broadcast news (YH, KO, KB, OM, YM, SM, MH, TH, NI), pp. 441–442.
JCDL-2002-HauptmannJN #information retrieval #multi #recognition #using #video: Multi-modal information retrieval from broadcast video using OCR and speech recognition (AGH, RJ, TDN), pp. 160–161.
CGDC-2002-Tosca #community: The EverQuest Speech Community (SPT).
CHI-2002-SuhmBMFGGP #case study #comparative #natural language: A comparative study of speech in the call center: natural language call routing vs. touch-tone menus (BS, JB, DM, BF, DG, KG, PP), pp. 283–290.
CHI-2002-WhittakerHASBISZR #interface #named: SCANMail: a voicemail interface that makes speech browsable, readable and searchable (SW, JH, BA, LAS, MB, PLI, LS, GZ, AER), pp. 275–282.
ICML-2002-MeyerB #scalability #towards: Towards “Large Margin” Speech Recognizers by Boosting and Discriminative Training (CM, PB), pp. 419–426.
ICPR-v1-2002-BraySE #gesture #recognition: Recognition of Gestures in the Context of Speech (MB, HS, JOE), pp. 356–359.
ICPR-v2-2002-WachsmuthS #analysis #image #probability #process: Integrated Analysis of Speech and Images as a Probabilistic Decoding Process (SW, GS), pp. 588–592.
ICPR-v3-2002-Bourlard #pattern matching #pattern recognition #recognition #statistics: Some Recent Advances in Speech Recognition with Potential Applications in Other Statistical Pattern Recognition Areas (HB), p. 727.
ICPR-v3-2002-KatzMDK #analysis #automation #linear #robust: Robustness of Linear Discriminant Analysis in Automatic Speech Recognitio (MK, HGM, HD, DK), pp. 371–374.
ICPR-v3-2002-QuanTH #recognition #robust: A Robust Method for the Vietnamese Handwritten and Speech Recognition (VHQ, PNT, NDHH), pp. 732–735.
ICPR-v3-2002-TanakaKFI #modelling: Constructing Speech Processing Systems on Universal Phonetic Codes Accompanied with Reference Acoustic Models (KT, HK, NF, YI), pp. 728–731.
ICPR-v4-2002-StephensonMB #automation #network #recognition: Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition (TAS, MMD, HB), p. 293–?.
JCDL-2001-CooperVBC #enterprise: Building searchable collections of enterprise speech data (JWC, MV, DKB, MC), pp. 226–234.
CHI-2001-GongL #performance: Shall we mix synthetic speech and human speech?: impact on users’ performance, perception, and attitude (LG, JL), pp. 158–165.
CHI-2001-LaiCGT #comprehension #on the #web: On the road and on the Web?: comprehension of synthetic and human speech while driving (JL, KC, PAG, OT), pp. 206–212.
CHI-2001-StifelmanAS #interactive: The audio notebook: paper and pen interaction with structured speech (LS, BA, CS), pp. 182–189.
CIKM-2001-BrownSCPCAP #towards: Towards Speech as a Knowledge Resource (EWB, SS, AC, DBP, JWC, AA, JP), pp. 526–528.
CIKM-2001-PonceleonS #automation: Automatic Discovery of Salient Segments in Imperfect Speech Transcripts (DBP, SS), pp. 490–497.
SIGIR-2001-PonceleonS #segmentation: Structure and Content-Based Segmentation of Speech Transcripts (DBP, SS), pp. 404–405.
DL-2000-SmithCS #interface: A speech interface for building musical score collections (LAS, EFC, BLS), pp. 165–173.
CHI-2000-LaiWC: The effect of task conditions on the comprehensibility of synthetic speech (JL, DW, MC), pp. 321–328.
CHI-2000-NassL: Does computer-generated speech manifest personality? an experimental test of similarity-attraction (CN, KML), pp. 329–336.
ICPR-v2-2000-UgenaAA #independence #network #recognition: Speaker-Independent Speech Recognition by Means of Functional-Link Neural Networks (AU, FdA, MEA), pp. 6018–6021.
ICPR-v3-2000-FaruquieMRS #modelling #recognition #scalability #using: Large Vocabulary Audio-Visual Speech Recognition Using Active Shape Models (TAF, AM, NR, LVS), pp. 3110–3113.
ICPR-v3-2000-GravierSC #automation #markov #random #recognition: A Markov Random Field Model for Automatic Speech Recognition (GG, MS, GC), pp. 3258–3261.
ICPR-v3-2000-Ney #classification #modelling #probability #recognition: Stochastic Modeling: From Pattern Classification to Speech Recognition and Translation (HN), pp. 3025–3032.
ICPR-v3-2000-SanchisVJ #performance #recognition #using #verification #word: Efficient Use of the Grammar Scale Factor to Classify Incorrect Words in Speech Recognition Verification (AS, EV, VMJ), pp. 3278–3281.
SIGIR-2000-McCarleyF #detection #fault #recognition #topic: Influence of speech recognition errors on topic detection (JSM, MF), pp. 342–344.
CHI-1999-KaratHHK #recognition #scalability: Patterns of Entry and Correction in Large Vocabulary Continuous Speech Recognition System (CMK, CH, DBH, JK), pp. 568–575.
HCI-CCAD-1999-SearsB #recognition: Redesigning speech recognition for use by individuals with spinal cord injuries (AS, JBR), pp. 966–969.
HCI-EI-1999-Brandt-PookFWS #recognition: Integrated Recognition and Interpretation of Speech for a Construction Task Domain (HBP, GAF, SW, GS), pp. 550–554.
HCI-EI-1999-CarbonellD #empirical #gesture #human-computer #multimodal #using: Empirical data on the use of speech and gestures in a multimodal human-computer environment (NC, PD), pp. 446–450.
HCI-EI-1999-Fischer #design #human-computer #interface: Repeats, Reformulations, and Emotional Speech: Evidence for the Design of Human-Computer Speech Interfaces (KF), pp. 560–565.
HCI-EI-1999-StedmonB #development #interface: Evaluating Stress in the Development of Speech Interface Technology (AWS, CB), pp. 545–549.
HCI-EI-1999-SundareswaranBCW #3d #artificial reality #distributed #recognition: A Distributed System for Device Diagnostics Utilizing Augmented Reality, 3D Audio, and Speech Recognition (VS, RB, SC, KW), pp. 466–470.
SIGIR-1999-SinghalP #documentation #retrieval: Document Expansion for Speech Retrieval (AS, FCNP), pp. 34–41.
SIGIR-1999-WhittakerHCHPS #design #named #retrieval #user interface: SCAN: Designing and Evaluating User Interfaces to Support Retrieval From Speech Archives (SW, JH, JC, DH, FCNP, AS), pp. 26–33.
DL-1998-SlaughterOWHW #interface #retrieval #visual notation: A Graphical Interface for Speech-Based Retrieval (LAS, DWO, VLW, JLH, GJW), pp. 305–306.
SIGIR-1998-NgZ #fault #retrieval #using: Speech Retrieval Using Phonemes with Error Correction (CN, JZ), pp. 365–366.
ICSE-1998-SrinivasanV #experience #framework #object-oriented #recognition #reuse: Object Oriented Reuse: Experience in Developing a Framework for Speech Recognition Applications (SS, JV), pp. 322–330.
SAC-1998-DuruDA #fuzzy #logic #reduction: Fuzzy logic based noise reduction of digitally recorded speech signal (ND, TD, NA), pp. 287–291.
WIA-1997-KirazE #automaton #implementation #multi #prolog: Multi-tape Automata for Speech and Language Systems: A Prolog Implementation (GAK, EGE), pp. 87–103.
CHI-1997-LaiV #named #recognition: MedSpeak: Report Creation with Continuous Speech Recognition (JL, JV), pp. 431–438.
SIGIR-1997-KlavansTJ #automation #effectiveness #multi #natural language #semiparsing #using: Effective Use of Natural Language Processing Techniques for Automatic Conflation of Multi-Word Terms: The Role of Derivational Morphology, Part of Speech Tagging, and Shallow Parsing (ET, JK, CJ), pp. 148–155.
SIGIR-1997-SheridanWS #performance #retrieval: Cross Language Speech Retrieval: Establishing a Baseline Performance (PS, MW, PS), pp. 99–108.
CHI-1996-Raman #interface #named: Emacspeak — A Speech Interface (TVR), pp. 66–71.
ICPR-1996-LuettinTB: Locating and tracking facial speech features (JL, NAT, SWB), pp. 652–656.
ICPR-1996-Nouza #feature model #markov #modelling #recognition: Feature selection methods for hidden Markov model-based speech recognition (JN), pp. 186–190.
ICPR-1996-SchwartzLMRZ #independence #recognition #using: Language-independent OCR using a continuous speech recognition system (RMS, CL, JM, CR, YZ), pp. 99–103.
ICPR-1996-SharmaHPZLCS #gesture #interface #visual notation: Speech/gesture interface to a visual computing environment for molecular biologists (RS, TSH, VIP, YZ, ZL, SMC, KS), pp. 964–968.
ICPR-1996-WouwerSD #analysis #classification: Wavelet-FILVQ classifier for speech analysis (GVdW, PS, DVD), pp. 214–218.
ICRE-1996-SaekiMSK #elicitation #requirements: Structuring utterance records of requirements elicitation meetings based on speech act theory (MS, KM, JS, HK), pp. 21–30.
PDP-1996-GarnerHBT #parallel: A Parallel Processing Environment for Speech Signal Processing Applications (NRG, DMH, PAB, AMT), pp. 470–477.
CHI-1995-YankelovichLM #design #user interface: Designing SpeechActs: Issues in Speech User Interfaces (NY, GAL, MM), pp. 369–376.
SAC-1995-Bothe #fuzzy #modelling #visual notation: Fuzzy input coding for an artificial neural--network modelling visual speech movements (HHB), pp. 450–454.
CAiSE-1994-Johannesson #approach #communication #information management #representation: Representation and Communication in Information Systems — A Speech Act Based Approach (PJ), pp. 200–213.
CAiSE-1994-NellbornH #enterprise #information management #modelling #requirements: Capturing Information Systems Requirements Through Enterprise and Speech Act Modelling (CN, PH), pp. 172–185.
SAC-1994-RusnokLC: Freedom’93: a portable speech device (KLR, MSL, JMC), pp. 556–560.
HCI-SHI-1993-GreavesWO #image #using #visual notation: Enhancing Speech Intelligibility Using Visual Images (CG, MW, OÖ), pp. 1097–1102.
HCI-SHI-1993-TamuraCS #image: Effect of Image Presentation to the Cognition of Plural Speech (HT, YC, YS), pp. 62–67.
HCI-SHI-1993-WangSHPW #evaluation #interface #usability: A Usability Evaluation of Text and Speech Redundant Help Messages on a Reader Interface (EMYW, HS, LH, KP, NW), pp. 724–729.
HCI-SHI-1993-Watanabe #feedback: Voice-Responsive Eye-Blinking Feedback for Improved Human-to-Machine Speech Input (TW), pp. 1091–1096.
INTERCHI-1993-StifelmanASH #interface #named: VoiceNotes: a speech interface for a hand-held voice notetaker (LS, BA, CS, EAH), pp. 179–186.
CHI-1992-Sellen: Speech Patterns in Video-Mediated Conversations (AS), pp. 49–59.
SIGIR-1992-GlavitschS #documentation: A System for Retrieving Speech Documents (UG, PS), pp. 168–176.
CHI-1991-ChalfonteFK #comparison: Expressive richness: a comparison of speech and text as media for revision (BLC, RSF, REK), pp. 21–26.
CHI-1991-RudnickyH #interactive #modelling #protocol #recognition: Models for evaluating interaction protocols in speech recognition (AIR, AGH), pp. 285–291.
CHI-1989-Hauptmann #gesture #image: Speech and gestures for graphic image manipulation (AGH), pp. 241–245.
HCI-CE-1987-BennettG: Evaluating Synthetic Speech Devices (RWB, SLG), pp. 391–398.
HCI-CE-1987-Glenn #human-computer #interactive #question: Where Does Speech Technology Fit in Human-Computer Interaction? (JWG), pp. 431–438.
DIPL-1976-Horning: After-dinner speech (JJH), pp. 444–445.