BibSLEIGH

Tag #speech

247 papers:

CoGCoG-2019-SykownikBM #analysis #automation #pipes and filters #sentiment
Can You Hear the Player Experience? A Pipeline for Automated Sentiment Analysis of Player Speech (PS, FB, MM), pp. 1–4.
ICMLICML-2019-FuLTL #black box #generative #metric #named #network #optimisation
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (SWF, CFL, YT0, SDL), pp. 2031–2041.
ICMLICML-2019-KenterWCCV #named #network #synthesis
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network (TK, VW, CaC, RC, JV), pp. 3331–3340.
ICMLICML-2019-QinCCGR #automation #recognition #robust
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition (YQ, NC, GWC, IJG, CR), pp. 5231–5240.
ICMLICML-2019-RenTQZZL #automation #recognition
Almost Unsupervised Text to Speech and Automatic Speech Recognition (YR, XT, TQ, SZ, ZZ, TYL), pp. 5410–5419.
ICSTICST-2019-IwamaF #automation #recognition #testing
Automated Testing of Basic Recognition Capability for Speech Recognition Systems (FI, TF), pp. 13–24.
EDMEDM-2018-GautamMGR #automation #categorisation #chat
Automated Speech Act Categorization of Chat Utterances in Virtual Internships (DG, NM, AG, VR).
ICSMEICSME-2018-KrasniqiM #component #developer #generative
TraceLab Components for Generating Speech Act Types in Developer Question/Answer Conversations (RK, CM), p. 713.
CIKMCIKM-2018-FangZYMZ #benchmark #metric #named #video
TED-KISS: A Known-Item Speech Video Search Benchmark (FF, BWZ, XCY, HXM, FZ), pp. 1803–1806.
ICMLICML-2018-OordLBSVKDLCSCG #parallel #performance #synthesis
Parallel WaveNet: Fast High-Fidelity Speech Synthesis (AvdO, YL, IB, KS, OV, KK, GvdD, EL, LCC, FS, NC, DG, SN, SD, EE, NK, HZ, AG, HK, TW, DB, DH), pp. 3915–3923.
ICMLICML-2018-Skerry-RyanBXWS #synthesis #towards
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron (RJSR, EB, YX, YW, DS, JS, RJW, RC, RAS), pp. 4700–4709.
ICMLICML-2018-WangSZRBSXJRS #modelling #synthesis
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (YW, DS, YZ, RJSR, EB, JS, YX, YJ, FR, RAS), pp. 5167–5176.
ICPRICPR-2018-DingLXKS #generative #network #recognition #robust #towards
Mutual-optimization Towards Generative Adversarial Networks For Robust Speech Recognition (KD, NL, YX, DK, KS), pp. 2699–2704.
ICPRICPR-2018-LiZXGX18a #network #recognition
Recurrent Neural Network Based Small-footprint Wake-up-word Speech Recognition System with a Score Calibration Method (CL, LZ, SX, PG, BX0), pp. 3222–3227.
ICPRICPR-2018-SaitohK #database #named #recognition #smarttech #visual notation
SSSD: Speech Scene database by Smart Device for Visual Speech Recognition (TS, MK), pp. 3228–3232.
ICPRICPR-2018-XiaoW #animation #network
Dense Convolutional Recurrent Neural Network for Generalized Speech Animation (LX, ZW), pp. 633–638.
ESEC-FSEESEC-FSE-2018-WoodRAM #debugging #detection #developer
Detecting speech act types in developer question/answer conversations during bug repair (AW, PR, AA, CM), pp. 491–502.
ICMLICML-2017-NagamineM #case study #comprehension #multi #recognition #representation
Understanding the Representation and Computation of Multilayer Perceptrons: A Case Study in Speech Recognition (TN, NM), pp. 2564–2573.
ICMLICML-2017-OchiaiWHH #multi #recognition
Multichannel End-to-end Speech Recognition (TO, SW, TH, JRH), pp. 2632–2641.
ICSMEICSME-2016-OlneyHTL #java
Part of Speech Tagging Java Method Names (WO, EH0, CT, BL), pp. 483–487.
MSRMSR-2016-MoslehiAR #documentation #mining #on the
On mining crowd-based speech documentation (PM, BA, JR), pp. 259–268.
DiGRADiGRA-FDG-2016-LyonLZ #design #game studies
Combining Speech Intervention and Cooperative Game Design for Children with ASD (NL, DIL, JZ).
CIKMCIKM-2016-ManshaKKA #identification #network #self
A Self-Organizing Map for Identifying Influential Communities in Speech-based Networks (SM, FK, AK, AA), pp. 1965–1968.
ICMLICML-2016-AmodeiABCCCCCCD #recognition
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin (DA, SA, RA, JB, EB, CC, JC, BC, JC, MC, AC, GD, EE, JHE, LF, CF, AYH, BJ, TH, PL, XL, LL, SN, AYN, SO, RP, SQ, JR, SS, DS, SS, CW0, YW, ZW, BX, YX, DY, JZ, ZZ), pp. 173–182.
ICPRICPR-2016-ChakrabortyPK #information management #recognition #using
Spontaneous speech emotion recognition using prior knowledge (RC, MP, SKK), pp. 2866–2871.
ICPRICPR-2016-PironkovDD #automation #learning #multi #recognition
Speaker-aware Multi-Task Learning for automatic speech recognition (GP, SD, TD), pp. 2900–2905.
EDMEDM-2015-BlanchardDON #analysis #automation #education #towards
Classifying Q&A from Teachers' Speech: Moving Toward an Automated System of Dialogic Analysis (NB, SKD, AO, MN), pp. 282–288.
CoGCIG-2015-Kendall #game studies
Keynote speech IV: Where games meet hyper-heuristics (GK), p. 19.
CoGCIG-2015-Lucas #challenge #game studies #video
Keynote speech II: General video game AI: Challenges and applications (SL), p. 17.
CoGCIG-2015-Muller #challenge #research
Keynote speech III: Computer go research - The challenges ahead (MM0), p. 18.
CoGCIG-2015-Yao #game studies #learning
Keynote speech I: Co-evolutionary learning in game-playing (XY0), p. 16.
CHICHI-2015-LimerickMC #empirical #interface
Empirical Evidence for a Diminished Sense of Agency in Speech Interfaces (HL, JWM, DC), pp. 3967–3970.
CHICHI-2015-McMillanLB
Repurposing Conversation: Experiments with the Continuous Speech Stream (DM, AL, BATB), pp. 3953–3962.
CHICHI-2015-McNaneyPVBZO #named #people
LApp: A Speech Loudness Application for People with Parkinson’s on Google Glass (RM, IP, JV, MB, PZ, PO), pp. 497–500.
HCIDUXU-IXD-2015-CaiLLH #case study #experience #research #user interface
User Experience Research on the Rehabilitation System of Speech-Impaired Children — A Case Study on Speech Training Product (WC, JL, QL, TH), pp. 562–574.
HCIDUXU-IXD-2015-WangWG #comparison
Cross Cultural Comparison of Users’ Barge-in with the In-Vehicle Speech System (PW, UW, TJG), pp. 529–540.
RecSysRecSys-2015-Stock #automation #persuasion
A (Persuasive?) Speech on Automated Persuasion (OS), pp. 1–2.
SACSAC-2015-Soares0W #approach #modelling #named #recognition #requirements
VoiceToModel: an approach to generate requirements models from speech recognition mechanisms (FS, JA, FW), pp. 1350–1357.
SIGITESIGITE-2014-Jonas #experience #research
Capstone experience: achieving success with an undergraduate research group in speech (MJ), pp. 55–60.
CHI-PLAYCHI-PLAY-2014-LanAABG #game studies #interactive
Flappy voice: an interactive game for childhood apraxia of speech therapy (TL, SA, BA, KJB, RGO), pp. 429–430.
CHICHI-2014-HamidiB #interface #named
Rafigh: a living media interface for speech intervention (FH, MB), pp. 1817–1820.
CHICHI-2014-Vosoughi #automation #recognition #visual notation
Improving automatic speech recognition through head pose driven visual grounding (SV), pp. 3235–3238.
HCIHCI-AIMT-2014-AlmeidaST #design #development #interactive
Design and Development of Speech Interaction: A Methodology (NA, SSS, AJST), pp. 370–381.
HCIHCI-AIMT-2014-JonssonD #interactive #performance
Driving with a Speech Interaction System: Effect of Personality on Performance and Attitude of Driver (IMJ, ND), pp. 417–428.
HCIHCI-TMT-2014-ColetiMN #evaluation #named #recognition #usability #using
ErgoSV: An Environment to Support Usability Evaluation Using Face and Speech Recognition (TAC, MM, FdLdSN), pp. 554–564.
ICEISICEIS-v3-2014-SilvaFG14a #artificial reality
Assisting Speech Therapy for Autism Spectrum Disorders with an Augmented Reality Application (CAdS, ARF, APG), pp. 38–45.
ICMLICML-c2-2014-GravesJ #network #recognition #towards
Towards End-To-End Speech Recognition with Recurrent Neural Networks (AG, NJ), pp. 1764–1772.
ICPRICPR-2014-DengZS #learning #recognition
Linked Source and Target Domain Subspace Feature Transfer Learning — Exemplified by Speech Emotion Recognition (JD, ZZ, BWS), pp. 761–766.
ICPRICPR-2014-MustiZP #3d #animation #estimation #image #visual notation
Facial 3D Shape Estimation from Images for Visual Speech Animation (UM, ZZ, MP), pp. 40–45.
ICPRICPR-2014-NishimuraOAN #image #modelling #using #web
Selection of Unknown Objects Specified by Speech Using Models Constructed from Web Images (HN, YO, YA, MN), pp. 477–482.
ICPRICPR-2014-YuncuHB #automation #modelling #recognition #using
Automatic Speech Emotion Recognition Using Auditory Models with Binary Decision Tree and SVM (EY, HH, CB), pp. 773–778.
SIGIRSIGIR-2014-Jones #retrieval #tool support
Speech search: techniques and tools for spoken content retrieval (GJFJ), p. 1287.
ICDARICDAR-2013-LiWTLG #image #locality
Unsupervised Speech Text Localization in Comic Images (LL, YW, ZT, XL, LG), pp. 1190–1194.
ICDARICDAR-2013-RigaudBOKW #detection
An Active Contour Model for Speech Balloon Detection in Comics (CR, JCB, JMO, DK, JvdW), pp. 1240–1244.
CoGVS-Games-2013-LoaizaOCPALN #game studies #prototype #video
A Video Game Prototype for Speech Rehabilitation (DL, CO, AC, AP, GIA, DL, AN), pp. 1–4.
CHICHI-2013-RazaHTPRSR
Job opportunities through entertainment: virally spread speech-based services for low-literate users (AAR, FuH, ZT, MP, SR, US, RR), pp. 2803–2812.
HCIDHM-SET-2013-GeorgeK #modelling #towards
Towards Enhancing the Acoustic Models for Dysarthric Speech (KKG, CSK), pp. 183–188.
HCIDUXU-CXC-2013-TailebAAAA #named #recognition
YUSR: Speech Recognition Software for Dyslexics (MT, RAS, AAG, MAZ, SAS), pp. 296–303.
HCIHCI-AMTE-2013-ColetiMN #automation #evaluation #recognition #usability
Analyzing Face and Speech Recognition to Create Automatic Information for Usability Evaluation (TAC, MM, FdLdSN), pp. 184–192.
HCIHCI-AS-2013-Harris #interface
Emotion and Emotion Regulation Considerations for Speech-Based In-Vehicle Interfaces (HH), pp. 571–577.
HCIHCI-AS-2013-WeiHCHK #design #elicitation
Ergonomics Design with Novice Elicitation on an Auditory-Only In-Vehicle Speech System (MHW, SLH, HCC, JYH, CCK), pp. 654–660.
HCIHCI-III-2013-RebenitschO
Facial Electromyogram Activation as Silent Speech Method (LR, CBO), pp. 464–473.
HCIHCI-IMT-2013-GalatasPM #artificial reality #distance #multi #recognition #robust #video
Robust Multi-Modal Speech Recognition in Two Languages Utilizing Video and Distance Information from the Kinect (GG, GP, FM), pp. 43–48.
HCIHCI-IMT-2013-KuncMLK
Speech-Based Text Correction Patterns in Noisy Environment (LK, TM, ML, JK), pp. 59–66.
HCIHCI-IMT-2013-MedjkouneMPV #multimodal #recognition
Multimodal Mathematical Expressions Recognition: Case of Speech and Handwriting (SM, HM, SP, CVG), pp. 77–86.
HCIHCI-IMT-2013-RigasA13a #communication #interface
Investigating the Impact of Combining Speech and Earcons to Communicate Information in E-government Interfaces (DR, BA), pp. 23–31.
HCIHCI-IMT-2013-WangGHL #collaboration #communication #elicitation #nondeterminism #using
A Knowledge Elicitation Study for Collaborative Dialogue Strategies Used to Handle Uncertainties in Speech Communication While Using GIS (HW, AG, DH, RL), pp. 135–144.
HCIHIMI-HSM-2013-HirayamaKK #user interface #visual notation
A Dialog Based Speech User Interface of a Makeup Support System for Visually Impaired Persons (MJH, NK, YK), pp. 261–268.
MLDMMLDM-2013-PohlZ #automation #n-gram #recognition #using
Using Part of Speech N-Grams for Improving Automatic Speech Recognition of Polish (AP, BZ), pp. 492–504.
RecSysRecSys-2013-GraschFR #interactive #named #recommendation #towards
ReComment: towards critiquing-based recommendation with speech interaction (PG, AF, FR), pp. 157–164.
SIGIRSIGIR-2013-MalionekOSH
Linking transcribed conversational speech (JM, DWO, AS, JHLH), pp. 961–964.
PDPPDP-2013-DziurzanskiM #automation #case study #network #recognition
Core Mapping into an Irregular Network on Chip — Features Extraction System for Automatic Speech Recognition Case Study (PD, TM), pp. 494–498.
AIIDEAIIDE-2012-OrkinR #comprehension #crowdsourcing #interactive
Understanding Speech in Interactive Narratives with Crowdsourced Data (JO, DKR).
CHICHI-2012-KumarPL #interactive #type system
Voice typing: a new speech interaction model for dictation on touchscreen devices (AK, TP, BL), pp. 2277–2286.
CHICHI-2012-KumarRTAK #game studies #mobile #using
Improving literacy in developing countries using speech recognition-supported games on mobile devices (AK, PR, AT, RA, MK), pp. 1149–1158.
ECIRECIR-2012-EskevichMJ #evaluation #metric #retrieval
New Metrics for Meaningful Evaluation of Informally Structured Speech Retrieval (ME, WM, GJFJ), pp. 170–181.
ICMLICML-2012-YuSL #network #using
Conversational Speech Transcription Using Context-Dependent Deep Neural Networks (DY, FS, GL), p. 1.
ICPRICPR-2012-ZhaoXY #learning #network
Unsupervised Tibetan speech features Learning based on Dynamic Bayesian Networks (YZ, XX, GY), pp. 2319–2322.
ICPRICPR-2012-ZhengZ #kernel #recognition
Speech emotion recognition based on kernel reduced-rank regression (WZ, XZ), pp. 1972–1976.
CASECASE-2012-KawarazakiY #recognition #using
Remote control system of home electrical appliances using speech recognition (NK, TY), pp. 761–764.
SIGITESIGITE-2011-Jonas #experience #lessons learnt #research
Capstone experience: lessons from an undergraduate research group in speech at UNH Manchester (MJ), pp. 275–280.
MSRMSR-2011-BinkleyHL #identifier #using
Improving identifier informativeness using part of speech information (DB, MH, DL), pp. 203–206.
HCIHCI-ITE-2011-KarpovRK #multi #recognition #user interface
An Assistive Bi-modal User Interface Integrating Multi-channel Speech Recognition and Computer Vision (AK, AR, ISK), pp. 454–463.
HCIHCI-MIIE-2011-Wang #communication #elicitation
A Knowledge Elicitation Study for a Speech Enabled GIS to Handle Vagueness in Communication (HW), pp. 338–345.
HCIHCI-UA-2011-NisimuraMKKI #automation #development #identification #interface #recognition
Development of Web-Based Voice Interface to Identify Child Users Based on Automatic Speech Recognition System (RN, SM, LK, HK, TI), pp. 607–616.
HCIHIMI-v2-2011-MacchiarellaKCHE #visual notation
Pilot Information Presentation on the Flight Deck: An Application of Synthetic Speech and Visual Digital Displays (NDM, JPK, MSC, TH, ZE), pp. 500–506.
ICEISICEIS-v2-2011-FirozeAQR #recognition #word
Bangla Isolated Word Speech Recognition (AF, MSA, RQ, RMR), pp. 73–82.
SACSAC-2011-SharminRARF #education #game studies #interactive
Teaching intelligible speech to the autistic children by interactive computer games (MAS, MMR, SIA, MMR, SMF), pp. 1208–1209.
SACSAC-2011-VenkateshGBC #fixpoint #implementation #markov #modelling #recognition #using
Fixed-point implementation of isolated sub-word level speech recognition using hidden Markov models (NV, RG, RB, MGC), pp. 368–373.
CASECASE-2011-MaiHHH #adaptation #algorithm #performance
A fast adaptive Kalman filtering algorithm for speech enhancement (QM, DH, YH, ZH), pp. 327–332.
CSMRCSMR-2010-MadaniGPGA #identifier #recognition #source code #using #word
Recognizing Words from Source Code Identifiers Using Speech Recognition Techniques (NM, LG, MDP, YGG, GA), pp. 68–77.
CHICHI-2010-VertanenM #performance #using
Speech dasher: fast writing using speech and gaze (KV, DJCM), pp. 595–598.
ICPRICPR-2010-BozkurtEEE #recognition #using
Use of Line Spectral Frequencies for Emotion Recognition from Speech (EB, EE, ÇEE, ATE), pp. 3708–3711.
ICPRICPR-2010-ChakrabortyG #recognition
Role of Synthetically Generated Samples on Speech Recognition in a Resource-Scarce Language (RC, UG), pp. 1618–1621.
ICPRICPR-2010-FaselB #network #realtime
Deep Belief Networks for Real-Time Extraction of Tongue Contours from Ultrasound During Speech (IF, JB), pp. 1493–1496.
ICPRICPR-2010-HeracleousHB #gesture #integration #recognition
Gestures and Lip Shape Integration for Cued Speech Recognition (PH, NH, DB), pp. 2238–2241.
ICPRICPR-2010-KellyH #recognition #robust
Auditory Features Revisited for Robust Speech Recognition (FK, NH), pp. 4456–4459.
ICPRICPR-2010-KrajewskiBK #case study #classification #detection #multi #self
Comparing Multiple Classifiers for Speech-Based Detection of Self-Confidence — A Pilot Study (JK, AB, SK), pp. 3716–3719.
ICPRICPR-2010-MahdhaouiC #classification #multi
Emotional Speech Classification Based on Multi View Characterization (AM, MC), pp. 4488–4491.
ICPRICPR-2010-Nolazco-FloresLG #automation #recognition
Speech Magnitude-Spectrum Information-Entropy (MSIE) for Automatic Speech Recognition in Noisy Environments (JANF, RAAL, LPGP), pp. 4364–4367.
ICPRICPR-2010-OGorman #analysis #latency
Latency in Speech Feature Analysis for Telepresence Event Coding (LO), pp. 4464–4467.
ICPRICPR-2010-SaeidiMKTCJF #identification #independence
Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals (RS, PM, TK, ZHT, MGC, SHJ, PF), pp. 4565–4568.
ICPRICPR-2010-StadelmannWSEF #algorithm #design #development
Rethinking Algorithm Design and Development in Speech Processing (TS, YW, MS, RE, BF), pp. 4476–4479.
ICPRICPR-2010-StarkWP #representation #using
Single Channel Speech Separation Using Source-Filter Representation (MS, MW, FP), pp. 826–829.
ICPRICPR-2010-SwaminathanTFEAB
Improving and Aligning Speech with Presentation Slides (RS, MET, SF, AE, AA, KB), pp. 3280–3283.
ICPRICPR-2010-TawariT #analysis
Speech Emotion Analysis in Noisy Real-World Environment (AT, MMT), pp. 4605–4608.
ICPRICPR-2010-ZhangSQ #modelling #recognition
Modeling Syllable-Based Pronunciation Variation for Accented Mandarin Speech Recognition (SZ, QS, YQ), pp. 1606–1609.
SIGIRSIGIR-2010-Popescu-BelisKPNBW #automation #multi #retrieval
Automatic content linking: speech-based just-in-time retrieval for multimedia archives (APB, JK, PP, AN, EB, JdW), p. 703.
CASECASE-2010-DhupatiKRR #analysis #detection #novel #using #validation
A novel drowsiness detection scheme based on speech analysis with validation using simultaneous EEG recordings (LSD, SK, AR, AR), pp. 917–921.
CHICHI-2009-PatelARNDP #case study #comparative #interface
A comparative study of speech and dialed input voice interfaces in rural India (NP, SKA, NR, AAN, PD, TSP), pp. 51–54.
HCIHCI-AUII-2009-YamamotoOW #video
Video Content Production Support System with Speech-Driven Embodied Entrainment Character by Speech and Hand Motion Inputs (MY, KO, TW), pp. 358–367.
HCIHCI-NIMT-2009-NisimuraMKI #development #interactive
Development of Speech Input Method for Interactive VoiceWeb Systems (RN, JM, HK, TI), pp. 710–719.
HCIHCI-NIMT-2009-TongW #recognition
Compensate the Speech Recognition Delays for Accurate Speech-Based Cursor Position Control (QT, ZW), pp. 752–760.
HCIHCI-NT-2009-DZmuraDLTS #towards
Toward EEG Sensing of Imagined Speech (MD, SD, TL, ST, RS), pp. 40–48.
HCIHCI-NT-2009-HippP #assurance #quality
Reference Model for Quality Assurance of Speech Applications (CH, MP), pp. 259–266.
HCIHCI-NT-2009-LeeP #evaluation #synthesis
Interpretation of User Evaluation for Emotional Speech Synthesis System (HJL, JCP), pp. 295–303.
ECIRECIR-2009-LarsonTHR #fault #recognition #semantics
Investigating the Global Semantic Impact of Speech Recognition Error on Spoken Content Collections (ML, MT, JH, MdR), pp. 755–760.
ECIRECIR-2009-LiomaB #information retrieval
Part of Speech Based Term Weighting for Information Retrieval (CL, RB), pp. 412–423.
SIGIRSIGIR-2009-OlssonO #independence #retrieval #robust
Combining LVCSR and vocabulary-independent ranked utterance retrieval for robust speech search (JSO, DWO), pp. 91–98.
CHICHI-2008-LohrB #interactive #user interface #visual notation
Mixed-initiative dialog management for speech-based interaction with graphical user interfaces (AL, BB), pp. 979–988.
CHICHI-2008-OviattSA #adaptation #interface
Implicit user-adaptive system engagement in speech and pen interfaces (SLO, CS, AMA), pp. 969–978.
CHICHI-2008-VertanenK #on the #recognition #visualisation
On the benefits of confidence visualization in speech recognition (KV, POK), pp. 1497–1500.
CSCWCSCW-2008-KalnikaiteW #feedback #question #social #summary
Social summarization: does social feedback improve access to speech data? (VK, SW), pp. 9–12.
ICMLICML-2008-HeigoldDSN #evaluation #recognition
Modified MMI/MPE: a direct evaluation of the margin in speech recognition (GH, TD, RS, HN), pp. 384–391.
ICPRICPR-2008-DehzangiMCL #classification #fuzzy #learning #using
Fuzzy rule selection using Iterative Rule Learning for speech data classification (OD, BM, CES, HL), pp. 1–4.
ICPRICPR-2008-KottiK #classification #database #gender
Gender classification in two Emotional Speech databases (MK, CK), pp. 1–4.
ICPRICPR-2008-SerCY #classification #hybrid #recognition
A Hybrid PNN-GMM classification scheme for speech emotion recognition (WS, LC, ZLY), pp. 1–4.
SIGIRSIGIR-2008-TsagiasLR
Term clouds as surrogates for user generated speech (MT, ML, MdR), pp. 773–774.
SACSAC-2008-FariaM #recognition #scalability
When a mismatch can be good: large vocabulary speech recognition trained with idealized tandem features (AF, NM), pp. 1574–1577.
TPDLECDL-2007-BernareggiD #named #network
aScience: A Thematic Network on Speech and Tactile Accessibility to Scientific Digital Resources (CB, GCD), pp. 515–517.
HTHT-2007-KehoeP #synthesis #topic
Transforming DITA topics for speech synthesis output (AK, IJP), pp. 147–148.
ITiCSEITiCSE-2007-KheirW #realtime #student #using
Inclusion of deaf students in computer science classes using real-time speech transcription (RK, TW), pp. 261–265.
CSMRCSMR-2007-Herweijer
Keynote Speech (JPH), p. 3.
CHICHI-2007-DahlbackWNA #interface #similarity
Similarity is more important than expertise: accent effects in speech interfaces (ND, QW, CN, JA), pp. 1553–1556.
CHICHI-2007-KaiserBEC #interactive #multimodal
Multimodal redundancy across handwriting and speech during computer mediated human-human interactions (ECK, PB, CE, PRC), pp. 1009–1018.
HCIHCI-IDU-2007-YinC #analysis #automation #metric #towards
Towards Automatic Cognitive Load Measurement from Speech Analysis (BY, FC), pp. 1011–1020.
HCIHCI-MIE-2007-KimLRHH #analysis #design #network #performance #quality
Performance Analysis of Perceptual Speech Quality and Modules Design for Management over IP Network (JK, HWL, WR, SHH, MH), pp. 84–93.
HCIHCI-MIE-2007-LeeP07a #behaviour #generative #synthesis
Customized Message Generation and Speech Synthesis in Response to Characteristic Behavioral Patterns of Children (HJL, JCP), pp. 114–123.
HCIHCI-MIE-2007-XuBAM #empirical #fault #recognition
An Empirical Study on Users’ Acceptance of Speech Recognition Errors in Text-Messaging (SX, SB, MA, DM), pp. 232–242.
HCIHCI-MIE-2007-ZhuL #case study #recognition
Study on Speech Emotion Recognition System in E-Learning (AZ, QL), pp. 544–552.
ICEISICEIS-HCI-2007-ArjunanWKY #human-computer #recognition #using
Silent Bilingual Vowel Recognition — Using fSEMG for HCI based Speech Commands (SPA, HW, DKK, WCY), pp. 68–78.
SIGIRSIGIR-2007-HeerenWOHJ
Radio Oranje: searching the queen’s speech(es) (WH, LvdW, RO, AvH, FdJ), p. 903.
SIGIRSIGIR-2007-IrcingOH
First experiments searching spontaneous Czech speech (PI, DWO, JH), pp. 835–836.
CHICHI-2006-KuriharaGOI #multimodal #predict #recognition
Speech pen: predictive handwriting based on ambient multimodal recognition (KK, MG, JO, TI), pp. 851–860.
CHICHI-2006-MunteanuBPTJ #recognition #usability
The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives (CM, RB, GP, EGT, DJ), pp. 493–502.
ICPRICPR-v1-2006-PaoCYL #recognition
Mandarin Emotional Speech Recognition Based on SVM and NN (TLP, YTC, JHY, PJL), pp. 1096–1100.
ICPRICPR-v1-2006-XieL #animation #markov #modelling #using
Speech Animation Using Coupled Hidden Markov Models (LX, ZQL), pp. 1128–1131.
ICPRICPR-v2-2006-AndelicSKK #hybrid #kernel #modelling #using
A Hybrid HMM-Based Speech Recognizer Using Kernel-Based Discriminants as Acoustic Models (EA, MS, MK, SEK), pp. 1158–1161.
ICPRICPR-v3-2006-HalavatiSTCR #approach #novel #performance #recognition #robust #word
A Novel Approach to Very Fast and Noise Robust, Isolated Word Speech Recognition (RH, SBS, HT, AC, MR), pp. 190–193.
ICPRICPR-v3-2006-YouCBLT #analysis
Emotional Speech Analysis on Nonlinear Manifold (MY, CC, JB, JL, JT), pp. 91–94.
ICPRICPR-v4-2006-Choi #cumulative #recognition #robust #using
A Noise Robust Front-end for Speech Recognition Using Hough Transform and Cumulative Distribution Mapping (EHCC), pp. 286–289.
ICPRICPR-v4-2006-JinW #music
Speech Separation from Background of Music Based on Single-channel Recording (XCJ, ZFW), pp. 278–281.
ICPRICPR-v4-2006-KrugerSKAW #recognition
Mixture of Support Vector Machines for HMM based Speech Recognition (SEK, MS, MK, EA, AW), pp. 326–329.
ICPRICPR-v4-2006-LeilaC #performance #recognition
Efficient Gaussian Mixture for Speech Recognition (LZ, GC), pp. 294–297.
ICPRICPR-v4-2006-LinO #network #recognition
Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks (HL, ZO), pp. 258–261.
ICPRICPR-v4-2006-LiuH #automation #predict #segmentation
A Bayesian Predictive Method for Automatic Speech Segmentation (ML, TSH), pp. 290–293.
ICPRICPR-v4-2006-MaierHNNHRS #evaluation #recognition
Intelligibility of Children with Cleft Lip and Palate: Evaluation by Speech Recognition Techniques (AKM, CH, EN, EN, TH, FR, MS), pp. 274–277.
ICPRICPR-v4-2006-ZiokoMW #segmentation
Phoneme segmentation of speech (BZ, SM, RCW), pp. 282–285.
SIGIRSIGIR-2006-LiuO #effectiveness #metric #retrieval
One-sided measures for evaluating ranked retrieval effectiveness with spontaneous conversational speech (BL, DWO), pp. 673–674.
SACSAC-2006-DoyleB #effectiveness #interactive #mobile
Combining speech and pen input for effective interaction in mobile geospatial environments (JD, MB), pp. 1182–1183.
DATEDATE-2006-LahiriBCM #clustering
Battery-aware code partitioning for a text to speech system (AL, AB, MC, SM), pp. 672–677.
AIIDEAIIDE-2005-GorniakR #comprehension #game studies
Speaking with your Sidekick: Understanding Situated Speech in Computer Role Playing Games (PG, DR), pp. 57–62.
SIGIRSIGIR-2005-CarvalhoC #classification #email #on the
On the collective classification of email “speech acts” (VRdC, WWC), pp. 345–352.
DACDAC-2005-NedevschiPB #hardware #low cost #power management #recognition #user interface
Hardware speech recognition for user interfaces in low cost, low power devices (SN, RKP, EAB), pp. 684–689.
SOSPSOSP-2005-Tanenbaum
Keynote speech (AST).
CHICHI-2004-VemuriDBS #recognition #using
Improving speech playback using time-compression and speech recognition (SV, PD, WB, CS), pp. 295–302.
CHICHI-2004-WhittakerA #editing #semantics
Semantic speech editing (SW, BA), pp. 527–534.
CIKMCIKM-2004-MekhaldiLI #clustering #documentation #using
Using bi-modal alignment and clustering techniques for documents and speech thematic segmentations (DM, DL, RI), pp. 69–77.
ICPRICPR-v1-2004-CoskerMRH #animation #markov #using
Speech Driven Facial Animation using a Hidden Markov Coarticulation Model (DC, ADM, PLR, YH), pp. 128–131.
ICPRICPR-v2-2004-BeierholmB #music #using
Speech Music Discrimination Using Class-Specific Features (TB, PMB), pp. 379–382.
ICPRICPR-v3-2004-GutkinK #classification #representation
Structural Representation of Speech for Phonetic Classification (AG, SK), pp. 438–441.
SIGIRSIGIR-2004-OardSDHMWRFGMKS #information retrieval
Building an information retrieval test collection for spontaneous conversational speech (DWO, DS, DSD, XH, GCM, JW, BR, MF, SG, JM, LK, SS), pp. 41–48.
DocEngDocEng-2003-MekhaldiLI #documentation
Thematic alignment of recorded speech with documents (DM, DL, RI), pp. 52–54.
CIKMCIKM-2003-GilbertZ #information retrieval #user interface
Speech user interfaces for information retrieval (JEG, YZ), pp. 77–82.
MLDMMLDM-2003-LazliS #fuzzy #logic #probability #recognition #using
Connectionist Probability Estimators in HMM Arabic Speech Recognition Using Fuzzy Logic (LL, MS), pp. 379–388.
SIGIRSIGIR-2003-HayashiOBMMMHHI #multi
Speech-based and video-supported indexing of multimedia broadcast news (YH, KO, KB, OM, YM, SM, MH, TH, NI), pp. 441–442.
JCDLJCDL-2002-HauptmannJN #information retrieval #multi #recognition #using #video
Multi-modal information retrieval from broadcast video using OCR and speech recognition (AGH, RJ, TDN), pp. 160–161.
DiGRACGDC-2002-Tosca #community
The EverQuest Speech Community (SPT).
CHICHI-2002-SuhmBMFGGP #case study #comparative #natural language
A comparative study of speech in the call center: natural language call routing vs. touch-tone menus (BS, JB, DM, BF, DG, KG, PP), pp. 283–290.
CHICHI-2002-WhittakerHASBISZR #interface #named
SCANMail: a voicemail interface that makes speech browsable, readable and searchable (SW, JH, BA, LAS, MB, PLI, LS, GZ, AER), pp. 275–282.
ICMLICML-2002-MeyerB #scalability #towards
Towards “Large Margin” Speech Recognizers by Boosting and Discriminative Training (CM, PB), pp. 419–426.
ICPRICPR-v1-2002-BraySE #gesture #recognition
Recognition of Gestures in the Context of Speech (MB, HS, JOE), pp. 356–359.
ICPRICPR-v2-2002-WachsmuthS #analysis #image #probability #process
Integrated Analysis of Speech and Images as a Probabilistic Decoding Process (SW, GS), pp. 588–592.
ICPRICPR-v3-2002-Bourlard #pattern matching #pattern recognition #recognition #statistics
Some Recent Advances in Speech Recognition with Potential Applications in Other Statistical Pattern Recognition Areas (HB), p. 727.
ICPRICPR-v3-2002-KatzMDK #analysis #automation #linear #robust
Robustness of Linear Discriminant Analysis in Automatic Speech Recognition (MK, HGM, HD, DK), pp. 371–374.
ICPRICPR-v3-2002-QuanTH #recognition #robust
A Robust Method for the Vietnamese Handwritten and Speech Recognition (VHQ, PNT, NDHH), pp. 732–735.
ICPRICPR-v3-2002-TanakaKFI #modelling
Constructing Speech Processing Systems on Universal Phonetic Codes Accompanied with Reference Acoustic Models (KT, HK, NF, YI), pp. 728–731.
ICPRICPR-v4-2002-StephensonMB #automation #network #recognition
Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition (TAS, MMD, HB), p. 293–?.
JCDLJCDL-2001-CooperVBC #enterprise
Building searchable collections of enterprise speech data (JWC, MV, DKB, MC), pp. 226–234.
CHICHI-2001-GongL #performance
Shall we mix synthetic speech and human speech?: impact on users’ performance, perception, and attitude (LG, JL), pp. 158–165.
CHICHI-2001-LaiCGT #comprehension #on the #web
On the road and on the Web?: comprehension of synthetic and human speech while driving (JL, KC, PAG, OT), pp. 206–212.
CHICHI-2001-StifelmanAS #interactive
The audio notebook: paper and pen interaction with structured speech (LS, BA, CS), pp. 182–189.
CIKMCIKM-2001-BrownSCPCAP #towards
Towards Speech as a Knowledge Resource (EWB, SS, AC, DBP, JWC, AA, JP), pp. 526–528.
CIKMCIKM-2001-PonceleonS #automation
Automatic Discovery of Salient Segments in Imperfect Speech Transcripts (DBP, SS), pp. 490–497.
SIGIRSIGIR-2001-PonceleonS #segmentation
Structure and Content-Based Segmentation of Speech Transcripts (DBP, SS), pp. 404–405.
DL-2000-SmithCS #interface
A speech interface for building musical score collections (LAS, EFC, BLS), pp. 165–173.
CHICHI-2000-LaiWC
The effect of task conditions on the comprehensibility of synthetic speech (JL, DW, MC), pp. 321–328.
CHICHI-2000-NassL
Does computer-generated speech manifest personality? an experimental test of similarity-attraction (CN, KML), pp. 329–336.
ICPRICPR-v2-2000-UgenaAA #independence #network #recognition
Speaker-Independent Speech Recognition by Means of Functional-Link Neural Networks (AU, FdA, MEA), pp. 6018–6021.
ICPRICPR-v3-2000-FaruquieMRS #modelling #recognition #scalability #using
Large Vocabulary Audio-Visual Speech Recognition Using Active Shape Models (TAF, AM, NR, LVS), pp. 3110–3113.
ICPRICPR-v3-2000-GravierSC #automation #markov #random #recognition
A Markov Random Field Model for Automatic Speech Recognition (GG, MS, GC), pp. 3258–3261.
ICPRICPR-v3-2000-Ney #classification #modelling #probability #recognition
Stochastic Modeling: From Pattern Classification to Speech Recognition and Translation (HN), pp. 3025–3032.
ICPRICPR-v3-2000-SanchisVJ #performance #recognition #using #verification #word
Efficient Use of the Grammar Scale Factor to Classify Incorrect Words in Speech Recognition Verification (AS, EV, VMJ), pp. 3278–3281.
SIGIRSIGIR-2000-McCarleyF #detection #fault #recognition #topic
Influence of speech recognition errors on topic detection (JSM, MF), pp. 342–344.
CHICHI-1999-KaratHHK #recognition #scalability
Patterns of Entry and Correction in Large Vocabulary Continuous Speech Recognition Systems (CMK, CH, DBH, JK), pp. 568–575.
HCIHCI-CCAD-1999-SearsB #recognition
Redesigning speech recognition for use by individuals with spinal cord injuries (AS, JBR), pp. 966–969.
HCIHCI-EI-1999-Brandt-PookFWS #recognition
Integrated Recognition and Interpretation of Speech for a Construction Task Domain (HBP, GAF, SW, GS), pp. 550–554.
HCIHCI-EI-1999-CarbonellD #empirical #gesture #human-computer #multimodal #using
Empirical data on the use of speech and gestures in a multimodal human-computer environment (NC, PD), pp. 446–450.
HCIHCI-EI-1999-Fischer #design #human-computer #interface
Repeats, Reformulations, and Emotional Speech: Evidence for the Design of Human-Computer Speech Interfaces (KF), pp. 560–565.
HCIHCI-EI-1999-StedmonB #development #interface
Evaluating Stress in the Development of Speech Interface Technology (AWS, CB), pp. 545–549.
HCIHCI-EI-1999-SundareswaranBCW #3d #artificial reality #distributed #recognition
A Distributed System for Device Diagnostics Utilizing Augmented Reality, 3D Audio, and Speech Recognition (VS, RB, SC, KW), pp. 466–470.
SIGIRSIGIR-1999-SinghalP #documentation #retrieval
Document Expansion for Speech Retrieval (AS, FCNP), pp. 34–41.
SIGIRSIGIR-1999-WhittakerHCHPS #design #named #retrieval #user interface
SCAN: Designing and Evaluating User Interfaces to Support Retrieval From Speech Archives (SW, JH, JC, DH, FCNP, AS), pp. 26–33.
DL-1998-SlaughterOWHW #interface #retrieval #visual notation
A Graphical Interface for Speech-Based Retrieval (LAS, DWO, VLW, JLH, GJW), pp. 305–306.
SIGIRSIGIR-1998-NgZ #fault #retrieval #using
Speech Retrieval Using Phonemes with Error Correction (CN, JZ), pp. 365–366.
ICSEICSE-1998-SrinivasanV #experience #framework #object-oriented #recognition #reuse
Object Oriented Reuse: Experience in Developing a Framework for Speech Recognition Applications (SS, JV), pp. 322–330.
SACSAC-1998-DuruDA #fuzzy #logic #reduction
Fuzzy logic based noise reduction of digitally recorded speech signal (ND, TD, NA), pp. 287–291.
CIAAWIA-1997-KirazE #automaton #implementation #multi #prolog
Multi-tape Automata for Speech and Language Systems: A Prolog Implementation (GAK, EGE), pp. 87–103.
CHICHI-1997-LaiV #named #recognition
MedSpeak: Report Creation with Continuous Speech Recognition (JL, JV), pp. 431–438.
SIGIRSIGIR-1997-KlavansTJ #automation #effectiveness #multi #natural language #semiparsing #using
Effective Use of Natural Language Processing Techniques for Automatic Conflation of Multi-Word Terms: The Role of Derivational Morphology, Part of Speech Tagging, and Shallow Parsing (ET, JK, CJ), pp. 148–155.
SIGIRSIGIR-1997-SheridanWS #performance #retrieval
Cross Language Speech Retrieval: Establishing a Baseline Performance (PS, MW, PS), pp. 99–108.
CHICHI-1996-Raman #interface #named
Emacspeak — A Speech Interface (TVR), pp. 66–71.
ICPRICPR-1996-LuettinTB
Locating and tracking facial speech features (JL, NAT, SWB), pp. 652–656.
ICPRICPR-1996-Nouza #feature model #markov #modelling #recognition
Feature selection methods for hidden Markov model-based speech recognition (JN), pp. 186–190.
ICPRICPR-1996-SchwartzLMRZ #independence #recognition #using
Language-independent OCR using a continuous speech recognition system (RMS, CL, JM, CR, YZ), pp. 99–103.
ICPRICPR-1996-SharmaHPZLCS #gesture #interface #visual notation
Speech/gesture interface to a visual computing environment for molecular biologists (RS, TSH, VIP, YZ, ZL, SMC, KS), pp. 964–968.
ICPRICPR-1996-WouwerSD #analysis #classification
Wavelet-FILVQ classifier for speech analysis (GVdW, PS, DVD), pp. 214–218.
REICRE-1996-SaekiMSK #elicitation #requirements
Structuring utterance records of requirements elicitation meetings based on speech act theory (MS, KM, JS, HK), pp. 21–30.
PDPPDP-1996-GarnerHBT #parallel
A Parallel Processing Environment for Speech Signal Processing Applications (NRG, DMH, PAB, AMT), pp. 470–477.
CHICHI-1995-YankelovichLM #design #user interface
Designing SpeechActs: Issues in Speech User Interfaces (NY, GAL, MM), pp. 369–376.
SACSAC-1995-Bothe #fuzzy #modelling #visual notation
Fuzzy input coding for an artificial neural-network modelling visual speech movements (HHB), pp. 450–454.
CAiSECAiSE-1994-Johannesson #approach #communication #information management #representation
Representation and Communication in Information Systems — A Speech Act Based Approach (PJ), pp. 200–213.
CAiSECAiSE-1994-NellbornH #enterprise #information management #modelling #requirements
Capturing Information Systems Requirements Through Enterprise and Speech Act Modelling (CN, PH), pp. 172–185.
SACSAC-1994-RusnokLC
Freedom’93: a portable speech device (KLR, MSL, JMC), pp. 556–560.
HCIHCI-SHI-1993-GreavesWO #image #using #visual notation
Enhancing Speech Intelligibility Using Visual Images (CG, MW, ), pp. 1097–1102.
HCIHCI-SHI-1993-TamuraCS #image
Effect of Image Presentation to the Cognition of Plural Speech (HT, YC, YS), pp. 62–67.
HCIHCI-SHI-1993-WangSHPW #evaluation #interface #usability
A Usability Evaluation of Text and Speech Redundant Help Messages on a Reader Interface (EMYW, HS, LH, KP, NW), pp. 724–729.
HCIHCI-SHI-1993-Watanabe #feedback
Voice-Responsive Eye-Blinking Feedback for Improved Human-to-Machine Speech Input (TW), pp. 1091–1096.
CHIINTERCHI-1993-StifelmanASH #interface #named
VoiceNotes: a speech interface for a hand-held voice notetaker (LS, BA, CS, EAH), pp. 179–186.
CHICHI-1992-Sellen
Speech Patterns in Video-Mediated Conversations (AS), pp. 49–59.
SIGIRSIGIR-1992-GlavitschS #documentation
A System for Retrieving Speech Documents (UG, PS), pp. 168–176.
CHICHI-1991-ChalfonteFK #comparison
Expressive richness: a comparison of speech and text as media for revision (BLC, RSF, REK), pp. 21–26.
CHICHI-1991-RudnickyH #interactive #modelling #protocol #recognition
Models for evaluating interaction protocols in speech recognition (AIR, AGH), pp. 285–291.
CHICHI-1989-Hauptmann #gesture #image
Speech and gestures for graphic image manipulation (AGH), pp. 241–245.
HCIHCI-CE-1987-BennettG
Evaluating Synthetic Speech Devices (RWB, SLG), pp. 391–398.
HCIHCI-CE-1987-Glenn #human-computer #interactive #question
Where Does Speech Technology Fit in Human-Computer Interaction? (JWG), pp. 431–438.
AdaDIPL-1976-Horning
After-dinner speech (JJH), pp. 444–445.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.