Access the full text.
Sign up today, get DeepDyve free for 14 days.
R. Hamilton, Á. Pascual-Leone, G. Schlaug (2004)
Absolute pitch in blind musiciansNeuroReport, 15
(2007)
Move over Poirot: Belgium Recruits Blind Detectives to Help Fight Crime
Kathryn Zyskowski, M. Morris, Jeffrey Bigham, Mary Gray, Shaun Kane (2015)
Accessible Crowdwork?: Understanding the Value in and Challenge of Microtask Employment for People with DisabilitiesProceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing
Gabriele Paolacci, Jesse Chandler, Panagiotis G. Ipeirotis (2010)
Running experiments on Amazon mechanical turkJudg. Decis. Mak., 5
Marialena Barouti, Konstantinos Papadopoulos, G. Kouroupetroglou (2013)
Synthetic and Natural Speech Intelligibility in Individuals with Visual Impairments: Effects of Experience and Presentation Rate
(2004)
Visual and Multisensory Processing and Plasticity in the Human Brain
T. Toda, K. Tokuda (2007)
A Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis
Walter Lasecki, R. Kushalnagar, Jeffrey Bigham (2014)
Legion scribe: real-time captioning by non-expertsProceedings of the 16th international ACM SIGACCESS conference on Computers & accessibility
C. McKinstry, Rick Dale, Michael Spivey (2008)
Action Dynamics Reveal Parallel Competition in Decision MakingPsychological Science, 19
Anja Moos, Jürgen Trouvain (2007)
Comprehension of ultra-fast speech–blind vs“normally hearing” persons. In Proceedings of the International Congress of Phonetic Sciences (ICPhS’07). Saarland University Saarbrücken
Konstantinos Papadopoulos, Eleni Koustriava (2015)
Comprehension of Synthetic and Natural Speech: Differences among Sighted and Visually Impaired Young Adults
Qisheng Li, Krzysztof Gajos, Katharina Reinecke (2018)
Volunteer-Based Online Studies With Older Adults and People with DisabilitiesProceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility
H. Zen, A. Senior, M. Schuster (2013)
Statistical parametric speech synthesis using deep neural networks2013 IEEE International Conference on Acoustics, Speech and Signal Processing
J. Guerreiro, Daniel Gonçalves (2015)
Faster Text-to-Speeches: Enhancing Blind People's Information Scanning with Faster Concurrent SpeechProceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility
(2016)
The Blind Boy Who Learned to See with Sound. Retrieved from http://www.bbc.com/news/ disability-35550768
D. Pascolini, S. Mariotti (2011)
Global estimates of visual impairment: 2010British Journal of Ophthalmology, 96
C. Wan, A. Wood, D. Reutens, Sarah Wilson (2010)
Early but not late-blindness leads to enhanced auditory perceptionNeuropsychologia, 48
Akemi Iida, N. Campbell, F. Higuchi, M. Yasumura (2003)
A corpus-based speech synthesis system with emotionSpeech Commun., 40
Brigitte Röder, Frank Rösler, Erwin Hennighausen, Fritz Näcker (1996)
Event-related potentials during auditory and somatosensory discrimination in sighted and blind human subjectsCogn. Brain Res., 4
E. Foulke, T. Sticht (1969)
Review of research on the intelligibility and comprehension of accelerated speech.Psychological bulletin, 72 1
A. Graesser (2016)
Conversations with AutoTutor Help Students LearnInternational Journal of Artificial Intelligence in Education, 26
Andrew Kolarik, S. Cirstea, S. Pardhan, B. Moore (2014)
A summary of research investigating echolocation abilities of blind and sighted humansHearing Research, 310
L. Germine, K. Nakayama, B. Duchaine, C. Chabris, Garga Chatterjee, J. Wilmer (2012)
Is the Web as good as the lab? Comparable performance from Web and lab in cognitive/perceptual experimentsPsychonomic Bulletin & Review, 19
Apple (2017)
VoiceOverhttp://www.apple.com/accessibility/mac/vision/. (Accessed 2017-09-02).
Gabriele Paolacci, Jesse Chandler, Panagiotis Ipeirotis (2010)
Running Experiments on Amazon Mechanical TurkBehavioral & Experimental Economics eJournal
Kenzo Ishizaka, James L. Flanagan (1972)
Synthesis of voiced sounds from a two-mass model of the vocal cordsBell Labs Techn. J., 51
Ronald A. Cole, Jola Jakimik (1980)
A model of speech perceptionPerception and Production of Fluent Speech (1980)
Çağatay Demiralp, Michael Bernstein, Jeffrey Heer (2014)
Learning Perceptual Kernels for Visualization DesignIEEE Transactions on Visualization and Computer Graphics, 20
B. Röder, Frank Rösler, E. Hennighausen, Fritz Näcker (1996)
Event-related potentials during auditory and somatosensory discrimination in sighted and blind human subjects.Brain research. Cognitive brain research, 4 2
Celia Scully (1990)
Articulatory synthesisSpeech Production and Speech Modelling. Springer
Jeffrey Bigham, Anna Cavender, Jeremy Brudvik, J. Wobbrock, R. Ladner (2007)
WebinSitu: a comparative analysis of blind and sighted browsing behavior
B. Röder, Oliver Stock, Siegfried Bien, H. Neville, F. Rösler (2002)
Speech processing activates visual cortex in congenitally blind humansEuropean Journal of Neuroscience, 16
Google (2017)
TalkBackRetrieved September 3, 2017 from http://play.google.com/store/apps/details?id=com.google.android.marvin.talkback&hl=en., 3
Aditya Vashistha, P. Sethi, Richard Anderson (2017)
Respeak: A Voice-based, Crowd-powered Speech Transcription SystemProceedings of the 2017 CHI Conference on Human Factors in Computing Systems
James McClelland, J. Elman (1986)
The TRACE model of speech perceptionCognitive Psychology, 18
Dasom Choi, Daehyun Kwak, Minji Cho, Sangsu Lee (2020)
"Nobody Speaks that Fast!" An Empirical Study of Speech Rate in Conversational Agents for People with Vision ImpairmentsProceedings of the 2020 CHI Conference on Human Factors in Computing Systems
C. Shadle, R. Damper (2001)
Prospects for articulatory synthesis: A position paper
B. Röder, W. Teder-Sälejärvi, A. Sterr, F. Rösler, S. Hillyard, H. Neville (1999)
Improved auditory spatial tuning in blind humansNature, 400
F. Gougoux, R. Zatorre, M. Lassonde, P. Voss, F. Lepore (2005)
A Functional Neuroimaging Study of Sound Localization: Visual Cortex Activity Predicts Performance in Early-Blind IndividualsPLoS Biology, 3
W. Marslen-Wilson, Alan Welsh (1978)
Processing interactions and lexical access during word recognition in continuous speechCognitive Psychology, 10
(1963)
Psychoacoustic speech tests: A modified rhyme test
Keiichi Tokuda, Takayoshi Yoshimura, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura (2000)
Speech parameter generation algorithms for HMM-based speech synthesisProceedings of the IEEE International Conference on Acoustics
(2015)
Beyond Braille: A History of Reading by Ear
Shumin An, Zhenhua Ling, Lirong Dai (2017)
Emotional statistical parametric speech synthesis using LSTM-RNNs2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
G. W. Micro (2017)
Window-EyesSeptember, 2
J. Lazar, Aaron Allen, Jason Kleinman, C. Malarkey (2007)
What Frustrates Screen Reader Users on the Web: A Study of 100 Blind UsersInternational Journal of Human–Computer Interaction, 22
C. Asakawa, Hironobu Takagi, S. Ino, T. Ifukube (2003)
Maximum listening speeds for the blind
P. Voss, M. Lassonde, F. Gougoux, M. Fortin, J. Guillemot, F. Lepore (2004)
Early- and Late-Onset Blind Individuals Show Supra-Normal Auditory Abilities in Far-SpaceCurrent Biology, 14
K. Ishizaka, J. Flanagan (1972)
Synthesis of voiced sounds from a two-mass model of the vocal cordsBell System Technical Journal, 51
B. Röder, Lisa Demuth, J. Streb, F. Rösler (2003)
Semantic and morpho-syntactic priming in auditory word recognition in congenitally blind adultsLanguage and Cognitive Processes, 18
Diemo Schwarz, G. Beller, Bruno Verbrugghe, Sam Britton (2006)
REAL-TIME CORPUS-BASED CONCATENATIVE SYNTHESIS WITH CATART
N. Miller, G. Maruyama, R. Beaber, Keith Valone (1976)
Speed of speech and persuasion.Journal of Personality and Social Psychology, 34
Anja Moos, Jürgen Trouvain (2007)
COMPREHENSION OF ULTRA-FAST SPEECH - BLIND VS. "NORMALLY HEARING" PERSONS
V. Pulkki, M. Karjalainen (2015)
Communication Acoustics: An Introduction to Speech, Audio and Psychoacoustics
NV Access (2017)
NVDA 2017Retrieved September 2, 2017 from http://www.nvaccess.org/., 2
(2010)
More than meets the eye: A survey of screen-reader browsing strategies
Jose Sotelo, Soroush Mehri, Kundan Kumar, J. Santos, Kyle Kastner, Aaron Courville, Yoshua Bengio (2017)
Char2Wav: End-to-End Speech Synthesis
Danielle Bragg, Cynthia Bennett, Katharina Reinecke, R. Ladner (2018)
A Large Inclusive Study of Human Listening RatesProceedings of the 2018 CHI Conference on Human Factors in Computing Systems
D. Dahan (2010)
The Time Course of Interpretation in Speech ComprehensionCurrent Directions in Psychological Science, 19
Dan Bilefsky (2007)
In Fight Against Terror, Keen Ears Undistracted by Sighthttp://www.nytimes.com/2007/11/17/world/europe/17vanloo.html?mcubz=1.
R. Cole (2016)
Perception and production of fluent speech
Takayoshi Yoshimura, K. Tokuda, T. Masuko, Takao Kobayashi, T. Kitamura (1999)
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
W. Marslen-Wilson (1987)
Functional parallelism in spoken word-recognitionCognition, 25
B. Röder, F. Rösler (2003)
Memory for environmental sounds in sighted, congenitally blind and late blind adults: evidence for cross-modal compensation.International journal of psychophysiology : official journal of the International Organization of Psychophysiology, 50 1-2
N. Hoffart (2000)
Basics of Qualitative Research: Techniques and Procedures for Developing Grounded TheoryNephrology Nursing Journal, 27
(2017)
ChromeVox Version 52
M. Furmankiewicz, Anna Sołtysik-Piorunkiewicz, Piotr Ziuziański (2014)
Artificial intelligence systems for knowledge management in e-health : the study of intelligent software agents
(2007)
In Fight Against Terror, Keen Ears Undistracted by Sight. http://www.nytimes.com/2007/11/17/ world/europe/17vanloo.html?mcubz=1
Helena Merriman (2016)
The Blind Boy Who Learned to See with SoundRetrieved from http://www.bbc.com/news/disability-35550768.
Qisheng Li, Sung Joo, J. Yeatman, Katharina Reinecke (2020)
Controlling for Participants’ Viewing Distance in Large-Scale, Psychophysical Online Experiments Using a Virtual ChinrestScientific Reports, 10
Tal August, Katharina Reinecke (2019)
Pay Attention, Please: Formal Language Improves Attention in Volunteer and Paid Online ExperimentsProceedings of the 2019 CHI Conference on Human Factors in Computing Systems
Jeffrey Heer, M. Bostock (2010)
Crowdsourcing graphical perception: using mechanical turk to assess visualization designProceedings of the SIGCHI Conference on Human Factors in Computing Systems
Qisheng Li, Sung Jun Joo, Jason D. Yeatman, Katharina Reinecke (2020)
controlling for participantsâviewing distance in large-scale, 10
Amanda Stent, A. Syrdal, Taniya Mishra (2011)
On the intelligibility of fast synthesized speech for individuals with early-onset blindnessThe proceedings of the 13th international ACM SIGACCESS conference on Computers and accessibility
F. Gougoux, F. Lepore, M. Lassonde, P. Voss, R. Zatorre, P. Belin (2004)
Neuropsychology: Pitch discrimination in the early blindNature, 430
É. Moulines, F. Charpentier (1989)
Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
A. Black, N. Campbell (1995)
Optimising selection of units from speech databases for concatenative synthesis
H. Théoret, L. Merabet, Á. Pascual-Leone (2004)
Behavioral and neuroplastic changes in the blind: evidence for functionally relevant cross-modal interactionsJournal of Physiology-Paris, 98
Freedom Scientific (2006)
JAWS 18Retrieved September 2, 2017 from http://www.freedomscientific.com/., 2
G. Altmann (1989)
Cognitive Models of Speech Processing: Psycholinguistic and Computational Perspectives - Workshop OverviewAI Mag., 10
Heiga Ze, Andrew Senior, Mike Schuster (2013)
Statistical parametric speech synthesis using deep neural networksProceedings of the 2013 IEEE International Conference on Acoustics, 2013
L. Dunai, I. Lengua, G. Peris-Fajarnés, Fernando Brusola (2015)
Virtual Sound Localization by Blind PeopleArchives of Acoustics, 40
R. Emerson, Rachel Fretz, Linda Shaw (1995)
Writing Ethnographic Fieldnotes
K. Tokuda, Takayoshi Yoshimura, T. Masuko, Takao Kobayashi, T. Kitamura (2000)
Speech parameter generation algorithms for HMM-based speech synthesis2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 3
Jürgen Trouvain (2007)
On the comprehension of extremely fast synthetic speech
T. Hull, H. Mason (1995)
Performance of Blind Children on Digit-Span TestsJournal of Visual Impairment & Blindness, 89
Katharina Reinecke, Krzysztof Gajos (2015)
LabintheWild: Conducting Large-Scale Online Experiments With Uncompensated SamplesProceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing
R. Weeks, B. Horwitz, A. Aziz-Sultan, B. Tian, C. Wessinger, L. Cohen, M. Hallett, J. Rauschecker (2000)
A Positron Emission Tomographic Study of Auditory Localization in the Congenitally BlindThe Journal of Neuroscience, 20
Emerson Foulke, Thomas G. Sticht (1969)
Review of research on the intelligibility and comprehension of accelerated speechReview of research on the intelligibility and comprehension of accelerated speech.Psychological Bulletin, 72
Kirsten Hötting, B. Röder (2009)
Auditory and auditory-tactile processing in congenitally blind humansHearing Research, 258
Santani Teng, Amrita Puri, D. Whitney (2011)
Ultrafine spatial acuity of blind expert human echolocatorsExperimental Brain Research, 216
(1990)
Articulatory synthesis. In Speech Production and Speech Modelling
T. Ye, Katharina Reinecke, L. Robert (2017)
Personalized Feedback Versus Money: The Effect on Reliability of Subjective Data in Online Experimental PlatformsCompanion of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing
M. Schröder (2001)
Emotional speech synthesis: a review
J. Horton, David Rand, R. Zeckhauser (2010)
The online laboratory: conducting experiments in a real labor marketExperimental Economics, 14
Gerry T. M. Altmann (Ed (1995)
Cognitive Models of Speech Processing: Psycholinguistic and Computational PerspectivesMIT Press.
D. Ross, I. Olson, J. Gore (2003)
Cortical plasticity in an early blind musician: an fMRl study.Magnetic resonance imaging, 21 7
As conversational agents and digital assistants become increasingly pervasive, understanding their synthetic speech becomes increasingly important. Simultaneously, speech synthesis is becoming more sophisticated and manipulable, providing the opportunity to optimize speech rate to save users time. However, little is known about people’s abilities to understand fast speech. In this work, we provide an extension of the first large-scale study on human listening rates, enlarging the prior study run with 453 participants to 1,409 participants and adding new analyses on this larger group. Run on LabintheWild, it used volunteer participants, was screen reader accessible, and measured listening rate by accuracy at answering questions spoken by a screen reader at various rates. Our results show that people who are visually impaired, who often rely on audio cues and access text aurally, generally have higher listening rates than sighted people. The findings also suggest a need to expand the range of rates available on personal devices. These results demonstrate the potential for users to learn to listen to faster rates, expanding the possibilities for human-conversational agent interaction.
ACM Transactions on Accessible Computing (TACCESS) – Association for Computing Machinery
Published: Jul 21, 2021
Keywords: Synthetic speech
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.