Beyond single syllables: Large-scale modeling of reading aloud with the Connectionist Dual Process (CDP++) model

Conrad Perry, Johannes C. Ziegler, Marco Zorzi

Research output: Contribution to journalArticle

189 Citations (Scopus)

Abstract

Most words in English have more than one syllable, yet the most influential computational models of reading aloud are restricted to processing monosyllabic words. Here, we present CDP++, a new version of the Connectionist Dual Process model (Perry, Ziegler, & Zorzi, 2007). CDP++ is able to simulate the reading aloud of mono- and disyllabic words and nonwords, and learns to assign stress in exactly the same way as it learns to associate graphemes with phonemes. CDP++ is able to simulate the monosyllabic benchmark effects its predecessor could, and therefore shows full backwards compatibility. CDP++ also accounts for a number of novel effects specific to disyllabic words, including the effects of stress regularity and syllable number. In terms of database performance, CDP++ accounts for over 49% of the reaction time variance on items selected from the English Lexicon Project, a very large database of several thousand of words. With its lexicon of over 32,000 words, CDP++ is therefore a notable example of the successful scaling-up of a connectionist model to a size that more realistically approximates the human lexical system.

Original languageEnglish
Pages (from-to)106-151
Number of pages46
JournalCognitive Psychology
Volume61
Issue number2
DOIs
Publication statusPublished - Sep 2010

Fingerprint

Cytidine Diphosphate
Reading
Word processing
regularity
scaling
Databases
Word Processing
Benchmarking
Neural Networks (Computer)
performance

Keywords

  • Computational modeling
  • Disyllables
  • Reading aloud
  • Word stress

ASJC Scopus subject areas

  • Experimental and Cognitive Psychology
  • Neuropsychology and Physiological Psychology
  • Developmental and Educational Psychology
  • Artificial Intelligence
  • Linguistics and Language

Cite this

Beyond single syllables : Large-scale modeling of reading aloud with the Connectionist Dual Process (CDP++) model. / Perry, Conrad; Ziegler, Johannes C.; Zorzi, Marco.

In: Cognitive Psychology, Vol. 61, No. 2, 09.2010, p. 106-151.

Research output: Contribution to journalArticle

Perry, Conrad ; Ziegler, Johannes C. ; Zorzi, Marco. / Beyond single syllables : Large-scale modeling of reading aloud with the Connectionist Dual Process (CDP++) model. In: Cognitive Psychology. 2010 ; Vol. 61, No. 2. pp. 106-151.
@article{42048120fc3f49c499606df162b77640,
title = "Beyond single syllables: Large-scale modeling of reading aloud with the Connectionist Dual Process (CDP++) model",
abstract = "Most words in English have more than one syllable, yet the most influential computational models of reading aloud are restricted to processing monosyllabic words. Here, we present CDP++, a new version of the Connectionist Dual Process model (Perry, Ziegler, & Zorzi, 2007). CDP++ is able to simulate the reading aloud of mono- and disyllabic words and nonwords, and learns to assign stress in exactly the same way as it learns to associate graphemes with phonemes. CDP++ is able to simulate the monosyllabic benchmark effects its predecessor could, and therefore shows full backwards compatibility. CDP++ also accounts for a number of novel effects specific to disyllabic words, including the effects of stress regularity and syllable number. In terms of database performance, CDP++ accounts for over 49{\%} of the reaction time variance on items selected from the English Lexicon Project, a very large database of several thousand of words. With its lexicon of over 32,000 words, CDP++ is therefore a notable example of the successful scaling-up of a connectionist model to a size that more realistically approximates the human lexical system.",
keywords = "Computational modeling, Disyllables, Reading aloud, Word stress",
author = "Conrad Perry and Ziegler, {Johannes C.} and Marco Zorzi",
year = "2010",
month = "9",
doi = "10.1016/j.cogpsych.2010.04.001",
language = "English",
volume = "61",
pages = "106--151",
journal = "Cognitive Psychology",
issn = "0010-0285",
publisher = "Academic Press Inc.",
number = "2",

}

TY - JOUR

T1 - Beyond single syllables

T2 - Large-scale modeling of reading aloud with the Connectionist Dual Process (CDP++) model

AU - Perry, Conrad

AU - Ziegler, Johannes C.

AU - Zorzi, Marco

PY - 2010/9

Y1 - 2010/9

N2 - Most words in English have more than one syllable, yet the most influential computational models of reading aloud are restricted to processing monosyllabic words. Here, we present CDP++, a new version of the Connectionist Dual Process model (Perry, Ziegler, & Zorzi, 2007). CDP++ is able to simulate the reading aloud of mono- and disyllabic words and nonwords, and learns to assign stress in exactly the same way as it learns to associate graphemes with phonemes. CDP++ is able to simulate the monosyllabic benchmark effects its predecessor could, and therefore shows full backwards compatibility. CDP++ also accounts for a number of novel effects specific to disyllabic words, including the effects of stress regularity and syllable number. In terms of database performance, CDP++ accounts for over 49% of the reaction time variance on items selected from the English Lexicon Project, a very large database of several thousand of words. With its lexicon of over 32,000 words, CDP++ is therefore a notable example of the successful scaling-up of a connectionist model to a size that more realistically approximates the human lexical system.

AB - Most words in English have more than one syllable, yet the most influential computational models of reading aloud are restricted to processing monosyllabic words. Here, we present CDP++, a new version of the Connectionist Dual Process model (Perry, Ziegler, & Zorzi, 2007). CDP++ is able to simulate the reading aloud of mono- and disyllabic words and nonwords, and learns to assign stress in exactly the same way as it learns to associate graphemes with phonemes. CDP++ is able to simulate the monosyllabic benchmark effects its predecessor could, and therefore shows full backwards compatibility. CDP++ also accounts for a number of novel effects specific to disyllabic words, including the effects of stress regularity and syllable number. In terms of database performance, CDP++ accounts for over 49% of the reaction time variance on items selected from the English Lexicon Project, a very large database of several thousand of words. With its lexicon of over 32,000 words, CDP++ is therefore a notable example of the successful scaling-up of a connectionist model to a size that more realistically approximates the human lexical system.

KW - Computational modeling

KW - Disyllables

KW - Reading aloud

KW - Word stress

UR - http://www.scopus.com/inward/record.url?scp=77954971626&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77954971626&partnerID=8YFLogxK

U2 - 10.1016/j.cogpsych.2010.04.001

DO - 10.1016/j.cogpsych.2010.04.001

M3 - Article

C2 - 20510406

AN - SCOPUS:77954971626

VL - 61

SP - 106

EP - 151

JO - Cognitive Psychology

JF - Cognitive Psychology

SN - 0010-0285

IS - 2

ER -