Einzeltreffer — DigiBib

One of the most effective ways to train animals and humans to selectively change motor behaviors is by reinforcing desired behaviors using reward or punishment. Reinforcement learning theory provides a promising framework for modeling behavioral conditioning. It subsumes that a reinforced behavioral trial biases future trials via the correlation between exploration and reward. While we know that exploration is necessary for reinforcement learning, which part of motor variability constitutes exploration and how reinforcement acts on a trial-by-trial basis to improve future behavior is currently unknown. This thesis aims to differentiate between motor exploration (used for learning via its correlation with reward) and noise (inaccessible for learning) and suggests a simple behavioral model implementing this basic reinforcement learning strategy. Songbirds such as zebra finches provide a tractable model system to study neural mechanisms underlying trial-and-error processes of reinforcement learning. The learning and the neuronal mechanisms for song learning are very similar to the mechanisms for human speech learning. Furthermore, both spectral and temporal aspects of birdsong can be modified independently by delivering real-time auditory feedback (short bursts of noise) as aversive reinforcement contingent on the trials’ pitch (fundamental frequency) or duration. First, we test whether birds require auditory feedback of the exploratory motor behavior to be able to learn to adaptively change their pitch in a targeted direction. We modify the widely used reinforcement learning paradigm; By using visual feedback (brief events of light-off) instead of auditory feedback, we can teach deaf (and hearing) zebra finches to selectively modify their syllable’s pitch. This shows that reinforcement learning is possible without evaluation of vocal performance contrary to song learning in juveniles and song maintenance in adult birds that both critically depend on auditory feedback during performance. Hence, birds do not require ...

Titel:	Inferring the trial-by-trial structure of pitch reinforcement learning in songbirds
Autor/in / Beteiligte Person:	Zai, Anja T. ; Hahnloser, Richard H.R. ; Senn, Walter ; Mante, Valerio
Link:	http://hdl.handle.net/20.500.11850/382074 https://doi.org/20.500.11850/382074 https://doi.org/10.3929/ethz-b-000382074 Volltext
Veröffentlichung:	ETH Zurich, 2019
Medientyp:	Hochschulschrift
DOI:	10.3929/ethz-b-000382074
Schlagwort:	songbird neuroscience Kalman filter variability Deafness Sensory feedback reinforcement learning info:eu-repo/classification/ddc/500 Natural sciences
Sonstiges:	Nachgewiesen in: BASE Sprachen: English Collection: ETH Zürich Research Collection Document Type: doctoral or postdoctoral thesis File Description: application/application/pdf Language: English Rights: info:eu-repo/semantics/openAccess ; http://rightsstatements.org/page/InC-NC/1.0/ ; In Copyright - Non-Commercial Use Permitted

Klicken Sie ein Format an und speichern Sie dann die Daten oder geben Sie eine Empfänger-Adresse ein und lassen Sie sich per Email zusenden.

BibTeX Citavi, JabRef, u.a.
(Literaturverwaltung)

PDF kein Volltext!
(Merkzettel, Notizen)

RIS Endnote, Citavi u.a.
(Literaturverwaltung)

MODS
(XML zur Weiterverarbeitung)

oder

Wählen Sie das für Sie passende Zitationsformat und kopieren Sie es dann in die Zwischenablage, lassen es sich per Mail zusenden oder speichern es als PDF-Datei.

Gewünschter Zitations-Stil:

oder

Bitte prüfen Sie, ob die Zitation formal korrekt ist, bevor Sie sie in einer Arbeit verwenden. Benutzen Sie gegebenenfalls den "Exportieren"-Dialog, wenn Sie ein Literaturverwaltungsprogramm verwenden und die Zitat-Angaben selbst formatieren wollen.

Inferring the trial-by-trial structure of pitch reinforcement learning in songbirds