Downes.ca ~ Stephen's Web ~ Pronunciation Assessment with Multi-modal Large Language Models

Stephen Downes

Knowledge, Learning, Community

The authors conclude (5-page PDF), "the proposed scoring systems achieve competitive results compared to the baselines on the Speechocean762 datasets." It's no surprise to me that an AI could be used to score participants on their pronunciation. What I wonder is how consistent the AI's assessments are with those of human assessors.



Stephen Downes, Casselman, Canada
stephen@downes.ca

Copyright 2024
Last Updated: Oct 01, 2024 05:19 a.m.

Creative Commons License.
