Prosodic segmentation and functional correlations: the case of Japanese

Spontaneous speech
Prosodic segmentation

How to Cite

Moneglia M, Cresti E. Prosodic segmentation and functional correlations: the case of Japanese. J. of Speech Sci. [Internet]. 2019 Sep. 20 [cited 2024 Jul. 27];7(2):31-50. Available from:


This paper presents a pilot study based on the NUCC corpus, aimed at verifying the consistency of the Language into Act Theory (L-AcT) for the annotation of information structure in spoken Japanese. L-AcT focuses on the perceptual relevance of prosodic breaks, foresees a strict correspondence between prosodic units and information units, and bases information structure on the unit bearing the illocutionary cues (the Comment). The model also foresees a language-independent typology of information functions. The pilot shows that the detection of terminal breaks in speech goes hand in hand with the identification of speech acts by competent speakers. The basic principles of L-AcT hold for Japanese, in particular the illocutionary definition of the Comment. The main information unit types (Topic, Parenthesis, Appendix, and Dialogic Units) also fit Japanese. Information structure turns out to be largely language independent: for instance, Japanese word order (SOV) applies within the information unit but not across information units.



This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright (c) 2019 Massimo Moneglia, Emanuela Cresti

